VLM For OCR
Text Generation
• Updated • 119k
• 279
Image-to-Text
• 1B • Updated • 388
• 35
Text Generation
• 18B • Updated • 182
• 68
openbmb/MiniCPM-Llama3-V-2_5
Image-Text-to-Text
• 9B • Updated • 3.93k
• 1.41k
google/paligemma-3b-pt-896
Image-Text-to-Text
• 3B • Updated • 704
• 124
UCSC-VLAA/Recap-DataComp-1B
• Updated • 1.88B • 12k
• 199
WildVision: Evaluating Vision-Language Models in the Wild with Human Preferences
Paper
• 2406.11069
• Published • 14
pbevan11/synthetic-ocr-correction-gpt4o
• Updated • 10k • 128
• 6
yifeihu/ACL-23-Paper-OCR-Markdown
• Updated • 2.15k • 45
• 19
LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs
Paper
• 2406.15319
• Published • 64