Bk9x's Collections

VLM + OCR

updated 1 day ago

  • 5CD-AI/Vintern-1B-v2

    Image-Text-to-Text • 0.9B • Updated Jan 17, 2025 • 644 • 80

  • erax-ai/EraX-VL-7B-V1.0

    Image-Text-to-Text • 8B • Updated Jan 15, 2025 • 237 • 43

  • 📝 granite-docling-258M demo

    Space • Running on Zero • Featured • 267

    Convert images to structured text and answer questions

  • datalab-to/chandra

    Image-to-Text • 9B • Updated Oct 21, 2025 • 547k • 468

  • deepseek-ai/DeepSeek-OCR

    Image-Text-to-Text • 3B • Updated Nov 4, 2025 • 2.98M • 3.12k

  • 🌖 Multimodal OCR3

    Space • Running on Zero • MCP • 60

    nanonets2-ocr / chandra-ocr / dots.ocr / olm-ocr2

  • lightonai/LightOnOCR-2-1B

    Image-Text-to-Text • 1B • Updated 1 day ago • 28.7k • 450

  • HuggingFaceFW/finepdfs

    Viewer • Updated 22 days ago • 476M • 33.7k • 810