PaddlePaddle's Collections
PaddleOCR-VL
PP-StructureV3
PP-OCRv5
PP-OCRv4
PP-OCRv3

PaddleOCR-VL

updated 9 days ago

Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

Upvote
16

  • PaddlePaddle/PaddleOCR-VL

    Image-Text-to-Text • 1.0B params • Updated 3 days ago • 15.6k downloads • 1.08k likes

  • Running
    136

    PaddleOCR-VL Online Demo

    📈

    Recognize text and elements in images


  • PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

    Paper • 2510.14528 • Published 10 days ago • 67 upvotes