Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 42
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
Safetensors
ONNX
GGUF
Transformers.js
MLX
Keras
+ 41
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 13
Inference Providers
Groq
Novita
Nebius AI
Cerebras
SambaNova
Nscale
fal
Hyperbolic
+ 9
Apply filters
Models
9,038
Full-text search
Inference Available
Edit filters
Sort: Trending
Active filters:
image-to-text
Clear all
shreyanshu09/block_diagram_global_information
Image-to-Text
•
Updated
Jun 3, 2024
•
7
•
3
lenamerkli/ingredient-scanner
Image-to-Text
•
0.5B
•
Updated
Jul 22, 2024
•
68
•
4
U4R/StructTable-InternVL2-1B
Image-to-Text
•
0.9B
•
Updated
Dec 12, 2024
•
1.68k
•
41
kazars24/trocr-base-handwritten-ru
Image-to-Text
•
0.3B
•
Updated
Oct 27, 2024
•
19.5k
•
15
onnx-community/Qwen2-VL-2B-Instruct
Image-to-Text
•
Updated
Mar 6
•
65
•
11
Bllossom/llama-3.2-Korean-Bllossom-AICA-5B
Image-to-Text
•
5B
•
Updated
Mar 14
•
403
•
94
HuggingFaceTB/SmolVLM-256M-Base
Image-to-Text
•
0.3B
•
Updated
Jan 20
•
6.35k
•
17
allenai/olmOCR-7B-0225-preview
Image-to-Text
•
8B
•
Updated
Aug 19
•
11.1k
•
703
qualcomm/EasyOCR
Image-to-Text
•
Updated
1 day ago
•
843
•
32
Alyon-AI/UMA-VLM-Engine-v1
Image-to-Text
•
8B
•
Updated
Mar 16
•
4
•
2
Muizzzz8/phi3-prescription-reader
Image-to-Text
•
Updated
Jun 20
•
84
•
1
scb10x/typhoon-ocr-7b
Image-to-Text
•
8B
•
Updated
Jul 11
•
23k
•
74
PaddlePaddle/PP-OCRv5_mobile_det
Image-to-Text
•
Updated
Jul 22
•
38.3k
•
15
PaddlePaddle/PP-OCRv5_server_rec
Image-to-Text
•
Updated
Jul 22
•
78.8k
•
16
PaddlePaddle/PP-OCRv5_mobile_rec
Image-to-Text
•
Updated
Jul 22
•
9.23k
•
7
PaddlePaddle/PP-OCRv4_server_seal_det
Image-to-Text
•
Updated
Jul 22
•
1.25k
•
1
PaddlePaddle/PP-OCRv4_server_det
Image-to-Text
•
Updated
Jul 22
•
1.19k
•
1
PaddlePaddle/PP-DocLayout_plus-L
Image-to-Text
•
Updated
Jul 22
•
9.46k
•
10
PaddlePaddle/PP-OCRv4_server_rec_doc
Image-to-Text
•
Updated
Jul 22
•
1.04k
•
1
PaddlePaddle/PP-DocBee-2B
Image-to-Text
•
Updated
Aug 27
•
36
•
1
PaddlePaddle/PP-OCRv4_server_rec
Image-to-Text
•
Updated
Jul 22
•
478
•
1
l0wgear/manga-ocr-2025-onnx
Image-to-Text
•
Updated
Jun 30
•
246
•
4
PaddlePaddle/korean_PP-OCRv5_mobile_rec
Image-to-Text
•
Updated
Jul 22
•
2.83k
•
9
RinNguyen103/Vietnamese-Image-Captioning
Image-to-Text
•
Updated
Aug 13
•
2
allenai/olmOCR-7B-0825-FP8
Image-to-Text
•
8B
•
Updated
7 days ago
•
139k
•
9
WeightedAI/Persian_OCR
Image-to-Text
•
Updated
about 1 month ago
•
119
•
7
facebook/DepthLM
Image-to-Text
•
13B
•
Updated
29 days ago
•
432
•
18
CodeGoat24/UniGenBench-EvalModel-qwen-72b-v1
Image-to-Text
•
73B
•
Updated
5 days ago
•
42
•
3
AhmedZaky1/DIMI-Arabic-OCR
Image-to-Text
•
Updated
21 days ago
•
2
mradermacher/Nanonets-OCR2-3B-GGUF
Image-to-Text
•
3B
•
Updated
16 days ago
•
9.31k
•
12
Previous
1
2
3
4
...
100
Next