facebook/vjepa2-vitl-fpc64-256 Video Classification • 0.3B • Updated Aug 11, 2025 • 79.8k • 169
ibm-granite/granite-docling-258M Image-Text-to-Text • 0.3B • Updated Sep 23, 2025 • 195k • 1.07k
Runtime error • 36 • Multimodal RAG with Granite Vision • RAG example using Granite [vision, embedding, instruct]
Running on Zero • Featured • 259 • granite-docling-258M demo • Convert images to structured text and answer questions
docling-project/SmolDocling-256M-preview Image-Text-to-Text • 0.3B • Updated Sep 17, 2025 • 55.7k • 1.6k
Running on A100 • 220 • Omnilingual ASR Media Transcription • Transcribe audio or video into text in any language