microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated May 1 • 384k • 1.53k
docling-project/SmolDocling-256M-preview Image-Text-to-Text • 0.3B • Updated Sep 17 • 361k • 1.59k
Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers Paper • 2503.00865 • Published Mar 2 • 64
DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking Paper • 2502.20730 • Published Feb 28 • 38