view article Article Llama 3.1 - 405B, 70B & 8B with multilinguality and long context +6 Jul 23, 2024 • 241
Baseer: A Vision-Language Model for Arabic Document-to-Markdown OCR Paper • 2509.18174 • Published Sep 17, 2025 • 128
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 262
Step-Audio-R1 Collection Step-Audio-R1 is the first audio language model to successfully unlock test-time compute scaling. • 3 items • Updated Nov 21, 2025 • 15
Olmo 3 Post-training Collection All artifacts for post-training Olmo 3. Datasets follow the model that resulted from training on them. • 32 items • Updated 9 days ago • 46
Olmo 3 Pre-training Collection All artifacts related to Olmo 3 pre-training • 10 items • Updated 9 days ago • 32
view article Article Building for an Open Future - our new partnership with Google Cloud Nov 13, 2025 • 47