Article Promoter-GPT: Writing DNA Instructions with Language Models By hugging-science • 5 days ago • 19
Article Scaling Test-Time Compute to Achieve Gold Medal at IOI 2025 with Open-Weight Models By nvidia and 3 others • 7 days ago • 14
Article FineWeb-C: A Community-Driven Dataset for Educational Quality Annotations in 122 Languages By davanstrien and 5 others • Jul 8 • 31
Article Explore, Build, and Innovate AI Reasoning with NVIDIA's Open Models and Recipes By nvidia and 2 others • Jun 4 • 21
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper • 2506.01844 • Published Jun 2 • 140
SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published Apr 7 • 200
Unified Reward Model for Multimodal Understanding and Generation Paper • 2503.05236 • Published Mar 7 • 123
Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality Mar 4 • 77
Cohere Labs Aya Vision Collection: Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated Jul 31 • 70
CHASE Collection: Generate challenging synthetic data to evaluate LLMs • 5 items • Updated Feb 21 • 4
How to Get Your LLM to Generate Challenging Problems for Evaluation Paper • 2502.14678 • Published Feb 20 • 18
MMTEB: Massive Multilingual Text Embedding Benchmark Paper • 2502.13595 • Published Feb 19 • 41
From Tools to Teammates: Evaluating LLMs in Multi-Session Coding Interactions Paper • 2502.13791 • Published Feb 19 • 5
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published Jan 28 • 123
The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper • 2501.07301 • Published Jan 13 • 99