LLaDA2.0: Scaling Up Diffusion Language Models to 100B Paper • 2512.15745 • Published 26 days ago • 78
Nemotron-Cascade: Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models Paper • 2512.13607 • Published 20 days ago • 28
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 6 items • Updated 5 days ago • 111
Nemotron-Cascade Collection Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 18 items • Updated 4 days ago • 41
Olmo 3.1 Collection The latest members of the Olmo 3 family: another 3 weeks of RL for 32B Think, the 32B Instruct model, large post-training research datasets... • 9 items • Updated 12 days ago • 42
CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning Paper • 2512.02551 • Published Dec 2, 2025 • 12
XVLA Collection X-VLA is a soft-prompted Transformer for cross-embodiment robot learning • 6 items • Updated Dec 4, 2025 • 11
Open X-Embodiment Collection Datasets from Open X-Embodiment (OXE) in LeRobot dataset format • 57 items • Updated Oct 2, 2025 • 8
DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning Paper • 2511.22570 • Published Nov 27, 2025 • 85
Olmo 3 Post-training Collection All artifacts for post-training Olmo 3. Datasets follow the model that resulted from training on them. • 32 items • Updated 12 days ago • 46
SwallowCode-v2 Collection Apache-2.0 Open High Quality Code Corpus • 31 items • Updated Nov 5, 2025 • 1
SwallowMath-v2 Collection Apache-2.0 Open High Quality Math Corpus • 16 items • Updated Nov 4, 2025 • 1
view article Article SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data +7 Jun 3, 2025 • 305
World Simulation with Video Foundation Models for Physical AI Paper • 2511.00062 • Published Oct 28, 2025 • 40