Nemotron Elastic: Towards Efficient Many-in-One Reasoning LLMs Paper • 2511.16664 • Published 3 days ago • 17
Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance Paper • 2511.13254 • Published 6 days ago • 126
P1: Mastering Physics Olympiads with Reinforcement Learning Paper • 2511.13612 • Published 6 days ago • 127
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling Paper • 2511.11793 • Published 9 days ago • 150
Agent READMEs: An Empirical Study of Context Files for Agentic Coding Paper • 2511.12884 • Published 6 days ago • 5
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning Paper • 2511.14460 • Published 5 days ago • 15
A Style is Worth One Code: Unlocking Code-to-Style Image Generation with Discrete Style Space Paper • 2511.10555 • Published 10 days ago • 52
Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models Paper • 2511.08577 • Published 12 days ago • 95
miniF2F-Lean Revisited: Reviewing Limitations and Charting a Path Forward Paper • 2511.03108 • Published 18 days ago • 2
From Proof to Program: Characterizing Tool-Induced Reasoning Hallucinations in Large Language Models Paper • 2511.10899 • Published 9 days ago • 2
Agentic Refactoring: An Empirical Study of AI Coding Agents Paper • 2511.04824 • Published 16 days ago • 4
Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds Paper • 2511.08892 • Published 11 days ago • 175
The Path Not Taken: RLVR Provably Learns Off the Principals Paper • 2511.08567 • Published 12 days ago • 27
MathSE: Improving Multimodal Mathematical Reasoning via Self-Evolving Iterative Reflection and Reward-Guided Fine-Tuning Paper • 2511.06805 • Published 13 days ago • 12
LoopTool: Closing the Data-Training Loop for Robust LLM Tool Calls Paper • 2511.09148 • Published 11 days ago • 15
LUT-LLM: Efficient Large Language Model Inference with Memory-based Computations on FPGAs Paper • 2511.06174 • Published 14 days ago • 5
Reinforcement Learning Improves Traversal of Hierarchical Knowledge in LLMs Paper • 2511.05933 • Published 15 days ago • 7
Generating an Image From 1,000 Words: Enhancing Text-to-Image With Structured Captions Paper • 2511.06876 • Published 13 days ago • 22