112 84 78

Hugo Laurençon

HugoLaurencon

HugoLaurencon

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago

Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance

upvoted a paper 22 days ago

ChartM^3: A Multi-Stage Code-Driven Pipeline for Constructing Multi-Dimensional and Multi-Step Visual Reasoning Data in Chart Comprehension

View all activity

Organizations

upvoted a paper 9 days ago

Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance

Paper • 2511.13254 • Published 10 days ago • 130

upvoted a paper 22 days ago

ChartM^3: A Multi-Stage Code-Driven Pipeline for Constructing Multi-Dimensional and Multi-Step Visual Reasoning Data in Chart Comprehension

Paper • 2511.02415 • Published 23 days ago • 4

upvoted 2 papers about 2 months ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6 • 487

Quantile Advantage Estimation for Entropy-Safe Reasoning

Paper • 2509.22611 • Published Sep 26 • 117

upvoted 2 papers 2 months ago

ARE: Scaling Up Agent Environments and Evaluations

Paper • 2509.17158 • Published Sep 21 • 35

Visual-TableQA: Open-Domain Benchmark for Reasoning over Table Images

Paper • 2509.07966 • Published Sep 9 • 4

upvoted 2 papers 3 months ago

ΔL Normalization: Rethink Loss Aggregation in RLVR

Paper • 2509.07558 • Published Sep 9 • 7

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Paper • 2508.14444 • Published Aug 20 • 37

upvoted 2 papers 4 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 311

Scaling Laws for Optimal Data Mixtures

Paper • 2507.09404 • Published Jul 12 • 36

upvoted a paper 5 months ago

Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

Paper • 2506.14245 • Published Jun 17 • 44

upvoted 2 papers 6 months ago

REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Paper • 2505.24760 • Published May 30 • 74

Universal Reasoner: A Single, Composable Plug-and-Play Reasoner for Frozen LLMs

Paper • 2505.19075 • Published May 25 • 21

upvoted 2 papers 7 months ago

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published May 14 • 98

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18 • 136

upvoted 2 papers 8 months ago

Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models

Paper • 2504.07951 • Published Apr 10 • 30

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

Paper • 2504.07096 • Published Apr 9 • 76

upvoted a collection 8 months ago

Llama 4

Collection

Llama 4 release • 13 items • Updated Apr 29 • 663

upvoted a paper 8 months ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 141

upvoted an article 9 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

•

1.31k

Hugo Laurençon

AI & ML interests

Recent Activity

Organizations

HugoLaurencon's activity

Open-source DeepResearch – Freeing our search agents