Building on HF

10 22 4

Bo Liu

Benjamin-eecs

https://benjamin-eecs.github.io/

AI & ML interests

Reinforcement Learning, Reasoning, Machine Learning Systems

Recent Activity

liked a dataset 8 days ago

facebook/principia-bench

upvoted a paper 24 days ago

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

liked a dataset about 2 months ago

facebook/principia-collection

View all activity

Organizations

liked a dataset 8 days ago

facebook/principia-bench

Viewer • Updated 14 days ago • 2.24k • 690 • 9

upvoted a paper 24 days ago

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

Paper • 2512.07783 • Published 25 days ago • 36

liked a dataset about 2 months ago

facebook/principia-collection

Viewer • Updated 13 days ago • 554k • 430 • 39

authored a paper about 2 months ago

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published Nov 5, 2025 • 81

upvoted 2 papers about 2 months ago

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published Nov 5, 2025 • 81

ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation

Paper • 2511.01163 • Published Nov 3, 2025 • 31

authored a paper 2 months ago

SPICE: Self-Play In Corpus Environments Improves Reasoning

Paper • 2510.24684 • Published Oct 28, 2025 • 17

upvoted a paper 2 months ago

SPICE: Self-Play In Corpus Environments Improves Reasoning

Paper • 2510.24684 • Published Oct 28, 2025 • 17

authored a paper 3 months ago

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

Paper • 2510.08697 • Published Oct 9, 2025 • 36

upvoted a paper 3 months ago

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

Paper • 2510.08697 • Published Oct 9, 2025 • 36

authored a paper 3 months ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 270

upvoted 2 papers 3 months ago

Large Reasoning Models Learn Better Alignment from Flawed Thinking

Paper • 2510.00938 • Published Oct 1, 2025 • 58

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 270

liked a Space 3 months ago

BigCodeArena

🚀

Compare two AI models by sending them code and seeing their responses

authored a paper 3 months ago

GEM: A Gym for Agentic LLMs

Paper • 2510.01051 • Published Oct 1, 2025 • 89

upvoted a paper 3 months ago

GEM: A Gym for Agentic LLMs

Paper • 2510.01051 • Published Oct 1, 2025 • 89

commented a paper 3 months ago

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Paper • 2506.24119 • Published Jun 30, 2025 • 50 •

authored 2 papers 3 months ago

The Era of Real-World Human Interaction: RL from User Conversations

Paper • 2509.25137 • Published Sep 29, 2025 • 18

Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play

Paper • 2509.25541 • Published Sep 29, 2025 • 140

commented a paper 3 months ago

Who invented deep residual learning?

Paper • 2509.24732 • Published Sep 29, 2025 • 4 •

Bo Liu

AI & ML interests

Recent Activity

Organizations

Benjamin-eecs's activity

BigCodeArena