Bo Liu

Benjamin-eecs

https://benjamin-eecs.github.io/

AI & ML interests

Reinforcement Learning, Reasoning, Machine Learning Systems

Recent Activity

liked a dataset 7 days ago

facebook/principia-bench

upvoted a paper 23 days ago

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

liked a dataset about 2 months ago

facebook/principia-collection

authored a paper about 2 months ago

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published Nov 5, 2025 • 81

authored a paper 2 months ago

SPICE: Self-Play In Corpus Environments Improves Reasoning

Paper • 2510.24684 • Published Oct 28, 2025 • 17

authored 5 papers 3 months ago

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

Paper • 2510.08697 • Published Oct 9, 2025 • 36

authored a paper 4 months ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published Aug 31, 2025 • 84

authored a paper 6 months ago

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Paper • 2506.24119 • Published Jun 30, 2025 • 50

authored a paper 9 months ago

TextArena

Paper • 2504.11442 • Published Apr 15, 2025 • 30

authored a paper 10 months ago

EnvPool: A Highly Parallel Reinforcement Learning Environment Execution Engine

Paper • 2206.10558 • Published Jun 21, 2022 • 2

authored a paper about 1 year ago

Natural Language Reinforcement Learning

Paper • 2411.14251 • Published Nov 21, 2024 • 31

authored 3 papers over 1 year ago

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Paper • 2408.08152 • Published Aug 15, 2024 • 60

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

Paper • 2405.04434 • Published May 7, 2024 • 24

DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data

Paper • 2405.14333 • Published May 23, 2024 • 42

authored 2 papers almost 2 years ago

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Paper • 2403.05525 • Published Mar 8, 2024 • 48

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Paper • 2401.02954 • Published Jan 5, 2024 • 51
