63 38 173

Chujie Zheng

chujiezheng

https://chujiezheng.github.io/

AI & ML interests

Large Language Models

Recent Activity

upvoted a paper about 1 month ago

Soft Adaptive Policy Optimization

authored a paper about 1 month ago

Soft Adaptive Policy Optimization

authored a paper about 1 month ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

View all activity

Organizations

authored 2 papers about 1 month ago

Soft Adaptive Policy Optimization

Paper • 2511.20347 • Published Nov 25, 2025 • 41

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 95

authored a paper 5 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 316

authored a paper 6 months ago

A Survey on Latent Reasoning

Paper • 2507.06203 • Published Jul 8, 2025 • 93

authored 2 papers 7 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2, 2025 • 187

BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs

Paper • 2505.13529 • Published May 18, 2025 • 11

authored 2 papers 8 months ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14, 2025 • 320

WorldPM: Scaling Human Preference Modeling

Paper • 2505.10527 • Published May 15, 2025 • 34

authored 2 papers 11 months ago

Aligning Instruction Tuning with Pre-training

Paper • 2501.09368 • Published Jan 16, 2025

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20, 2025 • 106

authored a paper 12 months ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13, 2025 • 99

authored 2 papers about 1 year ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 85

Yi-Lightning Technical Report

Paper • 2412.01253 • Published Dec 2, 2024 • 28

authored 2 papers over 1 year ago

Weak-to-Strong Extrapolation Expedites Alignment

Paper • 2404.16792 • Published Apr 25, 2024 • 11

Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks

Paper • 2407.02855 • Published Jul 3, 2024 • 12

authored a paper almost 2 years ago

Prompt-Driven LLM Safeguarding via Directed Representation Optimization

Paper • 2401.18018 • Published Jan 31, 2024 • 1

authored 4 papers over 2 years ago

CASE: Aligning Coarse-to-Fine Cognition and Affection for Empathetic Response Generation

Paper • 2208.08845 • Published Aug 18, 2022

CEM: Commonsense-aware Empathetic Response Generation

Paper • 2109.05739 • Published Sep 13, 2021

PsyQA: A Chinese Dataset for Generating Long Counseling Text for Mental Health Support

Paper • 2106.01702 • Published Jun 3, 2021

On Large Language Models' Selection Bias in Multi-Choice Questions

Paper • 2309.03882 • Published Sep 7, 2023

Chujie Zheng

AI & ML interests

Recent Activity

Organizations

chujiezheng's activity