2 13 1

shawnxzhu

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Reinforcement Learning Elicits Contextual Learning of Unseen Language Translation

authored a paper 20 days ago

EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL

submitted a paper 20 days ago

EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL

View all activity

Organizations

upvoted a paper 4 days ago

Reinforcement Learning Elicits Contextual Learning of Unseen Language Translation

Paper • 2606.06428 • Published 5 days ago • 23

authored a paper 20 days ago

EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL

Paper • 2605.18703 • Published 22 days ago • 50

submitted a paper to Daily Papers 20 days ago

EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL

Paper • 2605.18703 • Published 22 days ago • 50

upvoted a paper 20 days ago

EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL

Paper • 2605.18703 • Published 22 days ago • 50

upvoted a paper about 2 months ago

A Self-Evolving Framework for Efficient Terminal Agents via Observational Context Compression

Paper • 2604.19572 • Published Apr 21 • 23

upvoted 2 papers 3 months ago

ACE: Attribution-Controlled Knowledge Editing for Multi-hop Factual Recall

Paper • 2510.07896 • Published Oct 9, 2025 • 11

On Data Engineering for Scaling LLM Terminal Capabilities

Paper • 2602.21193 • Published Feb 24 • 103

authored 2 papers 4 months ago

CHARM: Calibrating Reward Models With Chatbot Arena Scores

Paper • 2504.10045 • Published Apr 14, 2025

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

Paper • 2602.17684 • Published Feb 4 • 22

updated a collection 4 months ago

CodeScaler

Collection

5 items • Updated Mar 2 • 6

upvoted a paper 4 months ago

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

Paper • 2602.17684 • Published Feb 4 • 22

upvoted a collection 4 months ago

CodeScaler

Collection

5 items • Updated Mar 2 • 6

published 3 models 4 months ago

published a dataset 4 months ago

LARK-Lab/CodeScalerPair-51K

Viewer • Updated Feb 23 • 51.1k • 37 • 1

updated a dataset 4 months ago

LARK-Lab/CodeScalerPair-51K

Viewer • Updated Feb 23 • 51.1k • 37 • 1

updated 3 models 4 months ago

LARK-Lab/CodeScaler-8B

Text Classification • 8B • Updated Feb 23 • 6

LARK-Lab/CodeScaler-4B

Text Classification • 4B • Updated Feb 23 • 7

LARK-Lab/CodeScaler-1.7B

Text Classification • 2B • Updated Feb 23 • 79

shawnxzhu

AI & ML interests

Recent Activity

Organizations

shawnxzhu's activity