arxiv:2509.25779
Jason Zhu
Jason131313
AI & ML interests
None yet
Recent Activity
authored
a paper
about 2 months ago
Planner-R1: Reward Shaping Enables Efficient Agentic RL with Smaller
LLMs
upvoted
a
paper
about 2 months ago
Planner-R1: Reward Shaping Enables Efficient Agentic RL with Smaller
LLMs
liked
a Space
4 months ago
osunlp/TravelPlannerLeaderboard
Organizations
None yet