Meng Qu
mnqu
AI & ML interests
None yet
Recent Activity
upvoted a paper 6 months ago
AdaCoT: Pareto-Optimal Adaptive Chain-of-Thought Triggering via Reinforcement Learning
upvoted a paper 6 months ago
RM-R1: Reward Modeling as Reasoning
Organizations
None yet