arxiv:2506.04734
Xiaoqi Jian
mx1024
ยท
AI & ML interests
None yet
Recent Activity
liked
a model
about 2 months ago
miromind-ai/MiroThinker-32B-DPO-v0.2
upvoted
a
paper
about 2 months ago
SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn
Tool-Integrated Reasoning
authored
a paper
5 months ago
Stress Testing Generalization: How Minor Modifications Undermine Large
Language Model Performance