arxiv:2507.06804
Linfeng Song
freesunshine0316
ยท
AI & ML interests
Researcher @Tencent AI Lab working on reasoning and RLAIF with LLM, especially search + RL. Working on NLP since 2010.
Recent Activity
upvoted
a
paper
4 days ago
Every Question Has Its Own Value: Reinforcement Learning with Explicit
Human Values
upvoted
a
paper
24 days ago
VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal
Reasoning
upvoted
a
paper
24 days ago
CLUE: Non-parametric Verification from Experience via Hidden-State
Clustering