wgq's picture

2

wgq

wwggqq

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

MatchTIR: Fine-Grained Supervision for Tool-Integrated Reasoning via Bipartite Matching

upvoted a paper 3 months ago

Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents

View all activity

Organizations

None yet

upvoted a paper 4 days ago

MatchTIR: Fine-Grained Supervision for Tool-Integrated Reasoning via Bipartite Matching

Paper • 2601.10712 • Published 4 days ago • 23

upvoted a paper 3 months ago

Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents

Paper • 2510.14967 • Published Oct 16, 2025 • 33