xts

xtsssss

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Retrieval-Infused Reasoning Sandbox: A Benchmark for Decoupling Retrieval and Reasoning Capabilities

upvoted a paper 1 day ago

BABE: Biology Arena BEnchmark

upvoted a paper 8 days ago

ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation

authored a paper 5 months ago

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

Paper • 2508.17445 • Published Aug 24, 2025 • 80

authored 3 papers 7 months ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20, 2025 • 107

Scaling Test-time Compute for LLM Agents

Paper • 2506.12928 • Published Jun 15, 2025 • 63

First Return, Entropy-Eliciting Explore

Paper • 2507.07017 • Published Jul 9, 2025 • 24