2 20

xts

xtsssss

AI & ML interests

None yet

Recent Activity

upvoted a paper 10 days ago

Retrieval-Infused Reasoning Sandbox: A Benchmark for Decoupling Retrieval and Reasoning Capabilities

upvoted a paper 10 days ago

BABE: Biology Arena BEnchmark

upvoted a paper 17 days ago

ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation

View all activity

Organizations

upvoted 2 papers 10 days ago

Retrieval-Infused Reasoning Sandbox: A Benchmark for Decoupling Retrieval and Reasoning Capabilities

Paper • 2601.21937 • Published 18 days ago • 19

BABE: Biology Arena BEnchmark

Paper • 2602.05857 • Published 11 days ago • 10

upvoted a paper 17 days ago

ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation

Paper • 2601.21420 • Published 18 days ago • 42

updated a dataset about 2 months ago

xtsssss/deepwiki_case

Viewer • Updated Dec 31, 2025 • 1.85M • 31

published a dataset about 2 months ago

xtsssss/deepwiki_case

Viewer • Updated Dec 31, 2025 • 1.85M • 31

upvoted a paper 2 months ago

DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle

Paper • 2512.04324 • Published Dec 3, 2025 • 154

upvoted a paper 3 months ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 296

updated a dataset 4 months ago

xtsssss/acb_sampling32

Viewer • Updated Oct 21, 2025 • 196k • 36

published 3 datasets 4 months ago

updated a dataset 4 months ago

xtsssss/acb

Viewer • Updated Oct 15, 2025 • 357k • 45

upvoted 2 papers 5 months ago

Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation

Paper • 2509.25849 • Published Sep 30, 2025 • 48

FinSearchComp: Towards a Realistic, Expert-Level Evaluation of Financial Search and Reasoning

Paper • 2509.13160 • Published Sep 16, 2025 • 29

upvoted a paper 6 months ago

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published Sep 2, 2025 • 125

authored a paper 6 months ago

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

Paper • 2508.17445 • Published Aug 24, 2025 • 80

upvoted 4 papers 6 months ago

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published Sep 2, 2025 • 84

MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents

Paper • 2508.13186 • Published Aug 14, 2025 • 19

FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction

Paper • 2508.11987 • Published Aug 16, 2025 • 71

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published Aug 6, 2025 • 129

xts

AI & ML interests

Recent Activity

Organizations

xtsssss's activity