Submitted by
lulululuyi
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
R-Horizon: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?
VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications