CiteGuard: Faithful Citation Attribution for LLMs via Retrieval-Augmented Validation • Paper • 2510.17853 • Published Oct 15
UltraHorizon: Benchmarking Agent Capabilities in Ultra Long-Horizon Scenarios • Paper • 2509.21766 • Published Sep 26
Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers • Paper • 2506.23918 • Published Jun 30