DeepAgent: A General Reasoning Agent with Scalable Toolsets Paper β’ 2510.21618 β’ Published 8 days ago β’ 90
Scaling Language-Centric Omnimodal Representation Learning Paper β’ 2510.11693 β’ Published 19 days ago β’ 97
LightMem: Lightweight and Efficient Memory-Augmented Generation Paper β’ 2510.18866 β’ Published 11 days ago β’ 106
LongCodeZip: Compress Long Context for Code Language Models Paper β’ 2510.00446 β’ Published Oct 1 β’ 107
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models Paper β’ 2510.04618 β’ Published 26 days ago β’ 112
Spatial Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model Paper β’ 2510.12276 β’ Published 18 days ago β’ 142
Diffusion Transformers with Representation Autoencoders Paper β’ 2510.11690 β’ Published 19 days ago β’ 160
MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use Paper β’ 2509.24002 β’ Published Sep 28 β’ 170
Paper2Video: Automatic Video Generation from Scientific Papers Paper β’ 2510.05096 β’ Published 26 days ago β’ 109
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search Paper β’ 2509.25454 β’ Published Sep 29 β’ 136
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs Paper β’ 2510.11696 β’ Published 19 days ago β’ 168
Less is More: Recursive Reasoning with Tiny Networks Paper β’ 2510.04871 β’ Published 26 days ago β’ 461
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain Paper β’ 2509.26507 β’ Published Sep 30 β’ 519
FinTrust: A Comprehensive Benchmark of Trustworthiness Evaluation in Finance Domain Paper β’ 2510.15232 β’ Published 15 days ago β’ 5
Build Your Personalized Research Group: A Multiagent Framework for Continual and Interactive Science Automation Paper β’ 2510.15624 β’ Published 15 days ago β’ 14
A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning Paper β’ 2510.15444 β’ Published 15 days ago β’ 144