TrialPanorama: Database and Benchmark for Systematic Review and Design of Clinical Trials Paper • 2505.16097 • Published May 22
Tokenizing Single-Channel EEG with Time-Frequency Motif Learning Paper • 2502.16060 • Published Feb 22
SafeSwitch: Steering Unsafe LLM Behavior via Internal Activation Signals Paper • 2502.01042 • Published Feb 3 • 1
A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence Paper • 2507.21046 • Published Jul 28 • 81
UserBench: An Interactive Gym Environment for User-Centric Agents Paper • 2507.22034 • Published Jul 29 • 29
LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering Paper • 2509.09614 • Published Sep 11 • 7
UserRL: Training Interactive User-Centric Agent via Reinforcement Learning Paper • 2509.19736 • Published Sep 24 • 11
BAP v2: An Enhanced Task Framework for Instruction Following in Minecraft Dialogues Paper • 2501.10836 • Published Jan 18 • 1
Uncertainty in Action: Confidence Elicitation in Embodied Agents Paper • 2503.10628 • Published Mar 13
Part$^{2}$GS: Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting Paper • 2506.17212 • Published Jun 20
Can Vision Language Models Infer Human Gaze Direction? A Controlled Study Paper • 2506.05412 • Published Jun 4 • 4
ReFoCUS: Reinforcement-guided Frame Optimization for Contextual Understanding Paper • 2506.01274 • Published Jun 2 • 3
s3: You Don't Need That Much Data to Train a Search Agent via RL Paper • 2505.14146 • Published May 20 • 18
CREATOR: Disentangling Abstract and Concrete Reasonings of Large Language Models through Tool Creation Paper • 2305.14318 • Published May 23, 2023
Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents Paper • 2402.09205 • Published Feb 14, 2024
Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance Paper • 2410.12361 • Published Oct 16, 2024