The Station: An Open-World Environment for AI-Driven Discovery Paper • 2511.06309 • Published 23 days ago • 35
SPA-RL: Reinforcing LLM Agents via Stepwise Progress Attribution Paper • 2505.20732 • Published May 27 • 1
STeCa: Step-level Trajectory Calibration for LLM Agent Learning Paper • 2502.14276 • Published Feb 20 • 1
E2CL: Exploration-based Error Correction Learning for Embodied Agents Paper • 2409.03256 • Published Sep 5, 2024 • 1
Why Safeguarded Ships Run Aground? Aligned Large Language Models' Safety Mechanisms Tend to Be Anchored in The Template Region Paper • 2502.13946 • Published Feb 19 • 10