Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward Paper • 2510.03222 • Published 30 days ago • 45
Cogito, Ergo Ludo: An Agent that Learns to Play by Reasoning and Planning Paper • 2509.25052 • Published Sep 29 • 4
Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward Paper • 2510.03222 • Published 30 days ago • 45
Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward Paper • 2510.03222 • Published 30 days ago • 45 • 2
PaSa: An LLM Agent for Comprehensive Academic Paper Search Paper • 2501.10120 • Published Jan 17 • 52
Are AI-Generated Text Detectors Robust to Adversarial Perturbations? Paper • 2406.01179 • Published Jun 3, 2024
PaSa: An LLM Agent for Comprehensive Academic Paper Search Paper • 2501.10120 • Published Jan 17 • 52
AGILE: A Novel Reinforcement Learning Framework of LLM Agents Paper • 2405.14751 • Published May 23, 2024
ArtifactsBench: Bridging the Visual-Interactive Gap in LLM Code Generation Evaluation Paper • 2507.04952 • Published Jul 7 • 9
Adaptive Termination for Multi-round Parallel Reasoning: An Universal Semantic Entropy-Guided Framework Paper • 2507.06829 • Published Jul 9