Generalization or Memorization: Dynamic Decoding for Mode Steering Paper • 2510.22099 • Published 6 days ago • 3
ReST-RL: Achieving Accurate Code Reasoning of LLMs with Optimized Self-Training and Decoding Paper • 2508.19576 • Published Aug 27 • 2
Agent-SafetyBench: Evaluating the Safety of LLM Agents Paper • 2412.14470 • Published Dec 19, 2024 • 13
MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems Paper • 2505.18943 • Published May 25 • 24
Seeker: Towards Exception Safety Code Generation with Intermediate Language Agents Framework Paper • 2412.11713 • Published Dec 16, 2024 • 6
Seeker: Enhancing Exception Handling in Code with LLM-based Multi-Agent Approach Paper • 2410.06949 • Published Oct 9, 2024 • 6