DynaAct: Large Language Model Reasoning with Dynamic Action Spaces Paper • 2511.08043 • Published 12 days ago • 5
Beyond English: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs Paper • 2511.07003 • Published 12 days ago • 31
Walking the Tightrope of LLMs for Software Development: A Practitioners' Perspective Paper • 2511.06428 • Published 13 days ago • 4
Adaptive Multi-Agent Response Refinement in Conversational Systems Paper • 2511.08319 • Published 11 days ago • 39
VideoSSR: Video Self-Supervised Reinforcement Learning Paper • 2511.06281 • Published 14 days ago • 21
The Path Not Taken: RLVR Provably Learns Off the Principals Paper • 2511.08567 • Published 11 days ago • 27
Beyond Fact Retrieval: Episodic Memory for RAG with Generative Semantic Workspaces Paper • 2511.07587 • Published 12 days ago • 8
KLASS: KL-Guided Fast Inference in Masked Diffusion Models Paper • 2511.05664 • Published 15 days ago • 35
Wasm: A Pipeline for Constructing Structured Arabic Interleaved Multimodal Corpora Paper • 2511.07080 • Published 12 days ago • 31
Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B Paper • 2511.06221 • Published 14 days ago • 114
HaluMem: Evaluating Hallucinations in Memory Systems of Agents Paper • 2511.03506 • Published 17 days ago • 88
CritiCal: Can Critique Help LLM Uncertainty or Confidence Calibration? Paper • 2510.24505 • Published 25 days ago • 3
HAFixAgent: History-Aware Automated Program Repair Agent Paper • 2511.01047 • Published 20 days ago • 3
VeriCoT: Neuro-symbolic Chain-of-Thought Validation via Logical Consistency Checks Paper • 2511.04662 • Published 16 days ago • 34
Too Good to be Bad: On the Failure of LLMs to Role-Play Villains Paper • 2511.04962 • Published 16 days ago • 50