From What to Why: A Multi-Agent System for Evidence-based Chemical Reaction Condition Reasoning Paper • 2509.23768 • Published Sep 28 • 49
LLMs Learn to Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions Paper • 2510.08211 • Published 23 days ago • 22
Memory Retrieval and Consolidation in Large Language Models through Function Tokens Paper • 2510.08203 • Published 23 days ago • 7
A^2Search: Ambiguity-Aware Question Answering with Reinforcement Learning Paper • 2510.07958 • Published 23 days ago • 4
Cache-to-Cache: Direct Semantic Communication Between Large Language Models Paper • 2510.03215 • Published 29 days ago • 93
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published 26 days ago • 461