SNLP: Layer-Parallel Inference via Structured Newton Corrections Paper • 2605.17842 • Published May 18 • 5
SlimQwen: Exploring the Pruning and Distillation in Large MoE Model Pre-training Paper • 2605.08738 • Published May 9 • 13
S2D2: Fast Decoding for Diffusion LLMs via Training-Free Self-Speculation Paper • 2603.25702 • Published Mar 26 • 8
MosaicMem: Hybrid Spatial Memory for Controllable Video World Models Paper • 2603.17117 • Published Mar 17 • 89
Memex(RL): Scaling Long-Horizon LLM Agents via Indexed Experience Memory Paper • 2603.04257 • Published Mar 4 • 19
Mode Seeking meets Mean Seeking for Fast Long Video Generation Paper • 2602.24289 • Published Feb 27 • 41
The Diffusion Duality, Chapter II: $Ψ$-Samplers and Efficient Curriculum Paper • 2602.21185 • Published Feb 24 • 4
Gecko: An Efficient Neural Architecture Inherently Processing Sequences with Arbitrary Lengths Paper • 2601.06463 • Published Jan 10 • 2
LYNX: Learning Dynamic Exits for Confidence-Controlled Reasoning Paper • 2512.05325 • Published Dec 5, 2025 • 5