MosaicMem: Hybrid Spatial Memory for Controllable Video World Models Paper • 2603.17117 • Published 3 days ago • 78
MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild Paper • 2603.17187 • Published 3 days ago • 109
ESPIRE: A Diagnostic Benchmark for Embodied Spatial Reasoning of Vision-Language Models Paper • 2603.13033 • Published 7 days ago • 13
GradMem: Learning to Write Context into Memory with Test-Time Gradient Descent Paper • 2603.13875 • Published 7 days ago • 27
Learning Latent Proxies for Controllable Single-Image Relighting Paper • 2603.15555 • Published 4 days ago • 8
From Sparse to Dense: Multi-View GRPO for Flow Models via Augmented Condition Space Paper • 2603.12648 • Published 8 days ago • 12
DVD: Deterministic Video Depth Estimation with Generative Priors Paper • 2603.12250 • Published 8 days ago • 26
DVD: Deterministic Video Depth Estimation with Generative Priors Paper • 2603.12250 • Published 8 days ago • 26
DVD: Deterministic Video Depth Estimation with Generative Priors Paper • 2603.12250 • Published 8 days ago • 26
Internalizing Meta-Experience into Memory for Guided Reinforcement Learning in Large Language Models Paper • 2602.10224 • Published Feb 10 • 19
When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning Paper • 2602.10560 • Published Feb 11 • 30
VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model Paper • 2602.10098 • Published Feb 10 • 19