EgoPush: Learning End-to-End Egocentric Multi-Object Rearrangement for Mobile Robots Paper • 2602.18071 • Published 28 days ago • 22
Generated Reality: Human-centric World Simulation using Interactive Video Generation with Hand and Camera Control Paper • 2602.18422 • Published 28 days ago • 30
Does Your Reasoning Model Implicitly Know When to Stop Thinking? Paper • 2602.08354 • Published Feb 9 • 262
MultiShotMaster: A Controllable Multi-Shot Video Generation Framework Paper • 2512.03041 • Published Dec 2, 2025 • 66
GENIUS: Generative Fluid Intelligence Evaluation Suite Paper • 2602.11144 • Published Feb 11 • 55
OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration Paper • 2602.05400 • Published Feb 5 • 349
DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos Paper • 2602.06949 • Published Feb 6 • 36
Running on CPU Upgrade • 1.22k • Omni Image Editor • Image edit, text to image, image upscale, remove watermark
Running on Zero MCP • 1.39k • Wan2.2 14B Preview • Generate a video from an image with a text prompt
Running on Zero MCP Featured • 1.11k • Qwen-Image-Edit-2511-LoRAs-Fast • Demo of the Collection of Qwen Image Edit LoRAs
Running on Zero Featured • 1.72k • Qwen3-TTS Demo • Generate speech audio via voice design, cloning, or preset speakers
Running on Zero MCP • 2.6k • Z Image Turbo • Generate high-quality images from text prompts instantly
Green-VLA: Staged Vision-Language-Action Model for Generalist Robots Paper • 2602.00919 • Published Jan 31 • 317