GRPO-MA: Multi-Answer Generation in GRPO for Stable and Efficient Chain-of-Thought Training Paper • 2509.24494 • Published Sep 29 • 10 • 2
Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation Paper • 2508.05635 • Published Aug 7 • 73 • 2