Video-Thinker: Sparking "Thinking with Videos" via Reinforcement Learning Paper • 2510.23473 • Published 3 days ago • 75
Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel Decoding Paper • 2505.16990 • Published May 22 • 22
REPAIR: Robust Editing via Progressive Adaptive Intervention and Reintegration Paper • 2510.01879 • Published 29 days ago • 8
Refusal Falls off a Cliff: How Safety Alignment Fails in Reasoning? Paper • 2510.06036 • Published 24 days ago • 6
Interleaving Reasoning for Better Text-to-Image Generation Paper • 2509.06945 • Published Sep 8 • 14
Seedance 1.0: Exploring the Boundaries of Video Generation Models Paper • 2506.09113 • Published Jun 10 • 102
Flow-GRPO: Training Flow Matching Models via Online RL Paper • 2505.05470 • Published May 8 • 85
Magic 1-For-1: Generating One Minute Video Clips within One Minute Paper • 2502.07701 • Published Feb 11 • 35
Direct Preference Optimization Using Sparse Feature-Level Constraints Paper • 2411.07618 • Published Nov 12, 2024 • 17