GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published 7 days ago • 182
Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting Paper • 2601.02151 • Published 10 days ago • 93
VAR RL Done Right: Tackling Asynchronous Policy Conflicts in Visual Autoregressive Generation Paper • 2601.02256 • Published 10 days ago • 32
NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation Paper • 2601.02204 • Published 10 days ago • 56
DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer Paper • 2601.01425 • Published 11 days ago • 49
sugarblock/music_genres_classification-finetuned-gtzan Audio Classification • 94.6M • Updated Feb 7, 2025 • 4