ByteDance/veAgentBench
Updated
•
96
None defined yet.
Reg-DPO: SFT-Regularized Direct Preference Optimization with GT-Pair for Improving Video Generation
PairUni: Pairwise Training for Unified Multimodal Language Models