DyaDiT: A Multi-Modal Diffusion Transformer for Socially Favorable Dyadic Gesture Generation Paper • 2602.23165 • Published 15 days ago • 2
MIBURI: Towards Expressive Interactive Gesture Synthesis Paper • 2603.03282 • Published 10 days ago • 4
OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens Paper • 2603.02138 • Published 11 days ago • 138
When Does RL Help Medical VLMs? Disentangling Vision, SFT, and RL Gains Paper • 2603.01301 • Published 12 days ago • 8
LoopFormer: Elastic-Depth Looped Transformers for Latent Reasoning via Shortcut Modulation Paper • 2602.11451 • Published 30 days ago • 15
EasyV2V: A High-quality Instruction-based Video Editing Framework Paper • 2512.16920 • Published Dec 18, 2025 • 18
CRISP: Contact-Guided Real2Sim from Monocular Video with Planar Scene Primitives Paper • 2512.14696 • Published Dec 16, 2025 • 8
FastHMR: Accelerating Human Mesh Recovery via Token and Layer Merging with Diffusion Decoding Paper • 2510.10868 • Published Oct 13, 2025 • 12
PickStyle: Video-to-Video Style Transfer with Context-Style Adapters Paper • 2510.07546 • Published Oct 8, 2025 • 22