VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning Paper • 2510.25772 • Published 17 days ago • 32
Latent Diffusion Model without Variational Autoencoder Paper • 2510.15301 • Published 29 days ago • 48
AdaViewPlanner: Adapting Video Diffusion Models for Viewpoint Planning in 4D Scenes Paper • 2510.10670 • Published Oct 12 • 18
VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning Paper • 2510.08555 • Published Oct 9 • 62
UniMMVSR: A Unified Multi-Modal Framework for Cascaded Video Super-Resolution Paper • 2510.08143 • Published Oct 9 • 20
When Tokens Talk Too Much: A Survey of Multimodal Long-Context Token Compression across Images, Videos, and Audios Paper • 2507.20198 • Published Jul 27 • 26