MIDAS: Multimodal Interactive Digital-human Synthesis via Real-time Autoregressive Video Generation Paper • 2508.19320 • Published Aug 26 • 29
Sana Collection ⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 21 items • Updated Sep 13 • 97
SANA-1.5 Collection SANA-1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer • 6 items • Updated Sep 13 • 10
LongAI Collection Boost AI's Long ability, while keeping Efficient. Models in this collection includes LongVILA, LongVILA-R1, LongLive. • 8 items • Updated 27 days ago • 2
SANA-Video Collection 🎬 SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer • 4 items • Updated 29 days ago • 5
MotionStream: Real-Time Video Generation with Interactive Motion Controls Paper • 2511.01266 • Published 30 days ago • 27
UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions Paper • 2511.03334 • Published 28 days ago • 51
ChronoEdit Collection ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation • 8 items • Updated 8 days ago • 10
MDGA Collection Make Diffusion Great Again. The resource list for Super Data Learners, Quokka, and OpenMoE 2. • 16 items • Updated 28 days ago • 8
Nemotron-Personas Collection A collection of multilingual, region-specific synthetic persona datasets that support sovereign AI development across many countries and regions. • 3 items • Updated 8 days ago • 13
Indic Parler-TTS Collection Collection of Parler-TTS models adapted to Indian languages. • 3 items • Updated Dec 4, 2024 • 9
TalkingMachines: Real-Time Audio-Driven FaceTime-Style Video via Autoregressive Diffusion Models Paper • 2506.03099 • Published Jun 3 • 19
Kimi-K2 Collection Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intellegence • 5 items • Updated 18 days ago • 154
AM-Thinking-v1: Advancing the Frontier of Reasoning at 32B Scale Paper • 2505.08311 • Published May 13 • 18