Persian Models Collection This is the largest collection of Persian models available on Huggingface • 772 items • Updated Aug 23 • 16
Persian Datasets Collection This the largest collection of Persian datasets available on Huggingface • 124 items • Updated Sep 14 • 14
NaturalVoices - Voice Conversion Datasets Collection This is a collaborative work of JHU Smile Lab and CMU MSP Lab. Please cite https://arxiv.org/abs/2511.00256 • 5 items • Updated 12 days ago • 4
Evolving Diagnostic Agents in a Virtual Clinical Environment Paper • 2510.24654 • Published 25 days ago • 11
POWSM: A Phonetic Open Whisper-Style Speech Foundation Model Paper • 2510.24992 • Published 25 days ago • 2
OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes Paper • 2510.26800 • Published 23 days ago • 21
Kimi Linear: An Expressive, Efficient Attention Architecture Paper • 2510.26692 • Published 23 days ago • 108
The End of Manual Decoding: Towards Truly End-to-End Language Models Paper • 2510.26697 • Published 23 days ago • 113
ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning Paper • 2510.27492 • Published 23 days ago • 79
π_RL: Online RL Fine-tuning for Flow-based Vision-Language-Action Models Paper • 2510.25889 • Published 24 days ago • 63
RLCR Collection Collection of models and datasets for Beyond Binary Rewards: Training LMs to Reason about their Uncertainty • 10 items • Updated Aug 6 • 7
view article Article Building a Healthcare Robot from Simulation to Deployment with NVIDIA Isaac 25 days ago • 26
VAMOS: A Hierarchical Vision-Language-Action Model for Capab Collection This collection contains VLM planner checkpoints, affordance module checkpoints for Spot and HOUND, training datasets, and a demo • 7 items • Updated 26 days ago • 1
Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing Paper • 2510.19808 • Published about 1 month ago • 28