Article Transformers v5: Simple model definitions powering the AI ecosystem 1 day ago • 107
Multimodal Implementations Collection Comprehensive Demo of Multimodal VLMs on the Hub • 18 items • Updated 10 days ago • 8
Article Open ASR Leaderboard: Trends and Insights with New Multilingual & Long-Form Tracks 11 days ago • 19
Open ASR Leaderboard: Towards Reproducible and Transparent Multilingual and Long-Form Speech Recognition Evaluation Paper • 2510.06961 • Published Oct 8 • 8
Article Introducing AnyLanguageModel: One API for Local and Remote LLMs on Apple Platforms 12 days ago • 25
Meta CLIP 1 Collection Scaling CLIP data with transparent training distribution from an end-to-end pipeline. • 7 items • Updated 7 days ago • 21
Article Introducing AuraFace: Open-Source Face Recognition and Identity Preservation Models Aug 26, 2024 • 52
OlmoEarth Collection OlmoEarth pre-trained and fine-tuned foundation models for remote sensing • 10 items • Updated 3 days ago • 12
Kimi Linear: An Expressive, Efficient Attention Architecture Paper • 2510.26692 • Published Oct 30 • 113
SoulX-Podcast: Towards Realistic Long-form Podcasts with Dialectal and Paralinguistic Diversity Paper • 2510.23541 • Published Oct 27 • 13