Article Open ASR Leaderboard: Trends and Insights with New Multilingual & Long-Form Tracks 12 days ago • 19
MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance Paper • 2510.00499 • Published Oct 1 • 19
Treble10: A high-quality dataset for far-field speech recognition, dereverberation, and enhancement Paper • 2510.23141 • Published Oct 27 • 4
Article Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models 13 days ago • 25
Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B Paper • 2511.06221 • Published 23 days ago • 125
Article Llasa Goes RL: Training LLaSA with GRPO for Improved Prosody and Expressiveness 27 days ago • 10
Open ASR Leaderboard: Towards Reproducible and Transparent Multilingual and Long-Form Speech Recognition Evaluation Paper • 2510.06961 • Published Oct 8 • 8
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM Paper • 2510.15870 • Published Oct 17 • 88
Article High-Quality Datasets for Far-Field ASR (Treble Technologies x Hugging Face) Oct 13 • 16