stepfun-ai/Step-Audio-R1
Audio-Text-to-Text
•
33B
•
Updated
•
506
•
129
Open source models with audio understanding. Tracking mostly vendor releases in the audio and text to text subclassification of multimodal.
Analyze audio to recognize speech, translate, and more