AI & ML interests

voice-conversion speech-separation speech-enhancement speech-translation speech-synthesis speech-recognition spoken-language-understanding

Recent Activity

qingzhengwang  authored a paper 3 days ago
Fish Audio S2 Technical Report
JinchuanTian  updated a dataset 4 days ago
espnet/data_part7
JinchuanTian  published a dataset 4 days ago
espnet/data_part7
View all activity

espnet 's collections 10

OWLS: Scaling Laws for Speech Recognition and Translation
🦉 A suite of Whisper-style models from 250M to 18B parameters. Trained on up to 360K hours of data. 16k sampling rate.
OWSM: Fully Open Speech Recognition and Translation Models
A collection of models related to the Open Whisper-style Speech Models (OWSM) project from CMU: https://www.wavlab.org/activities/2024/owsm/
OWSM: Fully Open Speech Recognition and Translation Models
A collection of models related to the Open Whisper-style Speech Models (OWSM) project from CMU: https://www.wavlab.org/activities/2024/owsm/
OWLS: Scaling Laws for Speech Recognition and Translation
🦉 A suite of Whisper-style models from 250M to 18B parameters. Trained on up to 360K hours of data. 16k sampling rate.