ESPnet

non-profit

https://github.com/espnet/espnet

Activity Feed Request to join this org

AI & ML interests

voice-conversion speech-separation speech-enhancement speech-translation speech-synthesis speech-recognition spoken-language-understanding

Recent Activity

RishabA updated a model 1 day ago

espnet/ta_openslr127

RishabA published a model 2 days ago

espnet/ta_openslr127

d3bach authored a paper 10 days ago

MARQUIS: A Three-Stage Pipeline for Video Retrieval-Augmented Generation

View all activity

updated a model 1 day ago

espnet/ta_openslr127

Automatic Speech Recognition • Updated 1 day ago • 4 • 1

published a model 2 days ago

espnet/ta_openslr127

Automatic Speech Recognition • Updated 1 day ago • 4 • 1

authored 2 papers 4 days ago

End-to-end Whispered Speech Recognition with Frequency-weighted Approaches and Pseudo Whisper Pre-training

Paper • 2005.01972 • Published May 5, 2020

USAD 2.0: Scaling Representation Distillation for Universal Audio Understanding

Paper • 2606.06444 • Published 9 days ago • 3

updated a model 16 days ago

espnet/multi-talker-whisper-small-ami

Automatic Speech Recognition • Updated 16 days ago • 24

published a model 19 days ago

espnet/multi-talker-whisper-small-ami

Automatic Speech Recognition • Updated 16 days ago • 24

cjli

updated a model about 1 month ago

espnet/powsm_ctc

Automatic Speech Recognition • Updated May 4 • 26 • 5

authored 3 papers about 1 month ago

Improving Multilingual Speech Models on ML-SUPERB 2.0: Fine-tuning with Data Augmentation and LID-Aware CTC

Paper • 2505.24200 • Published May 30, 2025

ESPnet-SpeechLM: An Open Speech Language Model Toolkit

Paper • 2502.15218 • Published Feb 21, 2025

Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence

Paper • 2604.24954 • Published Apr 27 • 25

Aniket-Tathe-08

updated a model about 2 months ago

espnet/marathi_lrec2020

Automatic Speech Recognition • Updated Apr 22 • 2

Aniket-Tathe-08

published a model about 2 months ago

espnet/marathi_lrec2020

Automatic Speech Recognition • Updated Apr 22 • 2

cjli

authored a paper 2 months ago

An Empirical Recipe for Universal Phone Recognition

Paper • 2603.29042 • Published Mar 30 • 5

in espnet/owsm_ctc_v4_1B 2 months ago

Add Open ASR Leaderboard evaluation results

#4 opened 2 months ago by

in espnet/owsm_ctc_v3.2_ft_1B 2 months ago

Add Open ASR Leaderboard evaluation results

#2 opened 2 months ago by

in espnet/owsm_ctc_v3.1_1B 2 months ago

Add Open ASR Leaderboard evaluation results

#3 opened 2 months ago by

in espnet/owsm_ctc_v4_1B 2 months ago

Add Open ASR Leaderboard evaluation results

#3 opened 2 months ago by

in espnet/owsm_ctc_v3.2_ft_1B 2 months ago

Add Open ASR Leaderboard evaluation results

#1 opened 2 months ago by

in espnet/owsm_ctc_v3.1_1B 2 months ago

Add Open ASR Leaderboard evaluation results

#2 opened 2 months ago by

authored a paper 2 months ago

An Empirical Recipe for Universal Phone Recognition

Paper • 2603.29042 • Published Mar 30 • 5