Mms Tts Bul
mms-tts-bul
Generate audio from text in Russian
Extract speech from audio files using Silero VAD
I would be using Silero STT (since it works better on CPU)
Audio tagging, genre, and captioning model
Analyze speech segments in WAV files
Silero TTS