TTS
Collection
Text To Speech Models
•
3 items
•
Updated
BIA-XTTSv2-moore-v0 is a fine-tuned version of Coqui's XTTS v2 model specifically trained for Moore language, a Gur language spoken in Burkina Faso. This model enables high-quality text-to-speech synthesis for Moore speakers and supports various voice cloning capabilities.
mosimport torch, torchaudio, os
import numpy as np
from tqdm import tqdm
from TTS.tts.configs.xtts_config import XttsConfig
from TTS.tts.models.xtts import Xtts
checkpoint_path = "checkpints"
model_path = "best_model.pth"
device = "cuda:0" if torch.cuda.is_available() else "cpu"
xtts_checkpoint = os.path.join(checkpoint_path, model_path)
xtts_config = os.path.join(checkpoint_path,"config.json")
xtts_vocab = checkpoint_path+"vocab.json"
# Load model
config = XttsConfig()
config.load_json(xtts_config)
XTTS_MODEL = Xtts.init_from_config(config)
XTTS_MODEL.load_checkpoint(config,
checkpoint_path = xtts_checkpoint,
vocab_path = xtts_vocab,
If you use this model in your research, please cite:
@misc{bia-xtts-moore-v0,
title={BIA-XTTSv2-moore-v0: A Fine-tuned XTTS Model for Moore Language},
author={Salif SAWADOGO at Burkimbia},
year={2024},
howpublished={\url{https://huggingface.co/BIA/BIA-XTTSv2-moore-v0}}
}