Instructions to use maya-research/maya1 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use maya-research/maya1 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-to-speech", model="maya-research/maya1")# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("maya-research/maya1") model = AutoModelForCausalLM.from_pretrained("maya-research/maya1") - Notebooks
- Google Colab
- Kaggle
audio quality is weird
#29
by AnusreeDas01 - opened
When serving via our previous HF Transformers pipeline, long inputs produced full-length audio with occasional US-accent drift. When switching to vLLM for faster inference, short utterances are usually OK, but long outputs end early or sound robotic; accent drift to a mild American accent occurs in outputs(with or without quantization).
also sometimes the model would read the voice description text with original input.
Also to explain better, the model works best when you create your description similarly as verbose as possible like the template in the description to more stable and consistent results. this alone would fix everything of what you've mentioned.
bharathkumarK changed discussion status to closed