Fhrozen
/

Spark-TTS-0.5B-ONNX

Model card Files Files and versions

Spark-TTS-0.5B-ONNX / README.md

Fhrozen's picture

update readme

a00b7cd 4 months ago

|

history blame contribute delete

710 Bytes

	---
	license: cc-by-nc-sa-4.0
	base_model: SparkAudio/Spark-TTS-0.5B
	tags:
	- ONNX
	- ONNXRuntime
	- text-to-speech
	language:
	- en
	- zh
	---

	# Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens

	## Overview

	Spark-TTS is an advanced text-to-speech system that uses the power of large language models (LLM) for highly accurate and natural-sounding voice synthesis. It is designed to be efficient, flexible, and powerful for both research and production use.

	https://huggingface.co/SparkAudio/Spark-TTS-0.5B with ONNX weights.

	## Usage

	```python
	python test_spark_tts.py --text "Your text to synthesize" --model_dir "/path/to/model"
	```