Qwen3-TTS is an open-source series of TTS models
A TTS that fits in your CPU (and pocket)
High-Quality Voice Cloning TTS for 600+ Languages
TTS with kokoro and onnx runtime
On-device TTS model by Neuphonic
MOSS‑TTS Family open‑source speech and sound generation model
The official Python library for the Fish Audio API
NeuTTS model built from small LLM backbones
Instant voice cloning by MIT and MyShell. Audio foundation model
Industrial-level controllable zero-shot text-to-speech system
1 min voice data can also be used to train a good TTS model
A fast TTS architecture with conditional flow matching
A lightweight text-to-speech model with zero-shot voice cloning
SOTA Open Source TTS
Tokenizer-Free TTS for Multilingual Speech Generation
Converts text to speech in realtime
SoTA open-source TTS
Open-source multi-speaker long-form text-to-speech model
A nearly-live implementation of OpenAI's Whisper
Generate audiobooks from e-books
Multi-lingual large voice generation model, providing inference
MOSS-TTS-Nano is an open-source multilingual tiny speech generation
State-of-the-art TTS model under 25MB
An Open Source text-to-speech system built by inverting Whisper
A generative speech model for daily dialogue