TTS with Kokoro and ONNX Runtime
Instant voice cloning by MIT and MyShell. Audio foundation model
Just 1 minute of voice data is enough to train a good TTS model
SOTA Open Source TTS
Generate audiobooks from e-books
Tokenizer-Free TTS for Multilingual Speech Generation
High-Quality Voice Cloning TTS for 600+ Languages
MOSS-TTS-Nano is an open-source multilingual tiny speech generation
Qwen3-TTS is an open-source series of TTS models
A nearly-live implementation of OpenAI's Whisper
Multi-lingual large voice generation model, providing inference
Open-source multi-speaker long-form text-to-speech model
A generative speech model for daily dialogue
State-of-the-art TTS model under 25MB
An open-source text-to-speech system built by inverting Whisper
A sound cloning tool with a web interface, using your voice
A TTS that fits in your CPU (and pocket)
On-device TTS model by Neuphonic
MOSS-TTS Family open-source speech and sound generation model
TTS model capable of streaming conversational audio in realtime
The official Python library for the Fish Audio API
Speech-AI-Forge is a project developed around TTS generation models
NeuTTS model built from small LLM backbones
Industrial-level controllable zero-shot text-to-speech system
GLM-4-Voice | End-to-End Chinese-English Conversational Model