Qwen3-TTS is an open-source series of TTS models
Use Microsoft Edge's online text-to-speech service from Python
Towards Human-Sounding Speech
A TTS that fits in your CPU (and pocket)
A single Gradio + React WebUI with extensions for ACE-Step
State-of-the-art TTS model under 25MB
A lightweight text-to-speech model with zero-shot voice cloning
Controllable & emotion-expressive zero-shot TTS
A fast TTS architecture with conditional flow matching
Spark-TTS Inference Code
Speech to Text to Speech, sends text as OSC messages
Comprehensive Gradio WebUI for audio processing
1 min voice data can also be used to train a good TTS model
Converts text to speech in realtime
Free, high-quality text-to-speech API endpoint to replace OpenAI
SOTA Open Source TTS
Foundational model for human-like, expressive TTS
TTS with kokoro and onnx runtime
Speech-AI-Forge is a project developed around TTS generation model
Bailing is a voice dialogue robot similar to GPT-4o
EPUB to audiobook converter, optimized for Audiobookshelf
Real-time voice interactive digital human
Open-source multi-speaker long-form text-to-speech model
LLM Frontend for Power Users
Industrial-level controllable zero-shot text-to-speech system