Qwen3-TTS is an open-source series of TTS models
Offline Text To Speech synthesis for python
A TTS that fits in your CPU (and pocket)
Generate audiobooks from EPUBs, PDFs and text with captions
Industrial-level controllable zero-shot text-to-speech system
Offline inference engine for art, real-time voice conversations
A simple, high-quality voice conversion tool focused on ease of use
TTS with kokoro and onnx runtime
Converts text to speech in realtime
A fast TTS architecture with conditional flow matching
Controllable and fast Text-to-Speech for over 7000 languages
Reading book source
Synchronized Translation for Videos
Generate audiobooks from e-books, voice cloning & 1107+ languages
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Comprehensive Gradio WebUI for audio processing
A lightweight text-to-speech model with zero-shot voice cloning
Framework for building neural networks
Free, high-quality text-to-speech API endpoint to replace OpenAI
A high-quality rapid TTS voice cloning model
End-to-end speech processing toolkit
Instant voice cloning by MIT and MyShell. Audio foundation model
Scalable generative AI framework built for researchers and developers
Official MiniMax Model Context Protocol (MCP) server
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles