Toolkit for conversational AI
State-of-the-art TTS model under 25MB
A generative speech model for daily dialogue
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
A simple, high-quality voice conversion tool focused on ease of use
Comprehensive Gradio WebUI for audio processing
TTS with Kokoro and ONNX Runtime
Instant voice cloning by MIT and MyShell. Audio foundation model
Qwen3-TTS is an open-source series of TTS models
SoTA open-source TTS
Industrial-level controllable zero-shot text-to-speech system
Generate audiobooks from e-books, voice cloning & 1107+ languages
Long-form streaming TTS system for multi-speaker dialogue generation
Synchronized Translation for Videos
Tokenizer-Free TTS for Multilingual Speech Generation
Generate audiobooks from EPUBs, PDFs and text with captions
Controllable & emotion-expressive zero-shot TTS
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
A TTS that fits in your CPU (and pocket)
EPUB to audiobook converter, optimized for Audiobookshelf
Offline text-to-speech synthesis for Python
A text-to-speech, speech-to-text and speech-to-speech library
A fast TTS architecture with conditional flow matching
Use Microsoft Edge's online text-to-speech service from Python
A nearly-live implementation of OpenAI's Whisper