A simple, high-quality voice conversion tool focused on ease of use
TTS with kokoro and onnx runtime
Synchronized Translation for Videos
Offline Text To Speech synthesis for python
Comprehensive Gradio WebUI for audio processing
Generate audiobooks from e-books, voice cloning & 1107+ languages
Use Microsoft Edge's online text-to-speech service from Python
Qwen3-TTS is an open-source series of TTS models
Instant voice cloning by MIT and MyShell. Audio foundation model
Offline inference engine for art, real-time voice conversations
State-of-the-art TTS model under 25MB
A text-to-speech, speech-to-text and speech-to-speech library
Generate audiobooks from EPUBs, PDFs and text with captions
Speech-AI-Forge is a project developed around TTS generation model
A TTS that fits in your CPU (and pocket)
A simple native web interface that uses ChatTTS to synthesize text
A nearly-live implementation of OpenAI's Whisper
A sound cloning tool with a web interface, using your voice
SOTA Open Source TTS
Industrial-level controllable zero-shot text-to-speech system
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Virtual AI anchor that combines state-of-the-art technology
Generate audiobooks from e-books
Toolkit for conversational AI
Automatically translates the text of a video based on a subtitle file