A single Gradio + React WebUI with extensions for ACE-Step
A simple native web interface that uses ChatTTS to synthesize text
Comprehensive Gradio WebUI for audio processing
Build Vision Agents quickly with any model or video provider
SOTA Open Source TTS
Speech-AI-Forge is a project developed around TTS generation model
The open-source voice synthesis studio powered by Qwen3-TTS
TTS with kokoro and onnx runtime
A simple, high-quality voice conversion tool focused on ease of use
Synchronized Translation for Videos
Qwen3-TTS is an open-source series of TTS models
A sound cloning tool with a web interface, using your voice
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Tokenizer-Free TTS for Multilingual Speech Generation
Offline Text To Speech synthesis for python
Instant voice cloning by MIT and MyShell. Audio foundation model
State-of-the-art TTS model under 25MB
Use Microsoft Edge's online text-to-speech service from Python
Readest is a modern, feature-rich ebook reader
Generate audiobooks from e-books, voice cloning & 1107+ languages
Generate audiobooks from e-books
Code for openai.fm, a demo for the OpenAI Speech API
A nearly-live implementation of OpenAI's Whisper
A text-to-speech, speech-to-text and speech-to-speech library
A cross-platform software for text translation and recognition