A simple, high-quality voice conversion tool focused on ease of use
Synchronized Translation for Videos
TTS with kokoro and onnx runtime
Offline Text To Speech synthesis for python
Comprehensive Gradio WebUI for audio processing
Use Microsoft Edge's online text-to-speech service from Python
A sound cloning tool with a web interface, using your voice
Instant voice cloning by MIT and MyShell. Audio foundation model
Generate audiobooks from e-books, voice cloning & 1107+ languages
A nearly-live implementation of OpenAI's Whisper
A high-quality rapid TTS voice cloning model
A TTS that fits in your CPU (and pocket)
Qwen3-TTS is an open-source series of TTS models
SOTA Open Source TTS
State-of-the-art TTS model under 25MB
Generate audiobooks from e-books
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
Generate audiobooks from EPUBs, PDFs and text with captions
A simple native web interface that uses ChatTTS to synthesize text
Virtual AI anchor that combines state-of-the-art technology
The official Python SDK for the ElevenLabs API
A generative speech model for daily dialogue
Offline inference engine for art, real-time voice conversations
Python library and CLI tool to interface with Google Translate
Build Vision Agents quickly with any model or video provider