A simple, high-quality voice conversion tool focused on ease of use
TTS with Kokoro and ONNX Runtime
Synchronized Translation for Videos
Comprehensive Gradio WebUI for audio processing
Offline text-to-speech synthesis for Python
Instant voice cloning by MIT and MyShell. Audio foundation model
Generate audiobooks from e-books, voice cloning & 1107+ languages
State-of-the-art TTS model under 25MB
Use Microsoft Edge's online text-to-speech service from Python
A high-quality rapid TTS voice cloning model
A simple native web interface that uses ChatTTS to synthesize text
Qwen3-TTS is an open-source series of TTS models
SOTA Open Source TTS
A nearly-live implementation of OpenAI's Whisper
Industrial-level controllable zero-shot text-to-speech system
Generate audiobooks from EPUBs, PDFs and text with captions
Offline inference engine for art, real-time voice conversations
A voice cloning tool with a web interface, using your voice
A TTS that fits in your CPU (and pocket)
A generative speech model for daily dialogue
Towards Human-Sounding Speech
Generate audiobooks from e-books
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
A text-to-speech, speech-to-text and speech-to-speech library
Multi-lingual large voice generation model, providing inference