Clone a voice in 5 seconds to generate arbitrary speech in real-time
A sound cloning tool with a web interface, using your voice
Comprehensive Gradio WebUI for audio processing
Instant voice cloning by MIT and MyShell. Audio foundation model
A simple, high-quality voice conversion tool focused on ease of use
1 min voice data can also be used to train a good TTS model
The open-source voice synthesis studio powered by Qwen3-TTS
A high-quality rapid TTS voice cloning model
High-Quality Voice Cloning TTS for 600+ Languages
Generate audiobooks from e-books, voice cloning & 1107+ languages
Industrial-level controllable zero-shot text-to-speech system
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Official PyTorch Implementation
A lightweight text-to-speech model with zero-shot voice cloning
Foundational model for human-like, expressive TTS
Tokenizer-Free TTS for Multilingual Speech Generation
One-stop AI digital human system with video voice synthesis tools
Multi-lingual large voice generation model, providing inference
Real-time voice interactive digital human
The official Python SDK for the ElevenLabs API
Video translation and dubbing tool powered by LLMs
Open-source framework for intelligent speech interaction
Spark-TTS Inference Code
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
MOSS-TTS-Nano is an open-source multilingual tiny speech generation