Clone a voice in 5 seconds to generate arbitrary speech in real time
A sound cloning tool with a web interface, using your voice
Instant voice cloning by MIT and MyShell. Audio foundation model
Comprehensive Gradio WebUI for audio processing
A simple, high-quality voice conversion tool focused on ease of use
As little as 1 minute of voice data can be used to train a good TTS model
A high-quality rapid TTS voice cloning model
Industrial-level controllable zero-shot text-to-speech system
Generate audiobooks from e-books, with voice cloning and support for 1107+ languages
Official PyTorch Implementation
A lightweight text-to-speech model with zero-shot voice cloning
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Multilingual large voice generation model, providing inference
Foundational model for human-like, expressive TTS
Real-time voice interactive digital human
The official Python SDK for the ElevenLabs API
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Spark-TTS Inference Code
Open-source framework for intelligent speech interaction
Controllable & emotion-expressive zero-shot TTS
Easy-to-use Speech Toolkit including Self-Supervised Learning model
MARS5 speech model (TTS) from CAMB.AI
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Official MiniMax Model Context Protocol (MCP) server
SOTA Open Source TTS