A simple, high-quality voice conversion tool focused on ease of use
Industrial-level controllable zero-shot text-to-speech system
State-of-the-art TTS model under 25MB
A high-quality rapid TTS voice cloning model
Tokenizer-Free TTS for Multilingual Speech Generation
High-Quality Voice Cloning TTS for 600+ Languages
TTS with kokoro and onnx runtime
A fast TTS architecture with conditional flow matching
Qwen3-TTS is an open-source series of TTS models
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Multi-lingual large voice generation model, providing inference
Miso TTS is an 8 billion, highly emotive text-to-speech model
The official Python SDK for the ElevenLabs API
Use Microsoft Edge's online text-to-speech service from Python
SOTA Open Source TTS
Generate audiobooks from e-books
Controllable & emotion-expressive zero-shot TTS
MARS5 speech model (TTS) from CAMB.AI
A sound cloning tool with a web interface, using your voice
A text-to-speech, speech-to-text and speech-to-speech library
Converts text to speech in realtime
Free, high-quality text-to-speech API endpoint to replace OpenAI
Long-form streaming TTS system for multi-speaker dialogue generation
SOTA discrete acoustic codec models with 40/75 tokens per second
Interface for OuteTTS models