A simple, high-quality voice conversion tool focused on ease of use
State-of-the-art TTS model under 25MB
Industrial-level controllable zero-shot text-to-speech system
A high-quality rapid TTS voice cloning model
High-Quality Voice Cloning TTS for 600+ Languages
Tokenizer-Free TTS for Multilingual Speech Generation
TTS with kokoro and onnx runtime
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
A fast TTS architecture with conditional flow matching
Qwen3-TTS is an open-source series of TTS models
The official Python SDK for the ElevenLabs API
Multi-lingual large voice generation model, providing inference
Miso TTS is an 8 billion, highly emotive text-to-speech model
SOTA Open Source TTS
Code for openai.fm, a demo for the OpenAI Speech API
Use Microsoft Edge's online text-to-speech service from Python
Like the macOS say command, but with a modern voice
Generate audiobooks from e-books
Controllable & emotion-expressive zero-shot TTS
MARS5 speech model (TTS) from CAMB.AI
A text-to-speech, speech-to-text and speech-to-speech library
A sound cloning tool with a web interface, using your voice
Converts text to speech in realtime
Open source text-to-speech tool, supports extra-long text
Free, high-quality text-to-speech API endpoint to replace OpenAI