Industrial-level controllable zero-shot text-to-speech system
State-of-the-art TTS model under 25MB
High-Quality Voice Cloning TTS for 600+ Languages
Tokenizer-Free TTS for Multilingual Speech Generation
TTS with Kokoro and ONNX Runtime
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
MOSS-TTS-Nano is an open-source multilingual tiny speech generation
Qwen3-TTS is an open-source series of TTS models
A fast TTS architecture with conditional flow matching
Multi-lingual large voice generation model, providing inference
Miso TTS is an 8-billion-parameter, highly emotive text-to-speech model
SOTA Open Source TTS
Generate audiobooks from e-books
Controllable & emotion-expressive zero-shot TTS
MARS5 speech model (TTS) from CAMB.AI
A text-to-speech, speech-to-text and speech-to-speech library
A sound cloning tool with a web interface, using your voice
Converts text to speech in real time
Long-form streaming TTS system for multi-speaker dialogue generation
Interface for OuteTTS models
A TTS model capable of generating ultra-realistic dialogue
Towards Human-Sounding Speech
A Conversational Speech Generation Model
VITS2 backbone with multilingual-bert
AI powered speech denoising and enhancement