Synchronized Translation for Videos
Generate audiobooks from e-books, voice cloning & 1107+ languages
State-of-the-art TTS model under 25MB
A high-quality rapid TTS voice cloning model
Qwen3-TTS is an open-source series of TTS models
Controllable & emotion-expressive zero-shot TTS
SOTA Open Source TTS
Multi-lingual large voice generation model, providing inference
Build Vision Agents quickly with any model or video provider
Controllable and fast Text-to-Speech for over 7000 languages
A TTS model capable of generating ultra-realistic dialogue
Scalable generative AI framework built for researchers and developers
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
Foundational model for human-like, expressive TTS
End-to-end speech processing toolkit
Toolkit for conversational AI
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
A lightweight text-to-speech model with zero-shot voice cloning
Framework for building neural networks
TTS with kokoro and onnx runtime
An Open Source text-to-speech system built by inverting Whisper
Towards Human-Sounding Speech
Interface for OuteTTS models
Multi-Voice and Prompt-Controlled TTS Engine
A small clipboard reader