Build Vision Agents quickly with any model or video provider
Qwen3-TTS is an open-source series of TTS models
A TTS that fits in your CPU (and pocket)
High-Quality Voice Cloning TTS for 600+ Languages
Self-host the powerful Chatterbox TTS model
Offline Text To Speech synthesis for python
Generate audiobooks from EPUBs, PDFs and text with captions
TTS with kokoro and onnx runtime
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Industrial-level controllable zero-shot text-to-speech system
Offline inference engine for art, real-time voice conversations
Reading book source
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
A simple, high-quality voice conversion tool focused on ease of use
Converts text to speech in realtime
Comprehensive Gradio WebUI for audio processing
A fast TTS architecture with conditional flow matching
Free, high-quality text-to-speech API endpoint to replace OpenAI
Instant voice cloning by MIT and MyShell. Audio foundation model
SOTA Open Source TTS
Generate audiobooks from e-books, voice cloning & 1107+ languages
SoTA open-source TTS
A lightweight text-to-speech model with zero-shot voice cloning
Framework for building neural networks
Official MiniMax Model Context Protocol (MCP) server