Qwen3-TTS is an open-source series of TTS models
Generate audiobooks from EPUBs, PDFs and text with captions
A TTS that fits in your CPU (and pocket)
Reading book source
A fast TTS architecture with conditional flow matching
Offline Text To Speech synthesis for python
Converts text to speech in realtime
Offline inference engine for art, real-time voice conversations
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Official MiniMax Model Context Protocol (MCP) server
Industrial-level controllable zero-shot text-to-speech system
A lightweight text-to-speech model with zero-shot voice cloning
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Controllable and fast Text-to-Speech for over 7000 languages
Free, high-quality text-to-speech API endpoint to replace OpenAI
End-to-end speech processing toolkit
High-Quality Voice Cloning TTS for 600+ Languages
Scalable generative AI framework built for researchers and developers
Framework for building neural networks
VITS2 backbone with multilingual-bert
A Conversational Speech Generation Model
Offline desktop app to convert EPUB to MP3 using Kokoro-82M neural TTS
Multi-Voice and Prompt-Controlled TTS Engine
A webui for different audio related Neural Networks
WaveRNN Vocoder + TTS