State-of-the-art TTS model under 25MB
Instant voice cloning by MIT and MyShell. Audio foundation model
A high-quality rapid TTS voice cloning model
Converts text to speech in realtime
Qwen3-TTS is an open-source series of TTS models
Virtual AI anchor that combines state-of-the-art technology
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
Multi-lingual large voice generation model, providing inference
Towards Human-Sounding Speech
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Long-form streaming TTS system for multi-speaker dialogue generation
Framework for building neural networks
Controllable and fast Text-to-Speech for over 7000 languages
Toolkit for audio, music, and speech generation
Mice speech to text with MX Cinnamon OS ISO
Chinese voice dialogue robot/smart speaker project
General Speech Restoration
Real-Time State-of-the-art Speech Synthesis for Tensorflow 2
Conditional Variational Autoencoder with Adversarial Learning
Generative Adversarial Networks for Efficient and High Fidelity Speech
Vinux is an Ubuntu derived distribution for blind & visually impaired.
A cross-platform wrapper for common text-to-speech engines in Python