State-of-the-art TTS model under 25MB
Qwen3-TTS is an open-source series of TTS models
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
StreamSpeech is a seamless model for offline speech recognition
An Open Source text-to-speech system built by inverting Whisper
Official PyTorch Implementation
A simple native web interface that uses ChatTTS to synthesize text
GLM-4-Voice | End-to-End Chinese-English Conversational Model
LLM Large Model of Selling Anchor
Multi-lingual large voice generation model, providing inference
High-quality multi-lingual text-to-speech library by MyShell.ai
A Conversational Speech Generation Model
Best practice TTS based on BERT and VITS
Open source implementation of Microsoft's VALL-E X zero-shot TTS model
Unofficial Parallel WaveGAN
SoftVC VITS Singing Voice Conversion
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Implementation of NÜWA, attention network for text to video synthesis
Real-time music generation using stable diffusion techniques AI
General Speech Restoration
Clone a voice in 5 seconds to generate arbitrary speech in real-time
DeepMind's Tacotron-2 Tensorflow implementation
PyTorch implementation of convolutional neural networks
TensorFlow Implementation of DC-TTS: yet another text-to-speech model
A cross-platform wrapper for common text-to-speech engines in Python