VITS2 backbone with multilingual-bert
A fast TTS architecture with conditional flow matching
SOTA discrete acoustic codec models with 40/75 tokens per second
Controllable and fast Text-to-Speech for over 7000 languages
One-click deployment (including offline integration package)
A TTS model capable of generating ultra-realistic dialogue
Bailing is a voice dialogue robot similar to GPT-4o
Reading book source
MARS5 speech model (TTS) from CAMB.AI
Multi-Voice and Prompt-Controlled TTS Engine
Text to Speech Utility
Unofficial Parallel WaveGAN
A Conversational Speech Generation Model
Open source implementation of Microsoft's VALL-E X zero-shot TTS model
Chuyển đổi văn bản thành giọng nói không giới hạn
Best practice TTS based on BERT and VITS
Mice speech to text with MX Cinnamon OS ISO
Chinese voice dialogue robot/smart speaker project
A webui for different audio related Neural Networks
Txt-2-Mp3 6.3 Mark 2 [Improved.Simplified.Alternative]
Singing Voice Synthesis via Shallow Diffusion Mechanism
WaveRNN Vocoder + TTS
Clone a voice in 5 seconds to generate arbitrary speech in real-time
General Speech Restoration