A TTS model capable of generating ultra-realistic dialogue
Towards Human-Sounding Speech
High-quality multi-lingual text-to-speech library by MyShell.ai
A Conversational Speech Generation Model
Offline desktop app to convert EPUB to MP3 using Kokoro-82M neural TTS
VITS2 backbone with multilingual-bert
Unofficial Parallel WaveGAN
Best practice TTS based on BERT and VITS
Singing Voice Synthesis via Shallow Diffusion Mechanism
WaveRNN Vocoder + TTS
General Speech Restoration
Conditional Variational Autoencoder with Adversarial Learning
Generative Adversarial Networks for Efficient and High Fidelity Speech
DeepMind's Tacotron-2 Tensorflow implementation
Toolkit for efficient experimentation with Speech Recognition