Clone a voice in 5 seconds to generate arbitrary speech in real-time
Industrial-level controllable zero-shot text-to-speech system
An open-source text-to-speech system built by inverting Whisper
A fast TTS architecture with conditional flow matching
Unofficial Parallel WaveGAN
A deep learning toolkit for Text-to-Speech, battle-tested in research
Best practice TTS based on BERT and VITS
Audio generation using diffusion models, in PyTorch
Singing Voice Synthesis via Shallow Diffusion Mechanism
WaveRNN Vocoder + TTS
General Speech Restoration
PAddle PARAllel text-to-speech toolKIT
Real-Time State-of-the-art Speech Synthesis for TensorFlow 2
Conditional Variational Autoencoder with Adversarial Learning
Implementation of a Transformer-based neural network
Generative Adversarial Networks for Efficient and High Fidelity Speech
DeepMind's Tacotron-2 TensorFlow implementation