A deep learning toolkit for Text-to-Speech, battle-tested in research
Best practice TTS based on BERT and VITS
Open source implementation of Microsoft's VALL-E X zero-shot TTS model
Chinese voice dialogue robot/smart speaker project
Singing voice change based on whisper, lora for singing voice clone
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
[WIP] VoiceSmith makes training text to speech models easy
Conditional Variational Autoencoder with Adversarial Learning
An implementation of Tacotron 2 that supports multilingual experiments
A python package to analyze and compare voices with deep learning
PyTorch implementation of convolutional neural networks
TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Cross Audio-Visual Recognition using 3D Architectures
Beamforming and Speech Recognition Toolkit
Dia-1.6B generates lifelike English dialogue and vocal expressions