AI-powered speech denoising and enhancement
Open source implementation of Microsoft's VALL-E X zero-shot TTS model
Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Conditional Variational Autoencoder with Adversarial Learning
Implementation of a Transformer-based neural network
A Python package to analyze and compare voices with deep learning
TensorFlow implementation of DC-TTS: yet another text-to-speech model
Dia-1.6B generates lifelike English dialogue and vocal expressions