VITS2 backbone with multilingual-bert
SPPAS - the automatic annotation and analysis of speech
Offline desktop app to convert EPUB to MP3 using Kokoro-82M neural TTS
Multi-Voice and Prompt-Controlled TTS Engine
Best practice TTS based on BERT and VITS
Open source implementation of Microsoft's VALL-E X zero-shot TTS model
Unofficial Parallel WaveGAN
SoftVC VITS Singing Voice Conversion
Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
A web UI for different audio-related neural networks
Official PyTorch Implementation of "Scalable Diffusion Models"
Large dataset of coding contests designed for AI and ML model training
Singing Voice Synthesis via Shallow Diffusion Mechanism
Codebase for Diffusion Models Beat GANs on Image Synthesis
WaveRNN Vocoder + TTS
GLIDE: a diffusion-based text-conditional image synthesis model
3D-aware GANs based on NeRF (arXiv)
Generative Adversarial Transformers
General Speech Restoration
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Main repository of Project Alice, contains main unit source code
PAddle PARAllel text-to-speech toolKIT
Real-Time State-of-the-art Speech Synthesis for TensorFlow 2