Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
A fast TTS architecture with conditional flow matching
SOTA discrete acoustic codec models with 40/75 tokens per second
Unofficial Parallel WaveGAN
A list of accessible speech corpora for ASR, TTS
DeepMind's Tacotron-2 Tensorflow implementation