1 min voice data can also be used to train a good TTS model
Instant voice cloning by MIT and MyShell. Audio foundation model
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Easy-to-use Speech Toolkit including Self-Supervised Learning model
A deep learning toolkit for Text-to-Speech, battle-tested in research
Singing voice change based on whisper, lora for singing voice clone
A Python/Pytorch app for easily synthesising human voices
PAddle PARAllel text-to-speech toolKIT
An implementation of Tacotron 2 that supports multilingual experiments