1 min voice data can also be used to train a good TTS model
Instant voice cloning by MIT and MyShell. Audio foundation model
Easy-to-use Speech Toolkit including Self-Supervised Learning model
elevenlabs-api is an open source Java wrapper around the ElevenLabs
Singing voice change based on whisper, lora for singing voice clone
An implementation of Tacotron 2 that supports multilingual experiments