Instant voice cloning by MIT and MyShell. Audio foundation model
1 min voice data can also be used to train a good TTS model
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Singing voice change based on whisper, lora for singing voice clone
[WIP] VoiceSmith makes training text to speech models easy
A Python/Pytorch app for easily synthesising human voices
An implementation of Tacotron 2 that supports multilingual experiments