Instant voice cloning by MIT and MyShell. Audio foundation model
Towards Human-Sounding Speech
Automatically translates the text of a video based on a subtitle file
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
MARS5 speech model (TTS) from CAMB.AI
Best practice TTS based on BERT and VITS