Instant voice cloning by MIT and MyShell; an audio foundation model
Towards Human-Sounding Speech
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
MARS5 speech model (TTS) from CAMB.AI
Automatically translates the text of a video based on a subtitle file
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Best-practice TTS based on BERT and VITS