An Open Source text-to-speech system built by inverting Whisper
Instant voice cloning audio foundation model by MIT and MyShell
SOTA Open Source TTS
Long-form streaming TTS system for multi-speaker dialogue generation
Dockerized FastAPI wrapper for the Kokoro-82M text-to-speech model
Two integrated text-to-speech engines using MMS and Silero
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)