State-of-the-art TTS model under 25MB
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
A nearly-live implementation of OpenAI's Whisper
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Towards Human-Sounding Speech
Generate audiobooks from e-books
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
A simple native web interface that uses ChatTTS to synthesize text
Interface for OuteTTS models
Generative Adversarial Networks for Efficient and High Fidelity Speech
Bangla text to speech synthesis in python