A TTS that fits in your CPU (and pocket)
State-of-the-art TTS model under 25MB
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
MOSS-TTS-Nano is an open-source multilingual tiny speech generation
A nearly-live implementation of OpenAI's Whisper
SOTA Open Source TTS
Capable of understanding text, audio, vision, video
Generate audiobooks from e-books
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
A lightweight text-to-speech model with zero-shot voice cloning
Towards Human-Sounding Speech
Interface for OuteTTS models
Dia-1.6B generates lifelike English dialogue and vocal expressions