TTS with kokoro and onnx runtime
A simple, high-quality voice conversion tool focused on ease of use
Qwen3-TTS is an open-source series of TTS models
A TTS that fits in your CPU (and pocket)
High-Quality Voice Cloning TTS for 600+ Languages
Speech-AI-Forge is a project developed around TTS generation model
Instant voice cloning by MIT and MyShell. Audio foundation model
SOTA Open Source TTS
Offline Text To Speech synthesis for python
EPUB to audiobook converter, optimized for Audiobookshelf
A simple native web interface that uses ChatTTS to synthesize text
Offline inference engine for art, real-time voice conversations
A nearly-live implementation of OpenAI's Whisper
Synchronized Translation for Videos
A high-quality rapid TTS voice cloning model
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Free, high-quality text-to-speech API endpoint to replace OpenAI
One-click deployment (including offline integration package)
Towards Human-Sounding Speech
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
Build Vision Agents quickly with any model or video provider
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Spark-TTS Inference Code
A lightweight text-to-speech model with zero-shot voice cloning
StreamSpeech is a seamless model for offline speech recognition