TTS with kokoro and onnx runtime
State-of-the-art TTS model under 25MB
Foundational model for human-like, expressive TTS
StreamSpeech is a seamless model for offline speech recognition
A TTS model capable of generating ultra-realistic dialogue
Multi-lingual large voice generation model, providing inference
MARS5 speech model (TTS) from CAMB.AI
A generative speech model for daily dialogue
A lightweight text-to-speech model with zero-shot voice cloning
Instant voice cloning by MIT and MyShell. Audio foundation model
A high-quality rapid TTS voice cloning model
A nearly-live implementation of OpenAI's Whisper
A sound cloning tool with a web interface, using your voice
Speech-AI-Forge is a project developed around TTS generation model
Interface for OuteTTS models
Generate audiobooks from e-books
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
Qwen3-TTS is an open-source series of TTS models
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
One-click deployment (including offline integration package)
Framework for building neural networks
Controllable & emotion-expressive zero-shot TTS
A text-to-speech, speech-to-text and speech-to-speech library
A simple native web interface that uses ChatTTS to synthesize text
An Open Source text-to-speech system built by inverting Whisper