TTS with Kokoro and ONNX Runtime
One minute of voice data can also be used to train a good TTS model
SoTA open-source TTS
SOTA Open Source TTS
Instant voice cloning by MIT and MyShell. Audio foundation model
Qwen3-TTS is an open-source series of TTS models
High-Quality Voice Cloning TTS for 600+ Languages
Tokenizer-Free TTS for Multilingual Speech Generation
A generative speech model for daily dialogue
A nearly-live implementation of OpenAI's Whisper
Open-source multi-speaker long-form text-to-speech model
Generate audiobooks from e-books
State-of-the-art TTS model under 25MB
MOSS-TTS-Nano is an open-source, multilingual, tiny speech generation model
Industrial-level controllable zero-shot text-to-speech system
A TTS that fits in your CPU (and pocket)
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Python library and CLI tool to interface with Google Translate
TTS model capable of streaming conversational audio in realtime
The official Python library for the Fish Audio API
MOSS-TTS Family: open-source speech and sound generation models
Miso TTS is an 8-billion-parameter, highly emotive text-to-speech model
A text-to-speech, speech-to-text and speech-to-speech library
Multi-lingual large voice generation model, providing inference
Foundational model for human-like, expressive TTS