TTS with Kokoro and ONNX Runtime
TTS model capable of streaming conversational audio in realtime
Open-source multi-speaker long-form text-to-speech model
1 minute of voice data can also be used to train a good TTS model
State-of-the-art TTS model under 25MB
SoTA open-source TTS
Foundational model for human-like, expressive TTS
Multi-lingual large voice generation model, providing inference
MOSS-TTS Family open-source speech and sound generation model
Instant voice cloning by MIT and MyShell. Audio foundation model
MARS5 speech model (TTS) from CAMB.AI
A generative speech model for daily dialogue
GLM-4-Voice | End-to-End Chinese-English Conversational Model
MOSS-TTS-Nano is an open-source multilingual tiny speech generation
LLM-based Reinforcement Learning audio edit model
A TTS model capable of generating ultra-realistic dialogue
Miso TTS is an 8-billion-parameter, highly emotive text-to-speech model
A lightweight text-to-speech model with zero-shot voice cloning
A nearly-live implementation of OpenAI's Whisper
Generate audiobooks from e-books
Interface for OuteTTS models
An open-source text-to-speech system built by inverting Whisper
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Speech-AI-Forge is a project developed around TTS generation models
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model