Miso TTS is an 8 billion, highly emotive text-to-speech model
A high-quality rapid TTS voice cloning model
State-of-the-art TTS model under 25MB
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Free, high-quality text-to-speech API endpoint to replace OpenAI
Controllable & emotion-expressive zero-shot TTS
Bailing is a voice dialogue robot similar to GPT-4o
Spark-TTS Inference Code
MOSS-TTS-Nano is an open-source multilingual tiny speech generation
Automatic Speech Recognition with Word-level Timestamps
Open source machine learning framework to automate text conversations
Self-host the powerful Chatterbox TTS model
Open source AI VTuber platform with voice chat and Live2D avatars
Generate audiobooks from e-books
A simple native web interface that uses ChatTTS to synthesize text
Fast multimodal LLM for real-time voice interaction and AI apps
Official MiniMax Model Context Protocol (MCP) server
Official PyTorch Implementation
One-click deployment (including offline integration package)
Offline Text To Speech synthesis for python
A text-to-speech, speech-to-text and speech-to-speech library
Framework for building realtime multimodal voice AI agents apps
TTS with kokoro and onnx runtime
MARS5 speech model (TTS) from CAMB.AI
TTS model capable of streaming conversational audio in realtime