Real-time voice interactive digital human
A high-quality rapid TTS voice cloning model
State-of-the-art TTS model under 25MB
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Free, high-quality text-to-speech API endpoint to replace OpenAI
PersonaPlex code
Controllable & emotion-expressive zero-shot TTS
Bailing is a voice dialogue robot similar to GPT-4o
MOSS-TTS-Nano is an open-source multilingual tiny speech generation
Automatic Speech Recognition with Word-level Timestamps
Spark-TTS Inference Code
Speakr is a personal, self-hosted web application
Open source machine learning framework to automate text conversations
Self-host the powerful Chatterbox TTS model
Generate audiobooks from e-books
Open source AI VTuber platform with voice chat and Live2D avatars
A simple native web interface that uses ChatTTS to synthesize text
Fast multimodal LLM for real-time voice interaction and AI apps
Translate the video from one language to another and embed dubbing
Official MiniMax Model Context Protocol (MCP) server
One-click deployment (including offline integration package)
Offline Text To Speech synthesis for python
Official PyTorch Implementation
A text-to-speech, speech-to-text and speech-to-speech library
Framework for building realtime multimodal voice AI agents apps