A lightweight text-to-speech model with zero-shot voice cloning
The python library for real-time communication
Converts text to speech in realtime
Industrial-level controllable zero-shot text-to-speech system
Official MiniMax Model Context Protocol (MCP) server
Synchronized Translation for Videos
Generate audiobooks from EPUBs, PDFs and text with captions
Toolkit for conversational AI
Multi-lingual large voice generation model, providing inference
A text-to-speech, speech-to-text and speech-to-speech library
Lightning-fast, on-device TTS, running natively via ONNX
Scalable generative AI framework built for researchers and developers
Interface for OuteTTS models
A sound cloning tool with a web interface, using your voice
A fast TTS architecture with conditional flow matching
Free, high-quality text-to-speech API endpoint to replace OpenAI
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Speech-AI-Forge is a project developed around TTS generation model
Foundational model for human-like, expressive TTS
Provides CTP stock options and Zhongtai Securities XTP
Build Vision Agents quickly with any model or video provider
Real-time voice interactive digital human
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
SOTA discrete acoustic codec models with 40/75 tokens per second
A single Gradio + React WebUI with extensions for ACE-Step