Automatically translates the text of a video based on a subtitle file
A sound cloning tool with a web interface, using your voice
SOTA Open Source TTS
Scalable generative AI framework built for researchers and developers
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
StreamSpeech is a seamless model for offline speech recognition
Spark-TTS Inference Code
A generative speech model for daily dialogue
Long-form streaming TTS system for multi-speaker dialogue generation
Virtual AI anchor that combines state-of-the-art technology
Build Vision Agents quickly with any model or video provider
Official MiniMax Model Context Protocol (MCP) server
An Open Source text-to-speech system built by inverting Whisper
Towards Human-Sounding Speech
Free, high-quality text-to-speech API endpoint to replace OpenAI
Management of Yandex Station and other smart home devices
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Interface for OuteTTS models
Converts text to speech in real time
Toolkit for conversational AI
Controllable and fast text-to-speech for over 7,000 languages
Towards Human-Level Text-to-Speech through Style Diffusion
Industrial-level controllable zero-shot text-to-speech system
Real-time voice interactive digital human