A lightweight text-to-speech model with zero-shot voice cloning
Generate audiobooks from e-books
Scalable generative AI framework built for researchers and developers
SOTA Open Source TTS
A generative speech model for daily dialogue
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Long-form streaming TTS system for multi-speaker dialogue generation
StreamSpeech is a seamless model for offline speech recognition
Towards Human-Sounding Speech
Spark-TTS Inference Code
Virtual AI anchor that combines state-of-the-art technology
Official MiniMax Model Context Protocol (MCP) server
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
Industrial-level controllable zero-shot text-to-speech system
Free, high-quality text-to-speech API endpoint to replace OpenAI
Build Vision Agents quickly with any model or video provider
Speech-AI-Forge is a project developed around TTS generation model
Management of Yandex Station and other smart home devices
An Open Source text-to-speech system built by inverting Whisper
Interface for OuteTTS models
Converts text to speech in realtime
Toolkit for conversational AI
Real-time voice interactive digital human
Controllable and fast Text-to-Speech for over 7000 languages
Towards Human-Level Text-to-Speech through Style Diffusion