A simple, high-quality voice conversion tool focused on ease of use
TTS with Kokoro and ONNX Runtime
Management of Yandex Station and other smart home devices
Multi-lingual large voice generation model, providing inference
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
State-of-the-art TTS model under 25MB
A generative speech model for daily dialogue
A high-quality rapid TTS voice cloning model
Qwen3-TTS is an open-source series of TTS models
Generate audiobooks from EPUBs, PDFs and text with captions
A nearly live implementation of OpenAI's Whisper
One-click deployment (including offline integration package)
A TTS that fits in your CPU (and pocket)
Speech-AI-Forge is a project developed around TTS generation models
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Virtual AI anchor that combines state-of-the-art technology
Build Vision Agents quickly with any model or video provider
StreamSpeech is a seamless model for offline speech recognition
A fast TTS architecture with conditional flow matching
Controllable and fast text-to-speech for over 7000 languages
Interface for OuteTTS models
MARS5 speech model (TTS) from CAMB.AI
Official MiniMax Model Context Protocol (MCP) server
High-quality multi-lingual text-to-speech library by MyShell.ai
Toolkit for audio, music, and speech generation