TTS with kokoro and onnx runtime
Multi-lingual large voice generation model, providing inference
Management of Yandex Station and other smart home devices
A simple, high-quality voice conversion tool focused on ease of use
Qwen3-TTS is an open-source series of TTS models
Readest is a modern, feature-rich ebook reader
A generative speech model for daily dialogue
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Speech Note Linux app. Note taking, reading and translating
A nearly-live implementation of OpenAI's Whisper
Generate audiobooks from EPUBs, PDFs and text with captions
One-click deployment (including offline integration package)
State-of-the-art TTS model under 25MB
A single Gradio + React WebUI with extensions for ACE-Step
A high-quality rapid TTS voice cloning model
Speech-AI-Forge is a project developed around TTS generation model
A TTS that fits in your CPU (and pocket)
Build Vision Agents quickly with any model or video provider
A fast TTS architecture with conditional flow matching
Official MiniMax Model Context Protocol (MCP) server
Workflow and speech recognition app
Like the macOS say command, but with a modern voice
StreamSpeech is a seamless model for offline speech recognition
Controllable and fast Text-to-Speech for over 7000 languages