TTS with Kokoro and ONNX Runtime
A simple, high-quality voice conversion tool focused on ease of use
Multi-lingual large voice generation model, providing inference
Readest is a modern, feature-rich ebook reader
Qwen3-TTS is an open-source series of TTS models
Speech Note Linux app. Note taking, reading and translating
Generate audiobooks from EPUBs, PDFs and text with captions
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
A nearly-live implementation of OpenAI's Whisper
A generative speech model for daily dialogue
One-click deployment (including offline integration package)
State-of-the-art TTS model under 25MB
A single Gradio + React WebUI with extensions for ACE-Step
A TTS that fits in your CPU (and pocket)
A high-quality rapid TTS voice cloning model
Speech-AI-Forge is a project developed around TTS generation models
Build Vision Agents quickly with any model or video provider
A fast TTS architecture with conditional flow matching
Workflow and speech recognition app
Like the macOS say command, but with a modern voice
Official MiniMax Model Context Protocol (MCP) server
StreamSpeech is a seamless model for offline speech recognition
Controllable and fast Text-to-Speech for over 7000 languages
The Python library for real-time communication