Build Vision Agents quickly with any model or video provider
Cross-platform software for text translation and recognition
Code for openai.fm, a demo for the OpenAI Speech API
Lightning-fast, on-device TTS, running natively via ONNX
Workflow and speech recognition app
The Python library for real-time communication
The open-source voice synthesis studio powered by Qwen3-TTS
A simple, high-quality voice conversion tool focused on ease of use
TTS with Kokoro and ONNX Runtime
Synchronized Translation for Videos
Generate audiobooks from e-books, voice cloning & 1107+ languages
A high-quality rapid TTS voice cloning model
Qwen3-TTS is an open-source series of TTS models
State-of-the-art TTS model under 25MB
High-quality multilingual text-to-speech library by MyShell.ai
SOTA Open Source TTS
EPUB to audiobook converter, optimized for Audiobookshelf
Interface for OuteTTS models
Instant voice cloning by MIT and MyShell. Audio foundation model
Comprehensive Gradio WebUI for audio processing
Readest is a modern, feature-rich ebook reader
Generate audiobooks from EPUBs, PDFs and text with captions
A sound cloning tool with a web interface, using your voice
Use Microsoft Edge's online text-to-speech service from Python
Generate audiobooks from e-books