Build Vision Agents quickly with any model or video provider
Python library and CLI tool to interface with Google Translate
A simple, high-quality voice conversion tool focused on ease of use
The open-source voice synthesis studio powered by Qwen3-TTS
TTS with kokoro and onnx runtime
A cross-platform software for text translation and recognition
Qwen3-TTS is an open-source series of TTS models
Readest is a modern, feature-rich ebook reader
Comprehensive Gradio WebUI for audio processing
Generate audiobooks from e-books, voice cloning & 1107+ languages
SOTA Open Source TTS
Use Microsoft Edge's online text-to-speech service from Python
Instant voice cloning by MIT and MyShell. Audio foundation model
Offline Text To Speech synthesis for python
State-of-the-art TTS model under 25MB
Synchronized Translation for Videos
Video translation and dubbing tool powered by LLMs
Code for openai.fm, a demo for the OpenAI Speech API
Cross-platform AI language practice app
Speech Note Linux app. Note taking, reading and translating
Generate audiobooks from EPUBs, PDFs and text with captions
A generative speech model for daily dialogue
A TTS that fits in your CPU (and pocket)
Virtual AI anchor that combines state-of-the-art technology
A fast TTS architecture with conditional flow matching