Instant voice cloning by MIT and MyShell. Audio foundation model
A simple, high-quality voice conversion tool focused on ease of use
Synchronized Translation for Videos
The open-source voice synthesis studio powered by Qwen3-TTS
Use Microsoft Edge's online text-to-speech service from Python
Generate audiobooks from e-books, voice cloning & 1107+ languages
Speech Note Linux app. Note taking, reading and translating
Industrial-level controllable zero-shot text-to-speech system
MARS5 speech model (TTS) from CAMB.AI
Video translation and dubbing tool powered by LLMs
A TTS that fits in your CPU (and pocket)
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
Python library and CLI tool to interface with Google Translate
Bailing is a voice dialogue robot similar to GPT-4o
Converts text to speech in realtime
Virtual AI anchor that combines state-of-the-art technology
A simple native web interface that uses ChatTTS to synthesize text
A cross-platform software for text translation and recognition
A single Gradio + React WebUI with extensions for ACE-Step
A nearly-live implementation of OpenAI's Whisper
Automatically translates the text of a video based on a subtitle file
Speech-AI-Forge is a project developed around TTS generation model
Official MiniMax Model Context Protocol (MCP) server
Cross-platform AI language practice app
Framework for building neural networks