The official Python SDK for the ElevenLabs API
Use Microsoft Edge's online text-to-speech service from Python
EPUB to audiobook converter, optimized for Audiobookshelf
Comprehensive Gradio WebUI for audio processing
State-of-the-art TTS model under 25MB
TTS with Kokoro and ONNX Runtime
Offline text-to-speech synthesis for Python
A generative speech model for daily dialogue
Offline inference engine for art, real-time voice conversations
Python library and CLI tool to interface with Google Translate
Synchronized Translation for Videos
A text-to-speech, speech-to-text and speech-to-speech library
SoTA open-source TTS
Generate audiobooks from e-books, voice cloning & 1107+ languages
Industrial-level controllable zero-shot text-to-speech system
Automatically translates the text of a video based on a subtitle file
High-quality multilingual text-to-speech library by MyShell.ai
Towards Human-Sounding Speech
A simple, high-quality voice conversion tool focused on ease of use
Instant voice cloning by MIT and MyShell. Audio foundation model
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Scalable generative AI framework built for researchers and developers
Build Vision Agents quickly with any model or video provider
Official MiniMax Model Context Protocol (MCP) server
A nearly-live implementation of OpenAI's Whisper