A text-to-speech, speech-to-text and speech-to-speech library
The open-source voice synthesis studio powered by Qwen3-TTS
Code for openai.fm, a demo for the OpenAI Speech API
Generate audiobooks from EPUBs, PDFs and text with captions
Generate audiobooks from e-books, voice cloning & 1107+ languages
A nearly-live implementation of OpenAI's Whisper
Video translation and dubbing tool powered by LLMs
The python library for real-time communication
Synchronized Translation for Videos
SOTA discrete acoustic codec models with 40/75 tokens per second
Comprehensive Gradio WebUI for audio processing
SOTA Open Source TTS
Instant voice cloning by MIT and MyShell. Audio foundation model
The official Python SDK for the ElevenLabs API
Open source text-to-speech tool, supports extra-long text
Free, high-quality text-to-speech API endpoint to replace OpenAI
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Offline Text To Speech synthesis for python
Interface for OuteTTS models
Workflow and speech recognition app
Converts text to speech in realtime
Tokenizer-Free TTS for Multilingual Speech Generation
A single Gradio + React WebUI with extensions for ACE-Step
One-click deployment (including offline integration package)
Use Microsoft Edge's online text-to-speech service from Python