Toolkit for conversational AI
The official Python SDK for the ElevenLabs API
A generative speech model for daily dialogue
High-quality multi-lingual text-to-speech library by MyShell.ai
State-of-the-art TTS model under 25MB
SoTA open-source TTS
Comprehensive Gradio WebUI for audio processing
Offline Text To Speech synthesis for python
Use Microsoft Edge's online text-to-speech service from Python
Instant voice cloning by MIT and MyShell. Audio foundation model
TTS with kokoro and onnx runtime
Python library and CLI tool to interface with Google Translate
EPUB to audiobook converter, optimized for Audiobookshelf
Multi-Voice and Prompt-Controlled TTS Engine
Industrial-level controllable zero-shot text-to-speech system
A text-to-speech, speech-to-text and speech-to-speech library
Generate audiobooks from EPUBs, PDFs and text with captions
A nearly-live implementation of OpenAI's Whisper
Offline inference engine for art, real-time voice conversations
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Generate audiobooks from e-books, voice cloning & 1107+ languages
A simple, high-quality voice conversion tool focused on ease of use
Synchronized Translation for Videos
Multi-lingual large voice generation model, providing inference
Scalable generative AI framework built for researchers and developers