Toolkit for conversational AI
High-quality multi-lingual text-to-speech library by MyShell.ai
A fast, local neural text-to-speech system
State-of-the-art TTS model under 25MB
A generative speech model for daily dialogue
Comprehensive Gradio WebUI for audio processing
Speech-to-text-to-speech tool that sends text as OSC messages
Instant voice cloning by MIT and MyShell. Audio foundation model
Industrial-level controllable zero-shot text-to-speech system
Generate audiobooks from EPUBs, PDFs and text with captions
Synchronized Translation for Videos
Readest is a modern, feature-rich ebook reader
A simple, high-quality voice conversion tool focused on ease of use
Offline text-to-speech synthesis for Python
A cross-platform software for text translation and recognition
Toolkit for audio, music, and speech generation
Multi-lingual large voice generation model, providing inference
Cross-platform AI language practice app
Controllable and fast Text-to-Speech for over 7000 languages
One-click deployment (including offline integration package)
Python library and CLI tool to interface with Google Translate
A single Gradio + React WebUI with extensions for ACE-Step
The official Python SDK for the ElevenLabs API
A text-to-speech, speech-to-text and speech-to-speech library
A nearly-live implementation of OpenAI's Whisper