Comprehensive Gradio WebUI for audio processing
Toolkit for conversational AI
Generate audiobooks from EPUBs, PDFs and text with captions
End-to-end speech processing toolkit
Use Microsoft Edge's online text-to-speech service from Python
Controllable and fast Text-to-Speech for over 7000 languages
Towards Human-Sounding Speech
A sound cloning tool with a web interface, using your voice
Build Vision Agents quickly with any model or video provider
Offline desktop app to convert EPUB to MP3 using Kokoro-82M neural TTS
A Conversational Speech Generation Model
The open-source virtual assistant for Ubuntu based Linux distributions