Toolkit for conversational AI
End-to-end speech processing toolkit
Comprehensive Gradio WebUI for audio processing
Use Microsoft Edge's online text-to-speech service from Python
A sound cloning tool with a web interface, using your voice
Generate audiobooks from EPUBs, PDFs and text with captions
Build Vision Agents quickly with any model or video provider
Towards Human-Sounding Speech
Controllable and fast Text-to-Speech for over 7000 languages
A Conversational Speech Generation Model
The open-source virtual assistant for Ubuntu based Linux distributions