Generate audiobooks from EPUBs, PDFs and text with captions
Comprehensive Gradio WebUI for audio processing
A sound cloning tool with a web interface, using your voice
Use Microsoft Edge's online text-to-speech service from Python
Towards Human-Sounding Speech
Build Vision Agents quickly with any model or video provider
Controllable and fast Text-to-Speech for over 7000 languages
A Conversational Speech Generation Model