Comprehensive Gradio WebUI for audio processing
Toolkit for conversational AI
Readest is a modern, feature-rich ebook reader
Generate audiobooks from EPUBs, PDFs and text with captions
End-to-end speech processing toolkit
Use Microsoft Edge's online text-to-speech service from Python
Lightning-fast, on-device TTS, running natively via ONNX
Video translation and dubbing tool powered by LLMs
Controllable and fast Text-to-Speech for over 7000 languages
The python library for real-time communication
Towards Human-Sounding Speech
A sound cloning tool with a web interface, using your voice
Build Vision Agents quickly with any model or video provider
A Conversational Speech Generation Model
Chinese text-to-speech engine
Text-to-Speech for Basque and Spanish
PHP SDK for processing phone calls and SMS through the VoiceShot API.
.NET SDK for processing phone calls and SMS through the VoiceShot API.
ASP SDK for processing phone calls and SMS through the VoiceShot API.
Text-to-Speech TTS for Basque, Spanish, Catalan, Galician and English
Process large speech data wrt transcription, labeling and annotation
This project includes basic NLP and DSP techniques for Text-to-Speech