Toolkit for conversational AI
End-to-end speech processing toolkit
Comprehensive Gradio WebUI for audio processing
Readest is a modern, feature-rich ebook reader
Speech Note Linux app. Note taking, reading and translating
Use Microsoft Edge's online text-to-speech service from Python
Build Vision Agents quickly with any model or video provider
Generate audiobooks from EPUBs, PDFs and text with captions
A sound cloning tool with a web interface, using your voice
Video translation and dubbing tool powered by LLMs
Towards Human-Sounding Speech
Lightning-fast, on-device TTS, running natively via ONNX
Controllable and fast Text-to-Speech for over 7000 languages
The python library for real-time communication
A Conversational Speech Generation Model
Chinese text-to-speech engine
Text-to-Speech for Basque and Spanish
The open-source virtual assistant for Ubuntu based Linux distributions
PHP SDK for processing phone calls and SMS through the VoiceShot API.
.NET SDK for processing phone calls and SMS through the VoiceShot API.
ASP SDK for processing phone calls and SMS through the VoiceShot API.
Text-to-Speech TTS for Basque, Spanish, Catalan, Galician and English
Process large speech data wrt transcription, labeling and annotation
This project includes basic NLP and DSP techniques for Text-to-Speech