Toolkit for conversational AI
Speech Note Linux app. Note taking, reading and translating
Comprehensive Gradio WebUI for audio processing
End-to-end speech processing toolkit
Readest is a modern, feature-rich ebook reader
Generate audiobooks from EPUBs, PDFs and text with captions
Use Microsoft Edge's online text-to-speech service from Python
Video translation and dubbing tool powered by LLMs
A sound cloning tool with a web interface, using your voice
Lightning-fast, on-device TTS, running natively via ONNX
Towards Human-Sounding Speech
Build Vision Agents quickly with any model or video provider
Controllable and fast Text-to-Speech for over 7000 languages
The python library for real-time communication
Offline desktop app to convert EPUB to MP3 using Kokoro-82M neural TTS
A Conversational Speech Generation Model
Chinese text-to-speech engine
Text-to-Speech for Basque and Spanish
The open-source virtual assistant for Ubuntu based Linux distributions
PHP SDK for processing phone calls and SMS through the VoiceShot API.
.NET SDK for processing phone calls and SMS through the VoiceShot API.
ASP SDK for processing phone calls and SMS through the VoiceShot API.
Text-to-Speech TTS for Basque, Spanish, Catalan, Galician and English
Process large speech data wrt transcription, labeling and annotation
This project includes basic NLP and DSP techniques for Text-to-Speech