Comprehensive Gradio WebUI for audio processing
Virtual AI anchor that combines state-of-the-art technology
Toolkit for conversational AI
Offline inference engine for art, real-time voice conversations
Foundational model for human-like, expressive TTS
Spark-TTS Inference Code
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Framework for building neural networks
High-quality multi-lingual text-to-speech library by MyShell.ai
Offline desktop app to convert EPUB to MP3 using Kokoro-82M neural TTS
Multi-Voice and Prompt-Controlled TTS Engine
A webui for different audio related Neural Networks
General Speech Restoration
Pre-trained and Reproduced Deep Learning Models
The open-source virtual assistant for Ubuntu based Linux distributions
Toolkit for efficient experimentation with Speech Recognition