Comprehensive Gradio WebUI for audio processing
The open-source voice synthesis studio powered by Qwen3-TTS
Toolkit for conversational AI
Spark-TTS Inference Code
Video translation and dubbing tool powered by LLMs
Offline inference engine for art, real-time voice conversations
Foundational model for human-like, expressive TTS
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Framework for building neural networks
Virtual AI anchor that combines state-of-the-art technology
High-quality multi-lingual text-to-speech library by MyShell.ai
Multi-Voice and Prompt-Controlled TTS Engine
A WebUI for different audio-related neural networks
The deep learning toolkit for speech-to-text
General Speech Restoration
Deep learning for text-to-speech
Pre-trained and Reproduced Deep Learning Models
Toolkit for efficient experimentation with Speech Recognition
This project includes basic NLP and DSP techniques for Text-to-Speech