Comprehensive Gradio WebUI for audio processing
Virtual AI anchor that combines state-of-the-art technology
The open-source voice synthesis studio powered by Qwen3-TTS
Toolkit for conversational AI
Video translation and dubbing tool powered by LLMs
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Offline inference engine for art, real-time voice conversations
Spark-TTS Inference Code
Foundational model for human-like, expressive TTS
Framework for building neural networks
High-quality multi-lingual text-to-speech library by MyShell.ai
Offline desktop app to convert EPUB to MP3 using Kokoro-82M neural TTS
Multi-Voice and Prompt-Controlled TTS Engine
Tool to remotely activate Text-To-Speech (TTS) on a server
A webui for different audio related Neural Networks
The deep learning toolkit for speech-to-text
General Speech Restoration
Deep learning for text to speech
Pre-trained and Reproduced Deep Learning Models
Toolkit for efficient experimentation with Speech Recognition
This project includes basic NLP and DSP techniques for Text-to-Speech