A simple, high-quality voice conversion tool focused on ease of use
Generate audiobooks from e-books, voice cloning & 1107+ languages
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Speech-AI-Forge is a project developed around TTS generation model
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
Build Vision Agents quickly with any model or video provider
A fast TTS architecture with conditional flow matching
Interface for OuteTTS models
Virtual AI anchor that combines state-of-the-art technology
Towards Human-Level Text-to-Speech through Style Diffusion
Multi-Voice and Prompt-Controlled TTS Engine
Chinese voice dialogue robot/smart speaker project
A webui for different audio related Neural Networks
WaveRNN Vocoder + TTS
Clone a voice in 5 seconds to generate arbitrary speech in real-time
General Speech Restoration
Real-Time State-of-the-art Speech Synthesis for Tensorflow 2
Implementation of a Transformer based neural network
Toolkit for efficient experimentation with Speech Recognition