EPUB to audiobook converter, optimized for Audiobookshelf
A simple, high-quality voice conversion tool focused on ease of use
TTS with Kokoro and ONNX Runtime
Offline inference engine for art, real-time voice conversations
A nearly-live implementation of OpenAI's Whisper
Open source text-to-speech tool, supports extra-long text
Official MiniMax Model Context Protocol (MCP) server
Scalable generative AI framework built for researchers and developers
Controllable and fast Text-to-Speech for over 7000 languages
A single Gradio + React WebUI with extensions for ACE-Step
Framework for building neural networks
A fast TTS architecture with conditional flow matching
Interface for OuteTTS models
Toolkit for audio, music, and speech generation
Towards Human-Level Text-to-Speech through Style Diffusion
VITS2 backbone with multilingual BERT
Chinese text-to-speech engine
A web UI for various audio-related neural networks
Tool that can record speech synthesis
Free and open source text-to-speech software
Generative Adversarial Networks for Efficient and High Fidelity Speech
Toolkit for efficient experimentation with Speech Recognition