EPUB to audiobook converter, optimized for Audiobookshelf
TTS with kokoro and onnx runtime
A simple, high-quality voice conversion tool focused on ease of use
Offline inference engine for art, real-time voice conversations
Scalable generative AI framework built for researchers and developers
Official MiniMax Model Context Protocol (MCP) server
A fast TTS architecture with conditional flow matching
A nearly-live implementation of OpenAI's Whisper
Interface for OuteTTS models
Framework for building neural networks
Controllable and fast Text-to-Speech for over 7000 languages
Toolkit for audio, music, and speech generation
Towards Human-Level Text-to-Speech through Style Diffusion
VITS2 backbone with multilingual-bert
A webui for different audio related Neural Networks
Generative Adversarial Networks for Efficient and High Fidelity Speech
Toolkit for efficient experimentation with Speech Recognition