Scalable generative AI framework built for researchers and developers
Toolkit for conversational AI
A generative speech model for daily dialogue
Toolkit for audio, music, and speech generation
Towards Human-Sounding Speech
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Controllable & emotion-expressive zero-shot TTS
A fast TTS architecture with conditional flow matching
Official MiniMax Model Context Protocol (MCP) server
A webui for different audio related Neural Networks
Generative Adversarial Networks for Efficient and High Fidelity Speech