A single Gradio + React WebUI with extensions for ACE-Step
Offline Text To Speech synthesis for python
Offline inference engine for art, real-time voice conversations
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Industrial-level controllable zero-shot text-to-speech system
Scalable generative AI framework built for researchers and developers
Generate audiobooks from EPUBs, PDFs and text with captions
Official MiniMax Model Context Protocol (MCP) server
Provides CTP stock options and Zhongtai Securities XTP
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
VITS2 backbone with multilingual-bert
The python library for real-time communication
Free, high-quality text-to-speech API endpoint to replace OpenAI
Reading book source
Controllable and fast Text-to-Speech for over 7000 languages
Converts text to speech in realtime
A fast TTS architecture with conditional flow matching
Framework for building neural networks
End-to-end speech processing toolkit
Multi-Voice and Prompt-Controlled TTS Engine
A Conversational Speech Generation Model
A webui for different audio related Neural Networks
WaveRNN Vocoder + TTS
Clone a voice in 5 seconds to generate arbitrary speech in real-time
General Speech Restoration