A sound cloning tool with a web interface, using your voice
A simple native web interface that uses ChatTTS to synthesize text
Towards Human-Sounding Speech
End-to-end speech processing toolkit
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
A TTS model capable of generating ultra-realistic dialogue
Converts text to speech in realtime
Official MiniMax Model Context Protocol (MCP) server
One-click deployment (including offline integration package)
Virtual AI anchor that combines state-of-the-art technology
Real-time voice interactive digital human
Generate audiobooks from e-books
Bailing is a voice dialogue robot similar to GPT-4o
SOTA discrete acoustic codec models with 40/75 tokens per second
Free, high-quality text-to-speech API endpoint to replace OpenAI
Automatically translates the text of a video based on a subtitle file
Towards Human-Level Text-to-Speech through Style Diffusion
Mice speech to text with MX Cinnamon OS ISO
VITS2 backbone with multilingual-bert
Multi-Voice and Prompt-Controlled TTS Engine
Best practice TTS based on BERT and VITS
A webui for different audio related Neural Networks
Chinese voice dialogue robot/smart speaker project
Singing Voice Synthesis via Shallow Diffusion Mechanism