Bailing is a voice dialogue robot similar to GPT-4o
Interface for OuteTTS models
Multi-lingual large voice generation model, providing inference
A single Gradio + React WebUI with extensions for ACE-Step
Scalable generative AI framework built for researchers and developers
End-to-end speech processing toolkit
A TTS model capable of generating ultra-realistic dialogue
One-click deployment (including offline integration package)
Provides CTP stock options and Zhongtai Securities XTP
MARS5 speech model (TTS) from CAMB.AI
Framework for building neural networks
SOTA discrete acoustic codec models with 40/75 tokens per second
Virtual AI anchor that combines state-of-the-art technology
High-quality multi-lingual text-to-speech library by MyShell.ai
Towards Human-Level Text-to-Speech through Style Diffusion
Mice speech to text with MX Cinnamon OS ISO
A Conversational Speech Generation Model
Toolkit for audio, music, and speech generation
Offline desktop app to convert EPUB to MP3 using Kokoro-82M neural TTS
VITS2 backbone with multilingual-bert
Multi-Voice and Prompt-Controlled TTS Engine
Open source implementation of Microsoft's VALL-E X zero-shot TTS model
Unofficial Parallel WaveGAN
Chinese text-to-speech engine