Toolkit for conversational AI
Tokenizer-Free TTS for Multilingual Speech Generation
Controllable and fast Text-to-Speech for over 7000 languages
Qwen3-TTS is an open-source series of TTS models
Generate audiobooks from EPUBs, PDFs and text with captions
Towards Human-Sounding Speech
High-Quality Voice Cloning TTS for 600+ Languages
Bailing is a voice dialogue robot similar to GPT-4o
End-to-end speech processing toolkit
Spark-TTS Inference Code
Long-form streaming TTS system for multi-speaker dialogue generation
Controllable & emotion-expressive zero-shot TTS
Build Vision Agents quickly with any model or video provider
Offline desktop app to convert EPUB to MP3 using Kokoro-82M neural TTS
Towards Human-Level Text-to-Speech through Style Diffusion
Best practice TTS based on BERT and VITS
Chinese voice dialogue robot/smart speaker project
Pre-trained and Reproduced Deep Learning Models
The open-source virtual assistant for Ubuntu based Linux distributions