Toolkit for conversational AI
Tokenizer-Free TTS for Multilingual Speech Generation
Controllable and fast Text-to-Speech for over 7000 languages
Qwen3-TTS is an open-source series of TTS models
Speech Note Linux app. Note taking, reading and translating
Towards Human-Sounding Speech
Generate audiobooks from EPUBs, PDFs and text with captions
End-to-end speech processing toolkit
High-Quality Voice Cloning TTS for 600+ Languages
Bailing is a voice dialogue robot similar to GPT-4o
Readest is a modern, feature-rich ebook reader
Spark-TTS Inference Code
Long-form streaming TTS system for multi-speaker dialogue generation
Controllable & emotion-expressive zero-shot TTS
Lightning-fast, on-device TTS, running natively via ONNX
Build Vision Agents quickly with any model or video provider
Offline desktop app to convert EPUB to MP3 using Kokoro-82M neural TTS
Towards Human-Level Text-to-Speech through Style Diffusion
Amica is an open source interface for interactive communication
Chinese voice dialogue robot/smart speaker project
Pre-trained and Reproduced Deep Learning Models
The open-source virtual assistant for Ubuntu based Linux distributions
This project includes basic NLP and DSP techniques for Text-to-Speech