Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Multi-lingual large voice generation model, providing inference
Curated collection of Amazing Python scripts
In-App assistant SDK to build a multimodal conversational UX websites
Large Audio Language Model built for natural interactions
Long-form streaming TTS system for multi-speaker dialogue generation
A simple, high-quality voice conversion tool focused on ease of use
Realtime AI Voice Agents with SoTA Multimodal AI models on Arduino ESP
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
TTS with kokoro and onnx runtime
A text-to-speech, speech-to-text and speech-to-speech library
Conversational voice AI agents
Production ready toolkit to run AI locally
Assistant SDK to build a multimodal conversational UX for Android
Real-time voice interactive digital human
In-App assistant SDK to build a multimodal conversational UX for iOS
A simple native web interface that uses ChatTTS to synthesize text
Adds support for Yandex Smart Home (Alice voice assistant)
A robust, efficient, low-latency speech-to-text library
Open Source Speech Language Model
Focus on prompting and generating
Component library and custom registry built on top of shadcn/ui
Build your own AI friend
Generate audiobooks from e-books, voice cloning & 1107+ languages