Curated collection of Amazing Python scripts
In-App assistant SDK to build a multimodal conversational UX websites
Large Audio Language Model built for natural interactions
Multi-lingual large voice generation model, providing inference
Long-form streaming TTS system for multi-speaker dialogue generation
A simple, high-quality voice conversion tool focused on ease of use
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
Realtime AI Voice Agents with SoTA Multimodal AI models on Arduino ESP
A text-to-speech, speech-to-text and speech-to-speech library
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
TTS with kokoro and onnx runtime
Production ready toolkit to run AI locally
Conversational voice AI agents
Real-time voice interactive digital human
Adds support for Yandex Smart Home (Alice voice assistant)
A robust, efficient, low-latency speech-to-text library
A simple native web interface that uses ChatTTS to synthesize text
Focus on prompting and generating
Open Source Speech Language Model
AI tool for automatic batch short video creation and editing
Component library and custom registry built on top of shadcn/ui
Build your own AI friend
A lightweight text-to-speech model with zero-shot voice cloning
Generate audiobooks from e-books, voice cloning & 1107+ languages
On-device Speech-to-Intent engine powered by deep learning