Open-source framework for intelligent speech interaction
Spark-TTS Inference Code
Curated collection of Amazing Python scripts
In-App assistant SDK to build a multimodal conversational UX for websites
Large Audio Language Model built for natural interactions
Multi-lingual large voice generation model, providing inference
Long-form streaming TTS system for multi-speaker dialogue generation
Open speech-to-speech models and pipelines by Hugging Face
A simple, high-quality voice conversion tool focused on ease of use
Real-time AI voice agents with SoTA multimodal AI models on Arduino ESP
A text-to-speech, speech-to-text and speech-to-speech library
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
TTS with Kokoro and ONNX Runtime
Production-ready toolkit to run AI locally
Conversational voice AI agents
Assistant SDK to build a multimodal conversational UX for Android
Real-time voice interactive digital human
In-App assistant SDK to build a multimodal conversational UX for iOS
Adds support for Yandex Smart Home (Alice voice assistant)
A simple native web interface that uses ChatTTS to synthesize speech from text
A robust, efficient, low-latency speech-to-text library
Focus on prompting and generating
Open Source Speech Language Model
AI tool for automatic batch short video creation and editing
Build your own AI friend