Self-host the powerful Chatterbox TTS model
Towards Human-Sounding Speech
Framework for building realtime multimodal voice AI agents apps
The Classical Language Toolkit
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Trained models & code to predict toxic comments
Lightning-fast, on-device TTS, running natively via ONNX
Framework for building real-time voice and multimodal AI agents
Apache OpenNLP
LLM Large Model of Selling Anchor
A suite of advanced multi-modal LLMs
A sound cloning tool with a web interface, using your voice
Open source AI VTuber platform with voice chat and Live2D avatars
The free, Open Source alternative to OpenAI, Claude and others
Controllable and fast Text-to-Speech for over 7000 languages
Instantly generate AI-powered subtitles on your device
Python Audio Analysis Library: Feature Extraction, Classification
The python library for real-time communication
Industrial-strength Natural Language Processing (NLP)
In-App assistant SDK to build a multimodal conversational UX websites
Readest is a modern, feature-rich ebook reader
AI-powered video clipping and highlight generation
Declarative way to run AI models in React Native on device
Large Audio Language Model built for natural interactions
Multi-modal large language model designed for audio understanding