Video translation and dubbing tool powered by LLMs
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
Self-host the powerful Chatterbox TTS model
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
The Classical Language Toolkit
Framework for building realtime multimodal voice AI agents apps
Towards Human-Sounding Speech
Trained models & code to predict toxic comments
Framework for building real-time voice and multimodal AI agents
Lightning-fast, on-device TTS, running natively via ONNX
Apache OpenNLP
Textream is a free macOS teleprompter app for streamers, interviewers
A sound cloning tool with a web interface, using your voice
A suite of advanced multi-modal LLMs
Open source AI VTuber platform with voice chat and Live2D avatars
Python Audio Analysis Library: Feature Extraction, Classification
LLM Large Model of Selling Anchor
The python library for real-time communication
The free, Open Source alternative to OpenAI, Claude and others
Instantly generate AI-powered subtitles on your device
Industrial-strength Natural Language Processing (NLP)
Readest is a modern, feature-rich ebook reader
In-App assistant SDK to build a multimodal conversational UX websites
Controllable and fast Text-to-Speech for over 7000 languages
The purpose of the project is to develop audio processing algorithms