Fast and accurate automatic speech recognition (ASR) for edge devices
OpenCL integration for Python, plus shiny features
A simple, high-quality voice conversion tool focused on ease of use
Telegram Desktop messaging app
Multi-lingual large voice generation model, providing inference
Qwen3-TTS is an open-source series of TTS models
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Component library and custom registry built on top of shadcn/ui
Open source AI VTuber platform with voice chat and Live2D avatars
Framework to interpret and transpile JVM bytecode to JavaScript
Framework for building real-time voice and multimodal AI agents
A high-quality rapid TTS voice cloning model
TEN, a voice agent framework to create conversational AI.
Fast multimodal LLM for real-time voice interaction and AI apps
Official PyTorch Implementation
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Large Audio Language Model built for natural interactions
One-click deployment (including offline integration package)
State-of-the-art TTS model under 25MB
Build your own AI friend
Production ready toolkit to run AI locally
Open Source Computer Vision Library
Minimal plugin that lets Claude Code call you on the phone
Automatic Speech Recognition with Word-level Timestamps
ChatOllama is an open-source AI chatbot