Fast and accurate automatic speech recognition (ASR) for edge devices
OpenCL integration for Python, plus shiny features
A simple, high-quality voice conversion tool focused on ease of use
Telegram Desktop messaging app
Multi-lingual large voice generation model, providing inference
Qwen3-TTS is an open-source series of TTS models
Component library and custom registry built on top of shadcn/ui
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
A high-quality rapid TTS voice cloning model
Open source AI VTuber platform with voice chat and Live2D avatars
Framework for building real-time voice and multimodal AI agents
TEN, a voice agent framework to create conversational AI.
Fast multimodal LLM for real-time voice interaction and AI apps
Framework to interpret and transpile JVM bytecode to JavaScript
Official PyTorch Implementation
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Large Audio Language Model built for natural interactions
One-click deployment (including offline integration package)
State-of-the-art TTS model under 25MB
Production ready toolkit to run AI locally
Build your own AI friend
ChatOllama is an open-source AI chatbot
Automatic Speech Recognition with Word-level Timestamps
Chat with it via text and voice
Open Source Computer Vision Library