Comprehensive Gradio WebUI for audio processing
OCR software, free and offline
StreamSpeech is a seamless model for offline speech recognition
Offline text-to-speech synthesis for Python
Offline inference engine for art, real-time voice conversations
Video-based AI memory library. Store millions of text chunks in MP4
A TTS that fits in your CPU (and pocket)
Speech recognition module for Python
AI tool that removes hardcoded subtitles and text from videos locally
One-click deployment (including offline integration package)
Qwen3-omni is a natively end-to-end, omni-modal LLM
Implementation of "MobileCLIP" CVPR 2024
Powerful Android AI agent with tools, automation, and Linux shell
Python & command-line tool to gather text on the Web
Voice Recognition to Text Tool
A lightweight text-to-speech model with zero-shot voice cloning
Open source AI VTuber platform with voice chat and Live2D avatars
Chat with it via text and voice
A sound cloning tool with a web interface, using your voice
Algorithms for outlier, adversarial and drift detection
Chinese version of the Google open source project style guide
OpenRecall is a fully open-source, privacy-first alternative
Translate English to Bangla from CSV files, over a selected range.
Text to Speech Utility
Offline clipboard manager for Windows with history, search, and locked