Comprehensive Gradio WebUI for audio processing
OCR software, free and offline
Offline Text To Speech synthesis for python
StreamSpeech is a seamless model for offline speech recognition
Offline inference engine for art, real-time voice conversations
Video-based AI memory library. Store millions of text chunks in MP4
A TTS that fits in your CPU (and pocket)
Speech recognition module for Python
AI tool that removes hardcoded subtitles and text from videos locally
One-click deployment (including offline integration package)
Qwen3-omni is a natively end-to-end, omni-modal LLM
Implementation of "MobileCLIP" CVPR 2024
Powerful Android AI agent with tools, automation, and Linux shell
Voice Recognition to Text Tool
Open source AI VTuber platform with voice chat and Live2D avatars
A lightweight text-to-speech model with zero-shot voice cloning
Chat with it via text and voice
A sound cloning tool with a web interface, using your voice
Algorithms for outlier, adversarial and drift detection
Translate English to Bangla using CSV file format and range wise.
OpenRecall is a fully open-source, privacy-first alternative
Text to Speech Utility
Unlimited, private and free Speech-To-Text program
Offline desktop app to convert EPUB to MP3 using Kokoro-82M neural TTS
An OCR translator tool made by utilizing tesseract & python-opencv