Enhances Tesseract OCR output using LLMs (local or API)
State-of-the-art 2D and 3D Face Analysis Project
Run Local LLMs on Any Device. Open-source
Industry leading face manipulation platform
TTS with kokoro and onnx runtime
A simple, high-quality voice conversion tool focused on ease of use
Implementation of TurboQuant (ICLR 2026)
Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge
AI-powered video generation skill for OpenClaw
OCR software, free and offline
AI bridge enabling assistants to control and automate Unity Editor
Qwen3-TTS is an open-source series of TTS models
Open-Sora: Democratizing Efficient Video Production for All
AI tool that removes hardcoded subtitles and text from videos locally
CNCF Sandbox Project
Generate high-definition story short videos with one click using AI
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Synchronized Translation for Videos
From-scratch PyTorch implementation of Google's TurboQuant
Speech-AI-Forge is a project developed around TTS generation model
NBA sports betting using machine learning
AI agent harness for AI coding agents
Machine learning image inpainting task that removes watermarks
DeepSeek Coder: Let the Code Write Itself
AI video generator optimized for low VRAM and older GPUs use