MCP server enabling AI agents to control and automate Windows OS
Awesome multilingual OCR toolkits based on PaddlePaddle
gpt-4o for windows, macos and linux
GUI for a Vocal Remover that uses Deep Neural Networks
Real time face swap and one-click video deepfake
Focus on prompting and generating
State-of-the-art 2D and 3D Face Analysis Project
Industry leading face manipulation platform
Personal AI, On Personal Devices
TTS with kokoro and onnx runtime
A Simple and Universal Swarm Intelligence Engine
Stable Diffusion web UI
The most powerful and modular diffusion model GUI, api and backend
Run Local LLMs on Any Device. Open-source
An enhanced tool for CodexApp, striving to make Codex better to use
Oobabooga - The definitive Web UI for local AI, with powerful features
A simple, high-quality voice conversion tool focused on ease of use
Agentic, Reasoning, and Coding (ARC) foundation models
World's first open-source, agentic video production system
Unified web UI for training and running open models locally
AI tool that removes hardcoded subtitles and text from videos locally
Public repository for Agent Skills
OCRmyPDF adds an OCR text layer to scanned PDF files
AI video generator optimized for low VRAM and older GPUs use
Official Python inference and LoRA trainer package