MCP server enabling AI agents to control and automate Windows OS
GUI for a Vocal Remover that uses Deep Neural Networks
Real time face swap and one-click video deepfake
State-of-the-art 2D and 3D Face Analysis Project
Industry leading face manipulation platform
Stable Diffusion web UI
Comprehensive Gradio WebUI for audio processing
A Simple and Universal Swarm Intelligence Engine
Focus on prompting and generating
Personal AI, On Personal Devices
The most powerful and modular diffusion model GUI, api and backend
World's first open-source, agentic video production system
Run Local LLMs on Any Device. Open-source
Deep Research framework, combining language models with tools
Official Python inference and LoRA trainer package
The agent that grows with you
Public repository for Agent Skills
The most powerful local music generation model
3D reconstruction software
A simple, high-quality voice conversion tool focused on ease of use
OCRmyPDF adds an OCR text layer to scanned PDF files
TTS with kokoro and onnx runtime
Open-source, high-performance AI model with advanced reasoning
Powerful AI language model (MoE) optimized for efficiency/performance
Wan2.2: Open and Advanced Large-Scale Video Generative Model