State-of-the-art 2D and 3D Face Analysis Project
Industry leading face manipulation platform
Focus on prompting and generating
Stable Diffusion web UI
Official Python inference and LoRA trainer package
Run Local LLMs on Any Device. Open-source
The most powerful and modular diffusion model GUI, api and backend
Personal AI, On Personal Devices
3D reconstruction software
Public repository for Agent Skills
Image inpainting tool powered by SOTA AI Model
Deep Research framework, combining language models with tools
A simple, high-quality voice conversion tool focused on ease of use
OCRmyPDF adds an OCR text layer to scanned PDF files
The agent that grows with you
TTS with kokoro and onnx runtime
AI tool that removes hardcoded subtitles and text from videos locally
Powerful AI language model (MoE) optimized for efficiency/performance
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Fast and memory-efficient exact attention
Awesome multilingual OCR toolkits based on PaddlePaddle
The most powerful local music generation model
Agentic, Reasoning, and Coding (ARC) foundation models
Open-source, high-performance AI model with advanced reasoning
1 min voice data can also be used to train a good TTS model