Stable Diffusion web UI
A Lightweight Face Recognition and Facial Attribute Analysis
Industry leading face manipulation platform
The most powerful and modular diffusion model GUI, api and backend
Run Local LLMs on Any Device. Open-source
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Open-source, high-performance AI model with advanced reasoning
Deep Research framework, combining language models with tools
Video-based AI memory library. Store millions of text chunks in MP4
Powerful AI language model (MoE) optimized for efficiency/performance
TTS with kokoro and onnx runtime
Image inpainting tool powered by SOTA AI Model
OCRmyPDF adds an OCR text layer to scanned PDF files
3D reconstruction software
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Implementation of TurboQuant (ICLR 2026)
A simple, high-quality voice conversion tool focused on ease of use
Robust Speech Recognition via Large-Scale Weak Supervision
YOLOv5 is the world's most loved vision AI
OCR software, free and offline
The most powerful local music generation model
Autonomous research from idea to paper. Chat an Idea. Get a Paper 🦞
1 min voice data can also be used to train a good TTS model
Advanced language and coding AI model
Python inference and LoRA trainer package for the LTX-2 audio–video