Qwen-Image is a powerful image generation foundation model
Image processing in Python
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
The largest collection of PyTorch image encoders / backbones
Image polygonal annotation with Python
Real time face swap and one-click video deepfake
Focus on prompting and generating
Stable Diffusion web UI
Web interface for generating images using Stable Diffusion models
Official inference repo for FLUX.2 models
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
OCRmyPDF adds an OCR text layer to scanned PDF files
A Powerful Native Multimodal Model for Image Generation
Image inpainting tool powered by SOTA AI Model
Models for object and human mesh reconstruction
Official DeiT repository
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Stable Diffusion built-in to Blender
The most powerful and modular diffusion model GUI, api and backend
Label Studio is a multi-type data labeling and annotation tool
Multimodal-Driven Architecture for Customized Video Generation
InvokeAI is a leading creative engine for Stable Diffusion models
Chat & pretrained large vision language model
text and image to video generation: CogVideoX (2024) and CogVideo
Ready-to-use OCR with 80+ supported languages