Image generation model with single-stream diffusion transformer
Qwen-Image is a powerful image generation foundation model
GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image
Deep Learning-based Image Fusion: A Survey
Image processing in Python
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
The largest collection of PyTorch image encoders / backbones
Image polygonal annotation with Python
A high-performance image compression microservice based on MCP
Real time face swap and one-click video deepfake
Open Source OCR Engine
Focus on prompting and generating
Stable Diffusion web UI
Resources for deep learning with satellite & aerial imagery
Web interface for generating images using Stable Diffusion models
Free and Open Source AI Image Upscaler for Linux, MacOS and Windows
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Official inference repo for FLUX.2 models
A pure Javascript Multilingual OCR
A Powerful Native Multimodal Model for Image Generation
OCRmyPDF adds an OCR text layer to scanned PDF files
Image inpainting tool powered by SOTA AI Model
Easily turn large sets of image urls to an image dataset
Official DeiT repository
Stable Diffusion built-in to Blender