ControlNet is a neural network architecture that enhances Stable Diffusion by enabling image generation conditioned on specific visual structures such as edges, poses, depth maps, and segmentation masks. By injecting these auxiliary inputs into the diffusion process, ControlNet gives users precise control over the layout and composition of generated images while preserving the style and flexibility of the underlying generative model.

It supports a wide range of conditioning types through pretrained modules, including Canny edges, HED soft edges, Midas depth, OpenPose skeletons, normal maps, MLSD lines, scribbles, and ADE20k-based semantic segmentation. The release includes both the ControlNet+SD1.5 model weights and compatible third-party detectors such as Midas and OpenPose for extracting input features. Each conditioning type is paired with a specific .pth model file used alongside Stable Diffusion for fine-grained control.
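As a quick illustration of how a conditioning map drives generation, the following is a minimal sketch of Canny-conditioned generation via the Hugging Face diffusers integration. The library calls and checkpoint names (diffusers, lllyasviel/sd-controlnet-canny, runwayml/stable-diffusion-v1-5) are assumptions for illustration and are not prescribed by this release.

```python
# Sketch: Canny-edge-conditioned generation with diffusers (assumed integration).
import cv2
import numpy as np
import torch
from PIL import Image
from diffusers import ControlNetModel, StableDiffusionControlNetPipeline

# Extract Canny edges from a reference image to use as the conditioning map.
image = np.array(Image.open("reference.png").convert("RGB"))
edges = cv2.Canny(image, 100, 200)
edges = np.stack([edges] * 3, axis=-1)  # single channel -> 3-channel map
condition = Image.fromarray(edges)

# Load the Canny ControlNet and attach it to a Stable Diffusion 1.5 pipeline.
controlnet = ControlNetModel.from_pretrained(
    "lllyasviel/sd-controlnet-canny", torch_dtype=torch.float16
)
pipe = StableDiffusionControlNetPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", controlnet=controlnet, torch_dtype=torch.float16
).to("cuda")

# The edge map constrains layout and composition; the prompt controls style and content.
result = pipe("a futuristic city at dusk", image=condition, num_inference_steps=30).images[0]
result.save("output.png")
```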
Features
- Extends Stable Diffusion with control based on image structure
- Supports conditioning via edges, depth, pose, scribbles, and more
- Includes pretrained weights for multiple input types (e.g., Canny, Midas, OpenPose)
- Allows precise manipulation of image composition and layout
- Compatible with AUTOMATIC1111 Web UI and Hugging Face demos
- Enables sketch-to-image, pose-to-image, and segmentation-guided generation (a pose-to-image sketch follows this list)
- Includes training resources and detection models for setup
- Released under OpenRAIL license to guide ethical use
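The pose-to-image workflow referenced above follows the same pattern: extract a skeleton with a detector, then pass it as the conditioning image. The controlnet_aux detector and the checkpoint names below are assumptions used for illustration, not part of the original setup instructions.

```python
# Sketch: pose-to-image generation with an OpenPose conditioning map (assumed APIs).
import torch
from PIL import Image
from controlnet_aux import OpenposeDetector
from diffusers import ControlNetModel, StableDiffusionControlNetPipeline

# Extract an OpenPose skeleton from a reference photo.
openpose = OpenposeDetector.from_pretrained("lllyasviel/Annotators")
pose_map = openpose(Image.open("person.png").convert("RGB"))

# Attach the OpenPose ControlNet to a Stable Diffusion 1.5 pipeline.
controlnet = ControlNetModel.from_pretrained(
    "lllyasviel/sd-controlnet-openpose", torch_dtype=torch.float16
)
pipe = StableDiffusionControlNetPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", controlnet=controlnet, torch_dtype=torch.float16
).to("cuda")

# The skeleton fixes the figure's pose; the prompt decides everything else.
result = pipe("an astronaut dancing on the moon", image=pose_map).images[0]
result.save("pose_output.png")
```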