Stable Diffusion v1-5 is a latent text-to-image diffusion model that produces high-quality, photo-realistic images from natural language prompts. It was initialized from the v1.2 checkpoint and fine-tuned for 595,000 additional steps at 512x512 resolution on the “laion-aesthetics v2 5+” dataset; during fine-tuning, 10% of the text conditioning was dropped to improve classifier-free guidance sampling. The model pairs a CLIP ViT-L/14 text encoder with a UNet-based diffusion backbone operating in latent space, enabling fast and efficient image synthesis. Stable Diffusion v1-5 is compatible with Diffusers, ComfyUI, AUTOMATIC1111, and other user interfaces. Its intended uses are research and creative applications such as digital art, design, and the exploration of generative models. While powerful, it has known limitations in photorealism, compositionality, and cultural representation, and it requires responsible use under the CreativeML OpenRAIL-M license.
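The 10% prompt dropout mentioned above amounts to replacing the text conditioning with an empty prompt for a random tenth of training examples, so the model also learns an unconditional noise prediction, which is what classifier-free guidance needs at sampling time. A minimal sketch of that idea (the helper name and values are illustrative, not the actual training code):

```python
import random

# Illustrative sketch of 10% prompt dropout (not the real training loop):
# a random ~10% of captions are swapped for the empty string so the model
# also learns an unconditional noise prediction.
DROPOUT_PROB = 0.10

def maybe_drop_prompt(prompt, rng):
    """Return "" for roughly 10% of calls, otherwise the original prompt."""
    return "" if rng.random() < DROPOUT_PROB else prompt

rng = random.Random(0)
prompts = ["a photo of an astronaut riding a horse"] * 10_000
n_dropped = sum(1 for p in prompts if maybe_drop_prompt(p, rng) == "")
print(n_dropped / len(prompts))  # close to 0.10
```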
Features
- Generates images from natural language prompts using latent diffusion
- Fine-tuned for 595k additional steps, improving aesthetic quality and prompt alignment
- Uses CLIP ViT-L/14 for text encoding and UNet for image generation
- Supports classifier-free guidance for stronger prompt adherence
- Optimized for generation at 512x512 resolution
- Compatible with Diffusers, ComfyUI, AUTOMATIC1111, SD.Next, and InvokeAI
- Licensed under CreativeML OpenRAIL-M for responsible open use
- Trained on laion-aesthetics v2 5+, optimized for visual appeal
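The classifier-free guidance listed above blends the model's unconditional and conditional noise predictions at each sampling step. A minimal numerical sketch of that blend, using made-up stand-in values rather than real UNet outputs:

```python
def cfg_combine(eps_uncond, eps_cond, guidance_scale):
    """Classifier-free guidance blend:
    eps = eps_uncond + s * (eps_cond - eps_uncond).
    A scale of 1.0 reproduces the conditional prediction; larger values
    (7.5 is a common default) push the sample further toward the prompt.
    """
    return [u + guidance_scale * (c - u) for u, c in zip(eps_uncond, eps_cond)]

# Made-up stand-ins for UNet noise predictions on a 3-element latent.
eps_uncond = [0.0, -0.2, 0.5]   # prediction with the empty prompt
eps_cond = [1.0, -0.1, 0.5]     # prediction with the text prompt

print(cfg_combine(eps_uncond, eps_cond, 1.0))  # equals eps_cond
print(cfg_combine(eps_uncond, eps_cond, 7.5))
```

Higher guidance scales trade sample diversity for prompt adherence, which is why the 10% unconditional training mentioned above matters: without it, the model would have no meaningful `eps_uncond` to blend against.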