Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Extension index for stable-diffusion-webui
Release for Improved Denoising Diffusion Probabilistic Models
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Image generation model with single-stream diffusion transformer
Lets make video diffusion practical
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
RGBD video generation model conditioned on camera input
GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image
Diffusion Transformer with Fine-Grained Chinese Understanding
Wan2.1: Open and Advanced Large-Scale Video Generative Model
HY-Motion model for 3D character animation generation
Open-source multi-speaker long-form text-to-speech model
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Multimodal Diffusion with Representation Alignment
Personalize Any Characters with a Scalable Diffusion Transformer
Inference script for Oasis 500M
Generating Immersive, Explorable, and Interactive 3D Worlds
A PyTorch library for implementing flow matching algorithms
Official inference repo for FLUX.1 models
A Powerful Native Multimodal Model for Image Generation
Advanced language and coding AI model
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Qwen3-omni is a natively end-to-end, omni-modal LLM