Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Image generation model with single-stream diffusion transformer
RGBD video generation model conditioned on camera input
GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image
Diffusion Transformer with Fine-Grained Chinese Understanding
Open-source multi-speaker long-form text-to-speech model
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Multimodal Diffusion with Representation Alignment
Inference script for Oasis 500M
A PyTorch library for implementing flow matching algorithms
Official inference repo for FLUX.1 models
A Powerful Native Multimodal Model for Image Generation
Advanced language and coding AI model
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Official code for Style Aligned Image Generation via Shared Attention
Reference PyTorch implementation and models for DINOv3
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Access to Anthropic's safety-first language model APIs
Open-weight, large-scale hybrid-attention reasoning model
Let us control diffusion models
Official repo for consistency models
Official PyTorch Implementation of "Scalable Diffusion Models"
Code for reproducing key results in the paper
Open, non-commercial SDXL model for quality image generation