High-Resolution Image Synthesis with Latent Diffusion Models
Fast-stable-diffusion + DreamBooth
Easy Docker setup for Stable Diffusion with user-friendly UI
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
Learning agent trained in a diffusion world model
Multi-user UI tool for managing and running Stable Diffusion workflows
A general fine-tuning kit geared toward image/video/audio diffusion
Open platform for sharing and discovering Stable Diffusion models
Stable Diffusion for real-time music generation (web app)
dLLM: Simple Diffusion Language Modeling
100–200× Acceleration for Video Diffusion Models
Block Diffusion for Ultra-Fast Speculative Decoding
Image generation model with single-stream diffusion transformer
RGBD video generation model conditioned on camera input
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
A playground to generate images from any text prompt using SD
Autoregressive Model Beats Diffusion
Diffusion Transformer with Fine-Grained Chinese Understanding
Open-source multi-speaker long-form text-to-speech model
GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image
Collection of CVPR 2026 Papers and Open Source Projects
Multimodal Diffusion with Representation Alignment
PyTorch implementation of JiT
Cosmos-RL is a flexible and scalable Reinforcement Learning framework
Official Python inference and LoRA trainer package