Fast-stable-diffusion + DreamBooth
High-Resolution Image Synthesis with Latent Diffusion Models
Easy Docker setup for Stable Diffusion with user-friendly UI
Stable Diffusion WebUI Forge is a platform built on top of Stable Diffusion
Block Diffusion for Ultra-Fast Speculative Decoding
Image generation model with single-stream diffusion transformer
RGBD video generation model conditioned on camera input
Diffusion Transformer with Fine-Grained Chinese Understanding
Open-source multi-speaker long-form text-to-speech model
GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image
PyTorch implementation of JiT
Multimodal Diffusion with Representation Alignment
Official Python inference and LoRA trainer package
Inference script for Oasis 500M
Tencent Hunyuan multimodal diffusion transformer (MM-DiT) model
Official inference repo for FLUX.1 models
A PyTorch library for implementing flow matching algorithms
A Powerful Native Multimodal Model for Image Generation
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Advanced language and coding AI model
Reference PyTorch implementation and models for DINOv3
Foundation model for image generation
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
Access to Anthropic's safety-first language model APIs
Long-form streaming TTS system for multi-speaker dialogue generation