Release for Improved Denoising Diffusion Probabilistic Models
High-Resolution Image Synthesis with Latent Diffusion Models
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Lets make video diffusion practical
RGBD video generation model conditioned on camera input
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Diffusion Transformer with Fine-Grained Chinese Understanding
Wan2.1: Open and Advanced Large-Scale Video Generative Model
HY-Motion model for 3D character animation generation
Open-source multi-speaker long-form text-to-speech model
Repo for SeedVR2 & SeedVR
Inference script for Oasis 500M
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Multimodal Diffusion with Representation Alignment
Personalize Any Characters with a Scalable Diffusion Transformer
State-of-the-art (SoTA) text-to-video pre-trained model
High-Fidelity and Controllable Generation of Textured 3D Assets
A SOTA open-source image editing model
Generating Immersive, Explorable, and Interactive 3D Worlds
A PyTorch library for implementing flow matching algorithms
A Powerful Native Multimodal Model for Image Generation
Official code for Style Aligned Image Generation via Shared Attention
Global weather forecasting model using graph neural networks and JAX