Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Large-language-model & vision-language-model based on Linear Attention
Tooling for the Common Objects In 3D dataset
Code for Mesh R-CNN, ICCV 2019
Language modeling in a sentence representation space
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
High-Resolution Image Synthesis with Latent Diffusion Models
Official code for Style Aligned Image Generation via Shared Attention
Powerful open source image generation model
Suite with Real-ESRGAN, BSRGAN, RealESRNet, IRCNN, GFPGAN & RIFE.
Let us control diffusion models
Official repo for consistency models
Official PyTorch Implementation of "Scalable Diffusion Models"
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion
Code release for ConvNeXt V2 model
PyTorch implementation of MAE
GLIDE: a diffusion-based text-conditional image synthesis model
Per-Pixel Classification is Not All You Need for Semantic Segmentation
Large-scale autoregressive pixel model for image generation by OpenAI
Reproduces results of "Fixing the train-test resolution discrepancy"
A mix of GAN implementations including progressive growing
Code for "Image Generation from Scene Graphs", Johnson et al, CVPR 201