GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
PyTorch code and models for the DINOv2 self-supervised learning
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Any model. Any hardware. Zero compromise
PyTorch code and models for V-JEPA self-supervised learning from video
Data Lake for Deep Learning. Build, manage, and query datasets
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Official DeiT repository
Embed images and sentences into fixed-length vectors
Text-to-Image generation. The repo for NeurIPS 2021 paper
Code release for "Detecting Twenty-thousand Classes
OpenMMLab Image Classification Toolbox and Benchmark
Official repo for consistency models
A large open dataset + tools to speed up MRI scans using ML
A latent text-to-image diffusion model
StudioGAN is a Pytorch library providing implementations of networks
Generative Adversarial Transformers
Turns your machine learning code into microservices with web API
A real-time approach for mapping all human pixels of 2D RGB images
Large-scale autoregressive pixel model for image generation by OpenAI
Copy code in "Glow: Generative Flow with Invertible 1x1 Convolutions"
VGGFace2 Dataset for Face Recognition
Super-scale your images and run experiments with Residual Dense
Image augmentation for machine learning experiments
Deep learning person re-identification in PyTorch