PyTorch code and models for VJEPA2 self-supervised learning from video
A lightweight vision library for performing large object detection
PyTorch code and models for V-JEPA self-supervised learning from video
Chat & pretrained large vision language model
CLIP + FFT/DWT/RGB = text to image/video
Powerful open source image generation model
A real-time approach for mapping all human pixels of 2D RGB images
Large-scale autoregressive pixel model for image generation by OpenAI
We estimate dense, flicker-free, geometrically consistent depth
Code for the paper "PixelCNN++: A PixelCNN Implementation..."
Image augmentation for machine learning experiments
PyTorch implementation of BigGAN with pretrained weights
Software for measuring and training an AI's general intelligence