AI tool that removes hardcoded subtitles and text from videos locally
Video Object and Interaction Deletion
A lightweight vision library for performing large object detection
PyTorch code and models for V-JEPA self-supervised learning from video
PyTorch code and models for VJEPA2 self-supervised learning from video
Chat & pretrained large vision language model
Powerful open source image generation model
CLIP + FFT/DWT/RGB = text to image/video
A MNIST-like fashion product database
A real-time approach for mapping all human pixels of 2D RGB images
Large-scale autoregressive pixel model for image generation by OpenAI
We estimate dense, flicker-free, geometrically consistent depth
Code for the paper "PixelCNN++: A PixelCNN Implementation..."
Image augmentation for machine learning experiments
PyTorch implementation of BigGAN with pretrained weights
A simple interface for editing natural photos
Software for measuring and training an AI's general intelligence