A tool to snap pixels to a perfect grid
PyTorch code and models for V-JEPA self-supervised learning from video
PyTorch code and models for VJEPA2 self-supervised learning from video
PyTorch implementation of JiT
RL research on Android devices
Marrying Grounding DINO with Segment Anything & Stable Diffusion
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
LISA: Reasoning Segmentation via Large Language Model
StarVector is a foundation model for SVG generation
Official Repo For "Sa2VA: Marrying SAM2 with LLaVA
Astronomical object/structure detection from 1D and 2D data sets.
Powerful open source image generation model
CoTracker is a model for tracking any point (pixel) on a video
High-Resolution 3D Human Digitization from A Single Image
Code release for "Masked-attention Mask Transformer
A MNIST-like fashion product database
A real-time approach for mapping all human pixels of 2D RGB images
simple algorithm for a realtime interactive visual cortex for painting
Metric monocular depth estimation (vision model)