A neural network that transforms a design mock-up into a static website
Structure-from-Motion and Multi-View Stereo
The repository provides code for running inference with SAM 2
Provides code for running inference with the SegmentAnything Model
Implementation of Vision Transformer, a simple way to achieve SOTA
Open Source Computer Vision Library
Phi-3.5 for Mac: Locally-run Vision and Language Models
ICLR 2024 Spotlight: curation/training code, metadata, distribution
[CVPR 2025 Best Paper Award] VGGT
Fast C++ library for linear algebra & scientific computing
A fast, powerful, and simple hierarchical vision transformer
Machine learning algorithms for advanced analytics
Blazeface is a lightweight model that detects faces in images
FAIR's research platform for object detection research
Resources to learn computer science in your spare time
High-Resolution 3D Human Digitization from A Single Image
Joint Face Detection and Alignment
Code release for ConvNeXt model
Class Activation Mapping
Codebase for Image Classification Research, written in PyTorch
A real-time approach for mapping all human pixels of 2D RGB images
Fast, modular reference implementation of Instance Segmentation