A neural network that transforms a design mock-up into a static website
The repository provides code for running inference with SAM 2
Structure-from-Motion and Multi-View Stereo
Implementation of Vision Transformer, a simple way to achieve SOTA
Provides code for running inference with the Segment Anything Model
ICLR 2024 Spotlight: curation/training code, metadata, distribution
Phi-3.5 for Mac: Locally-run Vision and Language Models
[CVPR 2025 Best Paper Award] VGGT
Fast C++ library for linear algebra & scientific computing
A fast, powerful, and simple hierarchical vision transformer
Machine learning algorithms for advanced analytics
Blazeface is a lightweight model that detects faces in images
A computer vision framework to create and deploy apps in minutes
FAIR's research platform for object detection research
Resources to learn computer science in your spare time
High-Resolution 3D Human Digitization from A Single Image
Joint Face Detection and Alignment
Code release for ConvNeXt model
Class Activation Mapping
Codebase for Image Classification Research, written in PyTorch
A real-time approach for mapping all human pixels of 2D RGB images
Fast, modular reference implementation of Instance Segmentation
C++ library for image acquisition and visualization
Chrome Extension that displays automated image tags from Facebook