Structure-from-Motion and Multi-View Stereo
Provides code for running inference with the SegmentAnything Model
Implementation of Vision Transformer, a simple way to achieve SOTA
[CVPR 2025 Best Paper Award] VGGT
A fast, powerful, and simple hierarchical vision transformer
FAIR's research platform for object detection research
Codebase for Image Classification Research, written in PyTorch