Structure-from-Motion and Multi-View Stereo
3D reconstruction software
Interactive video and image annotation tool for computer vision
Java interface to OpenCV, FFmpeg, and more
Open Source Computer Vision Library
OpenVINO™ Toolkit repository
Google Testing and Mocking Framework
ArrayFire, a general purpose GPU library
Go package for computer vision using OpenCV 4 and beyond
Training data (data labeling, annotation, workflow) for all data types
The repository provides code for running inference with SAM 2
The open-source tool for building high-quality datasets
Datasets, transforms and models specific to Computer Vision
Medical imaging toolkit for deep learning
Open source framework for deep learning satellite and aerial imagery
AWS IoT FleetWise Edge Agent
Witness the aha moment of VLM with less than $3
A neural network that transforms a design mock-up into static websites
Phi-3.5 for Mac: Locally-run Vision and Language Models
Deep Learning-based Image Fusion: A Survey
A fast, powerful, and simple hierarchical vision transformer
Implementation of Vision Transformer, a simple way to achieve SOTA
ICLR2024 Spotlight: curation/training code, metadata, distribution
[CVPR 2025 Best Paper Award] VGGT
Provides code for running inference with the SegmentAnything Model