Structure-from-Motion and Multi-View Stereo
OpenVINO™ Toolkit repository
Interactive video and image annotation tool for computer vision
Java interface to OpenCV, FFmpeg, and more
Open Source Computer Vision Library
Google Testing and Mocking Framework
Training data (data labeling, annotation, workflow) for all data types
Datasets, transforms and models specific to Computer Vision
The repository provides code for running inference with SAM 2
A neural network that transforms a design mock-up into a static website
Medical imaging toolkit for deep learning
Go package for computer vision using OpenCV 4 and beyond
Visual Instruction Tuning: Large Language-and-Vision Assistant
Deep Learning-based Image Fusion: A Survey
Implementation of Vision Transformer, a simple way to achieve SOTA
ICLR 2024 Spotlight: curation/training code, metadata, distribution
Provides code for running inference with the Segment Anything Model
Fast image augmentation library and an easy-to-use wrapper
ArrayFire, a general-purpose GPU library
Witness the aha moment of VLM with less than $3
Phi-3.5 for Mac: Locally-run Vision and Language Models
A fast, powerful, and simple hierarchical vision transformer
[CVPR 2025 Best Paper Award] VGGT
The open-source tool for building high-quality datasets
Making large AI models cheaper, faster and more accessible