Structure-from-Motion and Multi-View Stereo
Open Source Computer Vision Library
OpenVINO™ Toolkit repository
Google Testing and Mocking Framework
Interactive video and image annotation tool for computer vision
Java interface to OpenCV, FFmpeg, and more
Implementation of Vision Transformer, a simple way to achieve SOTA
Phi-3.5 for Mac: Locally-run Vision and Language Models
The repository provides code for running inference with SAM 2
Training data (data labeling, annotation, workflow) for all data types
The open-source tool for building high-quality datasets
Medical imaging toolkit for deep learning
AWS IoT FleetWise Edge Agent
Hub of ready-to-use datasets for ML models
ArrayFire, a general-purpose GPU library
A neural network that transforms a design mock-up into a static website
[CVPR 2025 Best Paper Award] VGGT
Provides code for running inference with the Segment Anything Model
Go package for computer vision using OpenCV 4 and beyond
Fast image augmentation library and an easy-to-use wrapper
Set of comprehensive computer vision & machine intelligence libraries
Witness the aha moment of VLM with less than $3
Deep Learning-based Image Fusion: A Survey
ICLR 2024 Spotlight: curation/training code, metadata, distribution
Making large AI models cheaper, faster and more accessible