The camera library that sees the vision
Open Vision Agents by Stream. Build voice and vision agents quickly
Visual intelligence for your home.
Open source framework for deep learning on satellite and aerial imagery
Enable AI to control your desktop, mobile and HMI devices
Open-Source RPA Software (formerly Kantu)
Implementation of Vision Transformer, a simple way to achieve SOTA
Give Claude the ability to watch and understand videos
Interactive video and image annotation tool for computer vision
Open Source Computer Vision Library
Build Vision Agents quickly with any model or video provider
dovi_tool is a CLI tool combining multiple utilities
A computer vision closed-loop learning platform
Phi-3.5 for Mac: Locally-run Vision and Language Models
C++ and Python Examples
Open Source Differentiable Computer Vision Library
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Witness the aha moment of VLM with less than $3
3D reconstruction software
Effortless data labeling with AI support from Segment Anything
The repository provides code for running inference with SAM 2
Go package for computer vision using OpenCV 4 and beyond
Collection of CVPR 2026 Papers and Open Source Projects
OpenVINO™ Toolkit repository
Structure-from-Motion and Multi-View Stereo