A framework to enable multimodal models to operate a computer
gpt-4o for windows, macos and linux
Interactive video and image annotation tool for computer vision
Open Source Computer Vision Library
3D reconstruction software
Open Source Differentiable Computer Vision Library
Set of comprehensive computer vision & machine intelligence libraries
Structure-from-Motion and Multi-View Stereo
OpenVINO™ Toolkit repository
Google Testing and Mocking Framework
Datasets, transforms and models specific to Computer Vision
Go package for computer vision using OpenCV 4 and beyond
Java interface to OpenCV, FFmpeg, and more
Fast image augmentation library and an easy-to-use wrapper
Control Any Computer Using LLMs
Training data (data labeling, annotation, workflow) for all data types
Medical imaging toolkit for deep learning
LLM Frontend for Power Users
The open-source tool for building high-quality datasets
ArrayFire, a general purpose GPU library
A generic, simple and fast implementation of Deepmind's AlphaZero
Visual Instruction Tuning: Large Language-and-Vision Assistant
Making large AI models cheaper, faster and more accessible
The Compute Library is a set of computer vision and machine learning
A natural language interface for computers