A framework to enable multimodal models to operate a computer
GPT-4o for Windows, macOS and Linux
3D reconstruction software
Open Source Differentiable Computer Vision Library
Datasets, transforms and models specific to Computer Vision
Medical imaging toolkit for deep learning
A natural language interface for computers
Fast image augmentation library and an easy-to-use wrapper
Control Any Computer Using LLMs
Agent Zero AI framework
The open-source tool for building high-quality datasets
Training data (data labeling, annotation, workflow) for all data types
We write your reusable computer vision tools
Python SDK for the Computer Use model Lux, developed by OpenAGI
Making large AI models cheaper, faster and more accessible
Hub of ready-to-use datasets for ML models
YOLOv5 is the world's most loved vision AI
A lightweight vision library for performing large object detection
Phi-3.5 for Mac: Locally-run Vision and Language Models
c/ua is the Docker Container for Computer-Use AI Agents
Visual Instruction Tuning: Large Language-and-Vision Assistant
Automate browser-based workflows with LLMs and Computer Vision
Open source framework for deep learning satellite and aerial imagery
Witness the aha moment of VLM with less than $3
A fast, powerful, and simple hierarchical vision transformer