Open source framework for deep learning on satellite and aerial imagery
A framework to enable multimodal models to operate a computer
Implementation of Vision Transformer, a simple way to achieve SOTA
Open Source Differentiable Computer Vision Library
Enable AI to control your desktop, mobile and HMI devices
Fast image augmentation library and an easy-to-use wrapper
Phi-3.5 for Mac: Locally-run Vision and Language Models
Witness the aha moment of VLM with less than $3
3D reconstruction software
Automatically find issues in image datasets
The repository provides code for running inference with SAM 2
Effortless data labeling with AI support from Segment Anything
A lightweight vision library for performing large object detection
Medical imaging toolkit for deep learning
Making large AI models cheaper, faster and more accessible
Datasets, transforms and models specific to Computer Vision
Advanced AI Explainability for computer vision
Training data (data labeling, annotation, workflow) for all data types
Open Source Computer Vision Library
ICLR 2024 Spotlight: curation/training code, metadata, distribution
The open-source tool for building high-quality datasets
Gracefully face hCaptcha challenges with multimodal LLMs
We write your reusable computer vision tools
Automate browser-based workflows with LLMs and Computer Vision
Hub of ready-to-use datasets for ML models