Open source framework for deep learning on satellite and aerial imagery
A framework to enable multimodal models to operate a computer
Implementation of Vision Transformer, a simple way to achieve SOTA
Open Source Differentiable Computer Vision Library
Enable AI to control your desktop, mobile and HMI devices
Fun AI projects related to computer vision
Phi-3.5 for Mac: Locally-run Vision and Language Models
Fast image augmentation library and an easy-to-use wrapper
Medical imaging toolkit for deep learning
3D reconstruction software
Witness the aha moment of VLM with less than $3
The repository provides code for running inference with SAM 2
Datasets, transforms and models specific to Computer Vision
A lightweight vision library for performing large-scale object detection
Effortless data labeling with AI support from Segment Anything
Making large AI models cheaper, faster and more accessible
The open-source tool for building high-quality datasets
Training data (data labeling, annotation, workflow) for all data types
Gracefully face hCaptcha challenges with multimodal LLMs
We write your reusable computer vision tools
Hub of ready-to-use datasets for ML models
Advanced AI Explainability for computer vision
ICLR 2024 Spotlight: curation/training code, metadata, distribution
A neural network that transforms a design mock-up into a static website
AI tool for automating desktop tasks via natural language input