Phi-3.5 for Mac: Locally Run Vision and Language Models
A lightweight vision library for performing large-scale object detection
Implementation of Vision Transformer, a simple way to achieve SOTA
ICLR 2024 Spotlight: curation/training code, metadata, distribution
Provides code for running inference with the Segment Anything Model
BlazeFace is a lightweight model that detects faces in images
Chrome Extension that displays automated image tags from Facebook
A modern approach for Computer Vision on the web