Phi-3.5 for Mac: Locally-run Vision and Language Models
OpenVINO™ Toolkit repository
A lightweight vision library for performing large object detection
The repository provides code for running inference with SAM 2
Set of comprehensive computer vision & machine intelligence libraries
Provides code for running inference with the SegmentAnything Model
[CVPR 2025 Best Paper Award] VGGT
A neural network that transforms a design mock-up into static websites
Implementation of Vision Transformer, a simple way to achieve SOTA
A fast, powerful, and simple hierarchical vision transformer
Visual Instruction Tuning: Large Language-and-Vision Assistant
OpenFieldAI is an AI based Open Field Test Rodent Tracker
Blazeface is a lightweight model that detects faces in images
A computer vision framework to create and deploy apps in minutes
CoTracker is a model for tracking any point (pixel) on a video
High-Resolution 3D Human Digitization from A Single Image
Guide to deploying deep-learning inference networks
A real-time approach for mapping all human pixels of 2D RGB images
End-to-end object detection with transformers
Chrome Extension that displays automated image tags from Facebook
R-FCN: Object Detection via Region-based Fully Convolutional Networks
Python Computer Vision & Video Analytics Framework With Batteries Incl