Structure-from-Motion and Multi-View Stereo
Interactive video and image annotation tool for computer vision
Phi-3.5 for Mac: Locally-run Vision and Language Models
A neural network that transforms a design mock-up into static websites
Open source framework for deep learning satellite and aerial imagery
Implementation of Vision Transformer, a simple way to achieve SOTA
[CVPR 2025 Best Paper Award] VGGT
A lightweight vision library for performing large object detection
Fast image augmentation library and an easy-to-use wrapper
Set of comprehensive computer vision & machine intelligence libraries
Visual Automation IDE — automate anything you see on screen
An Embedded Computer Vision & Machine Learning Library
High-Resolution 3D Human Digitization from A Single Image
A python library built to empower developers
Joint Face Detection and Alignment
A simulator for drones, cars and more, built on Unreal Engine
Guide to deploying deep-learning inference networks
Code release for ConvNeXt model
Code Repository for Machine Learning with PyTorch and Scikit-Learn
Face Mask Detection system based on computer vision and deep learning
Code for "Large Pose 3D Face Reconstruction
Class Activation Mapping
Codebase for Image Classification Research, written in PyTorch
A real-time approach for mapping all human pixels of 2D RGB images
PyTorch implementation of SimCLR: A Simple Framework