Making large AI models cheaper, faster and more accessible
Open-source framework for deep learning on satellite and aerial imagery
The repository provides code for running inference with SAM 2
The open-source tool for building high-quality datasets
ICLR 2024 Spotlight: curation/training code, metadata, distribution
Phi-3.5 for Mac: Locally-run Vision and Language Models
Training data (data labeling, annotation, workflow) for all data types
Implementation of Vision Transformer, a simple way to achieve SOTA
Hub of ready-to-use datasets for ML models
Datasets, transforms and models specific to Computer Vision
[CVPR 2025 Best Paper Award] VGGT
A lightweight vision library for performing large object detection
Deep learning library
OpenFieldAI is an AI-based Open Field Test Rodent Tracker
A fast, powerful, and simple hierarchical vision transformer
Visual Instruction Tuning: Large Language-and-Vision Assistant
A computer vision framework to create and deploy apps in minutes
CoTracker is a model for tracking any point (pixel) on a video
FAIR's research platform for object detection research
High-Resolution 3D Human Digitization from A Single Image
A Python library built to empower developers
Code release for ConvNeXt model
Face Mask Detection system based on computer vision and deep learning
Codebase for Image Classification Research, written in PyTorch
A real-time approach for mapping all human pixels of 2D RGB images