Visual Instruction Tuning: Large Language-and-Vision Assistant
Training data (data labeling, annotation, workflow) for all data types
Phi-3.5 for Mac: Locally-run Vision and Language Models
Fast image augmentation library and an easy-to-use wrapper
Witness the aha moment of VLM with less than $3
Medical imaging toolkit for deep learning
Open source framework for deep learning satellite and aerial imagery
3D reconstruction software
Making large AI models cheaper, faster and more accessible
Deep learning library
Hub of ready-to-use datasets for ML models
A lightweight vision library for performing large-scale object detection
A neural network that transforms a design mock-up into a static website
ICLR 2024 Spotlight: curation/training code, metadata, distribution
The open-source tool for building high-quality datasets
Datasets, transforms and models specific to Computer Vision
Implementation of Vision Transformer, a simple way to achieve SOTA
A fast, powerful, and simple hierarchical vision transformer
Open Source Differentiable Computer Vision Library
[CVPR 2025 Best Paper Award] VGGT
The repository provides code for running inference with SAM 2
Human detection using YOLOv8
CoTracker is a model for tracking any point (pixel) on a video
Open Source Computer Vision Library
FAIR's research platform for object detection research