Witness the aha moment of VLM with less than $3
Visual Instruction Tuning: Large Language-and-Vision Assistant
Training data (data labeling, annotation, workflow) for all data types
Making large AI models cheaper, faster and more accessible
Phi-3.5 for Mac: Locally-run Vision and Language Models
Fast image augmentation library and an easy-to-use wrapper
Hub of ready-to-use datasets for ML models
3D reconstruction software
Medical imaging toolkit for deep learning
A lightweight vision library for performing large object detection
Open source framework for deep learning satellite and aerial imagery
Deep learning library
ICLR2024 Spotlight: curation/training code, metadata, distribution
The open-source tool for building high-quality datasets
A neural network that transforms a design mock-up into static websites
Implementation of Vision Transformer, a simple way to achieve SOTA
Open Source Differentiable Computer Vision Library
A fast, powerful, and simple hierarchical vision transformer
[CVPR 2025 Best Paper Award] VGGT
Datasets, transforms and models specific to Computer Vision
The repository provides code for running inference with SAM 2
Open Source Computer Vision Library
CoTracker is a model for tracking any point (pixel) on a video
FAIR's research platform for object detection research
OpenFieldAI is an AI based Open Field Test Rodent Tracker