3D reconstruction software
Datasets, transforms and models specific to Computer Vision
Medical imaging toolkit for deep learning
Deep learning library
Open Source Computer Vision Library
The repository provides code for running inference with SAM 2
Open source framework for deep learning satellite and aerial imagery
Witness the aha moment of VLM with less than $3
Implementation of Vision Transformer, a simple way to achieve SOTA
Training data (data labeling, annotation, workflow) for all data types
Fast image augmentation library and an easy-to-use wrapper
A neural network that transforms a design mock-up into static websites
Making large AI models cheaper, faster and more accessible
Phi-3.5 for Mac: Locally-run Vision and Language Models
A fast, powerful, and simple hierarchical vision transformer
ICLR2024 Spotlight: curation/training code, metadata, distribution
[CVPR 2025 Best Paper Award] VGGT
Provides code for running inference with the SegmentAnything Model
The open-source tool for building high-quality datasets
A lightweight vision library for performing large object detection
Open Source Differentiable Computer Vision Library
Hub of ready-to-use datasets for ML models
Visual Instruction Tuning: Large Language-and-Vision Assistant
human detection using yolov8
Open Source Computer Vision Library