3D reconstruction software
Phi-3.5 for Mac: Locally-run Vision and Language Models
Witness the aha moment of VLM with less than $3
Fast image augmentation library and an easy-to-use wrapper
Open source framework for deep learning satellite and aerial imagery
Training data (data labeling, annotation, workflow) for all data types
Making large AI models cheaper, faster and more accessible
A lightweight vision library for performing large object detection
The repository provides code for running inference with SAM 2
Hub of ready-to-use datasets for ML models
Medical imaging toolkit for deep learning
The open-source tool for building high-quality datasets
Deep learning library
ICLR2024 Spotlight: curation/training code, metadata, distribution
A neural network that transforms a design mock-up into static websites
Open Source Differentiable Computer Vision Library
Datasets, transforms and models specific to Computer Vision
Implementation of Vision Transformer, a simple way to achieve SOTA
[CVPR 2025 Best Paper Award] VGGT
Open Source Computer Vision Library
Visual Automation IDE — automate anything you see on screen
OpenFieldAI is an AI based Open Field Test Rodent Tracker
human detection using yolov8
A fast, powerful, and simple hierarchical vision transformer
Visual Instruction Tuning: Large Language-and-Vision Assistant