Phi-3.5 for Mac: Locally-run Vision and Language Models
3D reconstruction software
Witness the aha moment of VLM with less than $3
The repository provides code for running inference with SAM 2
Making large AI models cheaper, faster and more accessible
[CVPR 2025 Best Paper Award] VGGT
Hub of ready-to-use datasets for ML models
Deep learning library
Training data (data labeling, annotation, workflow) for all data types
The open-source tool for building high-quality datasets
A neural network that transforms a design mock-up into static websites
Implementation of Vision Transformer, a simple way to achieve SOTA
ICLR2024 Spotlight: curation/training code, metadata, distribution
Medical imaging toolkit for deep learning
A lightweight vision library for performing large object detection
Open source framework for deep learning satellite and aerial imagery
Open Source Differentiable Computer Vision Library
Fast image augmentation library and an easy-to-use wrapper
A fast, powerful, and simple hierarchical vision transformer
Visual Instruction Tuning: Large Language-and-Vision Assistant
Open Source Computer Vision Library
OpenFieldAI is an AI based Open Field Test Rodent Tracker
A computer vision framework to create and deploy apps in minutes
FAIR's research platform for object detection research
CoTracker is a model for tracking any point (pixel) on a video