Structure-from-Motion and Multi-View Stereo
Python SDK for the Computer Use model Lux, developed by OpenAGI
A neural network that transforms a design mock-up into static websites
Implementation of Vision Transformer, a simple way to achieve SOTA
Phi-3.5 for Mac: Locally-run Vision and Language Models
The repository provides code for running inference with SAM 2
[CVPR 2025 Best Paper Award] VGGT
Discover pretrained models for deep learning in MATLAB
A fast, powerful, and simple hierarchical vision transformer
ICLR 2024 Spotlight: curation/training code, metadata, distribution
Provides code for running inference with the Segment Anything Model
Pushing the Limits of Mathematical Reasoning in Open Language Models
Please do not feed the models
Job search strategy for AI algorithm positions
RL research on Android devices
Fast C++ library for linear algebra & scientific computing
A computer vision framework to create and deploy apps in minutes
SSD-based object detection model trained on Open Images V4
BlazeFace is a lightweight model that detects faces in images
CoTracker is a model for tracking any point (pixel) on a video
FAIR's research platform for object detection research
Document papers compiled daily in computer vision/deep learning
Resources to learn computer science in your spare time
High-Resolution 3D Human Digitization from A Single Image
Machine learning algorithms for advanced analytics