A neural network that transforms a design mock-up into static websites
Phi-3.5 for Mac: Locally-run Vision and Language Models
[CVPR 2025 Best Paper Award] VGGT
ICLR2024 Spotlight: curation/training code, metadata, distribution
Code release for ConvNeXt model
A real-time approach for mapping all human pixels of 2D RGB images
Chrome Extension that displays automated image tags from Facebook