Phi-3.5 for Mac: Locally Run Vision and Language Models
Visual Instruction Tuning: Large Language-and-Vision Assistant
A computer vision framework to create and deploy apps in minutes
CoTracker is a model for tracking any point (pixel) on a video
ChainerCV: a Library for Deep Learning in Computer Vision