Code for running inference and finetuning with SAM 3 model
Agentic, Reasoning, and Coding (ARC) foundation models
Advanced language and coding AI model
The ChatGPT Retrieval Plugin lets you easily find personal documents
CLIP, Predict the most relevant text snippet given an image
Implementation of "MobileCLIP" CVPR 2024
PyTorch code and models for the DINOv2 self-supervised learning
Large Multimodal Models for Video Understanding and Editing
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Open Multilingual Multimodal Chat LMs
Reference implementation of the Transformer architecture optimized