Agentic, Reasoning, and Coding (ARC) foundation models
Image generation model with single-stream diffusion transformer
Code for running inference and finetuning with SAM 3 model
Powerful AI language model (MoE) optimized for efficiency/performance
Python example app from the OpenAI API quickstart tutorial
From Images to High-Fidelity 3D Assets
Reference PyTorch implementation and models for DINOv3
PyTorch code and models for the DINOv2 self-supervised learning
Tiny vision language model
DeepSeek Coder: Let the Code Write Itself
An AI-powered security review GitHub Action using Claude
GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image
Code for running inference with the SAM 3D Body Model 3DB
Instructions on how to use the Realtime API on Microcontrollers
A Customizable Image-to-Video Model based on HunyuanVideo
Foundation Models for Time Series
Provides convenient access to the Anthropic REST API from any Python 3
Official inference repo for FLUX.1 models
Official implementation of DreamCraft3D
FlashMLA: Efficient Multi-head Latent Attention Kernels
Block Diffusion for Ultra-Fast Speculative Decoding
Open-weight, large-scale hybrid-attention reasoning model
Phi-3.5 for Mac: Locally-run Vision and Language Models
Large-language-model & vision-language-model based on Linear Attention
CLIP, Predict the most relevant text snippet given an image