A neural network that transforms a design mock-up into static websites
Image generation model with single-stream diffusion transformer
Use Claude Code as the foundation for coding infrastructure
Official inference repo for FLUX.1 models
CLIP, Predict the most relevant text snippet given an image
Official inference repo for FLUX.2 models
Models for object and human mesh reconstruction
Machine learning image inpainting task that removes watermarks
A Powerful Native Multimodal Model for Image Generation
Flexible Photo Recrafting While Preserving Your Identity
Open-source image generative foundation model
Code for running inference with the SAM 3D Body Model 3DB
AI-powered code assistant for Vim. OpenAI and ChatGPT plugin for Vim
A Customizable Image-to-Video Model based on HunyuanVideo
This repo contains the code for 1D tokenizer and generator
Documentation for Google's Gen AI site - including Gemini API & Gemma
Provides code for running inference with the SegmentAnything Model
Claude Code / Codex skill — generate Xiaohongshu carousels
Collection of Gemma 3 variants that are trained for performance
Recovering the Visual Space from Any Views
Draw wireframe sketches and generate HTML with AI vision models
Diffusion Transformer with Fine-Grained Chinese Understanding
Export and Share your ChatGPT conversation history
"Big Model" trains a visual multimodal VLM with 26M parameters
Node.js example app from the OpenAI API quickstart tutorial