Guiding Instruction-based Image Editing via Multimodal Large Language
This repository contains the official implementation of FastVLM
Refer and Ground Anything Anywhere at Any Granularity
Utilities intended for use with Llama models
ICLR2024 Spotlight: curation/training code, metadata, distribution
PyTorch code and models for V-JEPA self-supervised learning from video
A PyTorch library for implementing flow matching algorithms
An implementation of a deep learning recommendation model (DLRM)
Official DeiT repository
PyTorch3D is FAIR's library of reusable components for deep learning
[CVPR 2025 Best Paper Award] VGGT
PyTorch code and models for the DINOv2 self-supervised learning
Provides code for running inference with the SegmentAnything Model
Anthropic's Interactive Prompt Engineering Tutorial
Memory-efficient and performant finetuning of Mistral's models
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
Official implementation of DreamCraft3D
Open-source large language model family from Tencent Hunyuan
Transformers4Rec is a flexible and efficient library
Sample code and notebooks for Generative AI on Google Cloud
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
Unified Multimodal Understanding and Generation Models
LLM powered fuzzing via OSS-Fuzz
Beyond the Imitation Game collaborative benchmark for measuring
An alignment auditing agent capable of exploring alignment hypothesis