Implementation of model parallel autoregressive transformers on GPUs
Code release for ConvNeXt V2 model
A minimal PyTorch re-implementation of the OpenAI GPT
A collection of high-quality models for the MuJoCo physics engine
Learning to Act by Watching Unlabeled Online Videos
Open-source pre-training implementation of Google's LaMDA in PyTorch
Code release for "Masked-attention Mask Transformer
PyTorch implementation of MAE
GLIDE: a diffusion-based text-conditional image synthesis model
Per-Pixel Classification is Not All You Need for Semantic Segmentation
An implementation of model parallel GPT-2 and GPT-3-style models
The official pytorch implementation of our paper
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
Reproduces results of "Fixing the train-test resolution discrepancy"
Large-scale autoregressive pixel model for image generation by OpenAI
Learning Continuous Signed Distance Functions for Shape Representation
Generate embeddings from large-scale graph-structured data
A library for Multilingual Unsupervised or Supervised word Embeddings
Code for the paper "Improved Techniques for Training GANs"
Code for reproducing key results in the paper
Code for "Image Generation from Scene Graphs", Johnson et al, CVPR 201
JetBrains’ 4B parameter code model for completions
Dia-1.6B generates lifelike English dialogue and vocal expressions
Tencent’s 36-language state-of-the-art translation model
OpenAI’s compact 20B open model for fast, agentic, and local use