Open-source pre-training implementation of Google's LaMDA in PyTorch
Code release for "Masked-attention Mask Transformer
PyTorch implementation of MAE
GLIDE: a diffusion-based text-conditional image synthesis model
Per-Pixel Classification is Not All You Need for Semantic Segmentation
An implementation of model parallel GPT-2 and GPT-3-style models
The official pytorch implementation of our paper
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
Reproduces results of "Fixing the train-test resolution discrepancy"
Large-scale autoregressive pixel model for image generation by OpenAI
A mix of GAN implementations including progressive growing
Learning Continuous Signed Distance Functions for Shape Representation
Generate embeddings from large-scale graph-structured data
A library for Multilingual Unsupervised or Supervised word Embeddings
Code for the paper "Improved Techniques for Training GANs"
Code for reproducing key results in the paper
Code for "Image Generation from Scene Graphs", Johnson et al, CVPR 201
Dia-1.6B generates lifelike English dialogue and vocal expressions
JetBrains’ 4B parameter code model for completions
OpenAI’s compact 20B open model for fast, agentic, and local use
Tencent’s 36-language state-of-the-art translation model
CTC-based forced aligner for audio-text in 158 languages
Vision-language-action model for robot control via images and text