Open-source pre-training implementation of Google's LaMDA in PyTorch
Code release for "Masked-attention Mask Transformer
PyTorch implementation of MAE
GLIDE: a diffusion-based text-conditional image synthesis model
An implementation of model parallel GPT-2 and GPT-3-style models
The official pytorch implementation of our paper
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
Large-scale autoregressive pixel model for image generation by OpenAI
Generate embeddings from large-scale graph-structured data
A library for Multilingual Unsupervised or Supervised word Embeddings
Code for the paper "Improved Techniques for Training GANs"
Code for reproducing key results in the paper
Code for "Image Generation from Scene Graphs", Johnson et al, CVPR 201
JetBrains’ 4B parameter code model for completions
Dia-1.6B generates lifelike English dialogue and vocal expressions
Tencent’s 36-language state-of-the-art translation model
OpenAI’s compact 20B open model for fast, agentic, and local use
CTC-based forced aligner for audio-text in 158 languages
Vision-language-action model for robot control via images and text