Learning to Act by Watching Unlabeled Online Videos
Open-source pre-training implementation of Google's LaMDA in PyTorch
Code release for "Masked-attention Mask Transformer
PyTorch implementation of MAE
GLIDE: a diffusion-based text-conditional image synthesis model
Per-Pixel Classification is Not All You Need for Semantic Segmentation
PyTorch implementation of YOLOv4
An implementation of model parallel GPT-2 and GPT-3-style models
The official pytorch implementation of our paper
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
Large-scale autoregressive pixel model for image generation by OpenAI
Reproduces results of "Fixing the train-test resolution discrepancy"
Environment generation code for the paper "Emergent Tool Use"
A mix of GAN implementations including progressive growing
Code for the paper "Improved Techniques for Training GANs"
Code for "Image Generation from Scene Graphs", Johnson et al, CVPR 201
Dual LSTM Encoder for Dialog Response Generation
Open-source code agent designed for Lean 4
LL model providing reasoning and conversational capabilities
Open language model developed by NVIDIA as part of Nemotron-3 family
Model that fuses instruct, reasoning and agentic skills
Tencent’s 36-language state-of-the-art translation model
JetBrains’ 4B parameter code model for completions
High-compute ultra-reasoning model surpassing model surpassing GPT-5
High-efficiency reasoning and agentic intelligence model