Open-source pre-training implementation of Google's LaMDA in PyTorch
Code release for "Masked-attention Mask Transformer
PyTorch implementation of MAE
GLIDE: a diffusion-based text-conditional image synthesis model
An implementation of model parallel GPT-2 and GPT-3-style models
Facebook AI Research Sequence-to-Sequence Toolkit
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
Large-scale autoregressive pixel model for image generation by OpenAI
A mix of GAN implementations, including progressive growing
Learning embeddings for classification, retrieval and ranking
Learning Continuous Signed Distance Functions for Shape Representation
Generate embeddings from large-scale graph-structured data
A library for Multilingual Unsupervised or Supervised word Embeddings
Code for "Image Generation from Scene Graphs", Johnson et al, CVPR 201
React app for inspecting, building and debugging with the Realtime API
Vision-language-action model for robot control via images and text
Instruction-tuned 7B language model for chat and complex tasks
Open, non-commercial SDXL model for quality image generation
Lightweight multimodal translation model for 55 languages
Compact hybrid reasoning language model for intelligent responses
JetBrains’ 4B-parameter code model for completions
Frontier-scale 675B multimodal base model for custom AI training
Text-to-image model optimized for artistic quality and safe generation
Lightweight 24B agentic coding model with vision and long context