Code release for "Masked-attention Mask Transformer
GLIDE: a diffusion-based text-conditional image synthesis model
Large-scale autoregressive pixel model for image generation by OpenAI
Environment generation code for the paper "Emergent Tool Use"
A library for Multilingual Unsupervised or Supervised word Embeddings
Code for "Image Generation from Scene Graphs", Johnson et al., CVPR 201
Large language model developed and released by NVIDIA
Instruction-tuned 7B language model for chat and complex tasks
OpenAI’s open-weight 120B model optimized for reasoning and tooling
VaultGemma: 1B DP-trained Gemma variant for private NLP tasks
Lightweight multimodal translation model for 55 languages
Hermes 4 FP8: hybrid reasoning Llama-3.1-405B model by Nous Research
JetBrains’ 4B parameter code model for completions
High-efficiency reasoning and agentic intelligence model
Efficient 13B MoE language model with long context and reasoning modes
Frontier-scale 675B multimodal instruct MoE model for enterprise AI
Vision-language-action model for robot control via images and text
Portuguese ASR model fine-tuned on XLSR-53 for 16kHz audio input