GLIDE: a diffusion-based text-conditional image synthesis model
An implementation of model parallel GPT-2 and GPT-3-style models
ICLR2024 Spotlight: curation/training code, metadata, distribution
JetBrains’ 4B parameter code model for completions
Vision-language-action model for robot control via images and text
Agentic 24B LLM optimized for coding tasks with 128k context support
Tencent’s 36-language state-of-the-art translation model
OpenAI’s compact 20B open model for fast, agentic, and local use