Official PyTorch Implementation of "Scalable Diffusion Models"
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
LLaMA: Open and Efficient Foundation Language Models
Implementation of model parallel autoregressive transformers on GPUs
A minimal PyTorch re-implementation of the OpenAI GPT
Open-source pre-training implementation of Google's LaMDA in PyTorch
GLIDE: a diffusion-based text-conditional image synthesis model
An implementation of model parallel GPT-2 and GPT-3-style models
JetBrains' 4B parameter code model for completions
Vision-language-action model for robot control via images and text
OpenAI's compact 20B open model for fast, agentic, and local use
Tencent's 36-language state-of-the-art translation model
High-efficiency reasoning and agentic intelligence model
Kimi K2: 1T-param MoE model for advanced coding and agentic reasoning