LLM training in simple, raw C/CUDA
On the Structural Pruning of Large Language Models
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Central interface to connect your LLM's with external data
Building Mixture-of-Experts from LLaMA with Continual Pre-training
Open-source pre-training implementation of Google's LaMDA in PyTorch