Llama 2 LLM
Minimal C implementation for training and inferring Llama 2
llama2.c is an open source project by Andrej Karpathy that implements inference for Llama 2 in a single ~700-line C file. It allows users to train models in PyTorch and then run inference in pure C, emphasizing minimalism and educational clarity. The repository supports loading Meta’s official Llama 2 models, though current limitations restrict practical use to smaller models (up to ~7B parameters in fp32). It draws inspiration from projects like llama.cpp but is designed to be simpler...