| Name | Modified | Size | Downloads / Week |
| --- | --- | --- | --- |
| cudart-llama-bin-win-cuda-12.4-x64.zip | < 8 hours ago | 391.4 MB | |
| b7378 source code.tar.gz | < 14 hours ago | 28.1 MB | |
| b7378 source code.zip | < 14 hours ago | 29.0 MB | |
| README.md | < 14 hours ago | 2.4 kB | |
| Totals: 4 items | | 448.6 MB | 0 |

> [!WARNING]
> Release Format Update: Linux releases will soon use .tar.gz archives instead of .zip. Please make the necessary changes to your deployment scripts.

add llama-completion to completion-bash executables (#17976)

macOS/iOS:
- macOS Apple Silicon (arm64)
- macOS Intel (x64)
- iOS XCFramework

Linux:
- Ubuntu x64 (CPU)
- Ubuntu x64 (Vulkan)
- Ubuntu s390x (CPU)

Windows:
- Windows x64 (CPU)
- Windows arm64 (CPU)
- Windows x64 (CUDA 12)
- Windows x64 (CUDA 13)
- Windows x64 (Vulkan)
- Windows x64 (SYCL)
- Windows x64 (HIP)

openEuler:
- openEuler x86 (310p)
- openEuler x86 (910b)
- openEuler aarch64 (310p)
- openEuler aarch64 (910b)

Source: README.md, updated 2025-12-13