Home / b7418
| Name | Modified | Size |
| --- | --- | --- |
| llama-b7418-xcframework.zip | < 10 hours ago | 148.4 MB |
| llama-b7418-xcframework.tar.gz | < 10 hours ago | 148.5 MB |
| llama-b7418-bin-win-vulkan-x64.zip | < 10 hours ago | 34.8 MB |
| llama-b7418-bin-win-sycl-x64.zip | < 10 hours ago | 109.0 MB |
| llama-b7418-bin-win-opencl-adreno-arm64.zip | < 10 hours ago | 16.7 MB |
| llama-b7418-bin-win-hip-radeon-x64.zip | < 10 hours ago | 347.7 MB |
| llama-b7418-bin-win-cuda-13.1-x64.zip | < 10 hours ago | 92.5 MB |
| llama-b7418-bin-win-cuda-12.4-x64.zip | < 10 hours ago | 203.8 MB |
| llama-b7418-bin-win-cpu-x64.zip | < 10 hours ago | 19.8 MB |
| llama-b7418-bin-win-cpu-arm64.zip | < 10 hours ago | 16.1 MB |
| llama-b7418-bin-ubuntu-x64.zip | < 10 hours ago | 18.9 MB |
| llama-b7418-bin-ubuntu-x64.tar.gz | < 10 hours ago | 18.9 MB |
| llama-b7418-bin-ubuntu-vulkan-x64.zip | < 10 hours ago | 34.4 MB |
| llama-b7418-bin-ubuntu-vulkan-x64.tar.gz | < 10 hours ago | 34.4 MB |
| llama-b7418-bin-ubuntu-s390x.zip | < 10 hours ago | 18.9 MB |
| llama-b7418-bin-ubuntu-s390x.tar.gz | < 10 hours ago | 22.1 MB |
| llama-b7418-bin-macos-x64.zip | < 10 hours ago | 42.3 MB |
| llama-b7418-bin-macos-x64.tar.gz | < 10 hours ago | 42.4 MB |
| llama-b7418-bin-macos-arm64.zip | < 10 hours ago | 16.4 MB |
| llama-b7418-bin-macos-arm64.tar.gz | < 10 hours ago | 16.5 MB |
| llama-b7418-bin-910b-openEuler-x86.tar.gz | < 10 hours ago | 47.5 MB |
| llama-b7418-bin-910b-openEuler-aarch64.tar.gz | < 10 hours ago | 43.4 MB |
| llama-b7418-bin-310p-openEuler-x86.tar.gz | < 10 hours ago | 47.5 MB |
| llama-b7418-bin-310p-openEuler-aarch64.tar.gz | < 10 hours ago | 43.4 MB |
| cudart-llama-bin-win-cuda-13.1-x64.zip | < 10 hours ago | 402.6 MB |
| cudart-llama-bin-win-cuda-12.4-x64.zip | < 10 hours ago | 391.4 MB |
| b7418 source code.tar.gz | < 12 hours ago | 28.2 MB |
| b7418 source code.zip | < 12 hours ago | 29.1 MB |
| README.md | < 12 hours ago | 2.7 kB |

Totals: 29 items, 2.4 GB

> [!WARNING]
> **Release Format Update**: Linux releases will soon use .tar.gz archives instead of .zip. Please make the necessary changes to your deployment scripts.
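For deployment scripts that must keep working across the format switch, one option is to choose the extraction routine from the file extension. The following is a minimal sketch using only the Python standard library; the function name `extract_release` is hypothetical and not part of llama.cpp, and it assumes the archive comes from a source you trust.

```python
import tarfile
import zipfile
from pathlib import Path


def extract_release(archive: Path, dest: Path) -> None:
    """Unpack a release archive in either .zip or .tar.gz format.

    Hypothetical helper for illustration; only use it on trusted archives.
    """
    dest.mkdir(parents=True, exist_ok=True)
    name = archive.name
    if name.endswith(".zip"):
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(dest)
    elif name.endswith((".tar.gz", ".tgz")):
        with tarfile.open(archive, "r:gz") as tf:
            tf.extractall(dest)
    else:
        raise ValueError(f"unsupported archive format: {name}")
```

A script that calls `extract_release(Path("llama-b7418-bin-ubuntu-x64.tar.gz"), Path("llama"))` should then need no change when the `.zip` variant is no longer published, as long as the download step requests the `.tar.gz` file name.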

llama : add support for NVIDIA Nemotron 3 Nano (#18058)

* llama : add support for NVIDIA Nemotron Nano 3

  This commit adds support for the NVIDIA Nemotron Nano 3 model, enabling the conversion and running of this model.

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

macOS/iOS:
- macOS Apple Silicon (arm64)
- macOS Intel (x64)
- iOS XCFramework

Linux:
- Ubuntu x64 (CPU)
- Ubuntu x64 (Vulkan)
- Ubuntu s390x (CPU)

Windows:
- Windows x64 (CPU)
- Windows arm64 (CPU)
- Windows x64 (CUDA 12)
- Windows x64 (CUDA 13)
- Windows x64 (Vulkan)
- Windows x64 (SYCL)
- Windows x64 (HIP)

openEuler:
- openEuler x86 (310p)
- openEuler x86 (910b)
- openEuler aarch64 (310p)
- openEuler aarch64 (910b)
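The CPU builds in this release follow the naming pattern `llama-<tag>-bin-<platform>.<ext>`, so a script can derive the asset name from the host platform. The sketch below covers only the CPU builds in the b7418 file listing. The function name `cpu_asset_name` is hypothetical, and the `(system, machine)` keys are assumed to match what Python's `platform.system()` and `platform.machine()` typically report on each host.

```python
import platform

# Platform suffixes copied from the b7418 file listing (CPU builds only).
# The (system, machine) keys are assumptions about typical
# platform.system() / platform.machine() values on each host.
CPU_ASSETS = {
    ("Darwin", "arm64"): "macos-arm64",
    ("Darwin", "x86_64"): "macos-x64",
    ("Linux", "x86_64"): "ubuntu-x64",
    ("Linux", "s390x"): "ubuntu-s390x",
    ("Windows", "AMD64"): "win-cpu-x64",
    ("Windows", "ARM64"): "win-cpu-arm64",
}


def cpu_asset_name(tag: str, system: str, machine: str) -> str:
    """Build the CPU asset file name for a release tag (hypothetical helper)."""
    try:
        suffix = CPU_ASSETS[(system, machine)]
    except KeyError:
        raise ValueError(f"no CPU build listed for {system}/{machine}") from None
    # Windows assets in the listing are .zip only; macOS and Linux builds
    # are also published as .tar.gz, the format the release notes favor.
    ext = "zip" if system == "Windows" else "tar.gz"
    return f"llama-{tag}-bin-{suffix}.{ext}"


if __name__ == "__main__":
    print(cpu_asset_name("b7418", platform.system(), platform.machine()))
```

GPU variants (Vulkan, CUDA, SYCL, HIP) and the openEuler builds use different suffixes and would need their own entries, so this table is a starting point rather than a full mapping.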

Source: README.md, updated 2025-12-16