Home / b8295
Name Modified Size Downloads / Week
llama-b8295-xcframework.zip < 9 hours ago 173.0 MB
llama-b8295-bin-win-vulkan-x64.zip < 9 hours ago 52.0 MB
llama-b8295-bin-win-sycl-x64.zip < 9 hours ago 130.8 MB
llama-b8295-bin-win-opencl-adreno-arm64.zip < 9 hours ago 29.2 MB
llama-b8295-bin-win-hip-radeon-x64.zip < 9 hours ago 349.2 MB
llama-b8295-bin-win-cuda-13.1-x64.zip < 9 hours ago 152.9 MB
llama-b8295-bin-win-cuda-12.4-x64.zip < 9 hours ago 224.6 MB
llama-b8295-bin-win-cpu-x64.zip < 9 hours ago 35.0 MB
llama-b8295-bin-win-cpu-arm64.zip < 9 hours ago 28.2 MB
llama-b8295-bin-ubuntu-x64.tar.gz < 9 hours ago 28.6 MB
llama-b8295-bin-ubuntu-vulkan-x64.tar.gz < 9 hours ago 45.8 MB
llama-b8295-bin-ubuntu-s390x.tar.gz < 9 hours ago 30.5 MB
llama-b8295-bin-ubuntu-rocm-7.2-x64.tar.gz < 9 hours ago 149.1 MB
llama-b8295-bin-macos-x64.tar.gz < 9 hours ago 92.8 MB
llama-b8295-bin-macos-arm64.tar.gz < 9 hours ago 35.9 MB
llama-b8295-bin-910b-openEuler-x86-aclgraph.tar.gz < 9 hours ago 64.7 MB
llama-b8295-bin-910b-openEuler-aarch64-aclgraph.tar.gz < 9 hours ago 58.1 MB
llama-b8295-bin-310p-openEuler-x86.tar.gz < 9 hours ago 64.7 MB
llama-b8295-bin-310p-openEuler-aarch64.tar.gz < 9 hours ago 58.1 MB
cudart-llama-bin-win-cuda-13.1-x64.zip < 9 hours ago 402.6 MB
cudart-llama-bin-win-cuda-12.4-x64.zip < 9 hours ago 391.4 MB
b8295 source code.tar.gz 2026-03-11 29.5 MB
b8295 source code.zip 2026-03-11 30.6 MB
README.md 2026-03-11 3.0 kB
Totals: 24 Items   2.7 GB 0
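
The archive names in the listing above follow a visible pattern (`llama-<build>-bin-<platform>-<arch>`), so picking the right file for a machine can be scripted. The sketch below is only an illustration: the helper `base_archive_name` and its lookup table are hypothetical, and the pattern is inferred from the filenames in this listing rather than from any documented naming guarantee. It covers only the archives without an extra backend suffix (no Vulkan, CUDA, ROCm, and so on).

```python
# Hypothetical helper: maps an OS / architecture pair to the archive name
# shown in the listing for build b8295. The pattern is inferred from those
# filenames and is an assumption, not an official specification.
BASE_ARCHIVES = {
    ("windows", "x64"): "llama-{build}-bin-win-cpu-x64.zip",
    ("windows", "arm64"): "llama-{build}-bin-win-cpu-arm64.zip",
    ("linux", "x64"): "llama-{build}-bin-ubuntu-x64.tar.gz",
    ("linux", "s390x"): "llama-{build}-bin-ubuntu-s390x.tar.gz",
    ("macos", "x64"): "llama-{build}-bin-macos-x64.tar.gz",
    ("macos", "arm64"): "llama-{build}-bin-macos-arm64.tar.gz",
}


def base_archive_name(os_name: str, arch: str, build: str = "b8295") -> str:
    """Return the archive filename for the given OS and architecture."""
    key = (os_name.lower(), arch.lower())
    if key not in BASE_ARCHIVES:
        raise KeyError(f"no archive listed for {os_name}/{arch}")
    return BASE_ARCHIVES[key].format(build=build)


if __name__ == "__main__":
    print(base_archive_name("Linux", "x64"))
```

Keeping the mapping in a plain dictionary makes it easy to check against the listing by eye and to extend with the backend-specific archives if needed.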
llama : add support for Nemotron 3 Super (#20411)

* llama : add support for Nemotron 3 Super

This commit adds support for the Nemotron 3 Super model (120B.A12B) enabling this model to be converted to GGUF format and run in llama.cpp.

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Co-authored-by: Matt Clayton <156335168+mattjcly@users.noreply.github.com>

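macOS/iOS:
- llama-b8295-bin-macos-arm64.tar.gz
- llama-b8295-bin-macos-x64.tar.gz
- llama-b8295-xcframework.zip

Linux:
- llama-b8295-bin-ubuntu-x64.tar.gz
- llama-b8295-bin-ubuntu-vulkan-x64.tar.gz
- llama-b8295-bin-ubuntu-rocm-7.2-x64.tar.gz
- llama-b8295-bin-ubuntu-s390x.tar.gz

Windows:
- llama-b8295-bin-win-cpu-x64.zip
- llama-b8295-bin-win-cpu-arm64.zip
- llama-b8295-bin-win-cuda-12.4-x64.zip (runtime DLLs: cudart-llama-bin-win-cuda-12.4-x64.zip)
- llama-b8295-bin-win-cuda-13.1-x64.zip (runtime DLLs: cudart-llama-bin-win-cuda-13.1-x64.zip)
- llama-b8295-bin-win-vulkan-x64.zip
- llama-b8295-bin-win-sycl-x64.zip
- llama-b8295-bin-win-hip-radeon-x64.zip
- llama-b8295-bin-win-opencl-adreno-arm64.zip

openEuler:
- llama-b8295-bin-310p-openEuler-x86.tar.gz
- llama-b8295-bin-310p-openEuler-aarch64.tar.gz
- llama-b8295-bin-910b-openEuler-x86-aclgraph.tar.gz
- llama-b8295-bin-910b-openEuler-aarch64-aclgraph.tar.gz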

Source: README.md, updated 2026-03-11