Download Latest Version llama-b8094-bin-910b-openEuler-x86-aclgraph.tar.gz (61.6 MB)
Email in envelope

Get an email when there's a new version of llama.cpp

Home / b8091
Name Modified Size InfoDownloads / Week
Parent folder
llama-b8091-xcframework.zip < 7 hours ago 167.3 MB
llama-b8091-bin-win-vulkan-x64.zip < 7 hours ago 47.7 MB
llama-b8091-bin-win-sycl-x64.zip < 7 hours ago 120.6 MB
llama-b8091-bin-win-opencl-adreno-arm64.zip < 7 hours ago 25.3 MB
llama-b8091-bin-win-hip-radeon-x64.zip < 7 hours ago 369.3 MB
llama-b8091-bin-win-cuda-13.1-x64.zip < 7 hours ago 148.5 MB
llama-b8091-bin-win-cuda-12.4-x64.zip < 7 hours ago 220.0 MB
llama-b8091-bin-win-cpu-x64.zip < 7 hours ago 31.0 MB
llama-b8091-bin-win-cpu-arm64.zip < 7 hours ago 24.4 MB
llama-b8091-bin-ubuntu-x64.tar.gz < 7 hours ago 24.6 MB
llama-b8091-bin-ubuntu-vulkan-x64.tar.gz < 7 hours ago 41.5 MB
llama-b8091-bin-ubuntu-s390x.tar.gz < 7 hours ago 25.7 MB
llama-b8091-bin-macos-x64.tar.gz < 7 hours ago 86.1 MB
llama-b8091-bin-macos-arm64.tar.gz < 7 hours ago 30.4 MB
llama-b8091-bin-910b-openEuler-x86-aclgraph.tar.gz < 7 hours ago 61.5 MB
llama-b8091-bin-910b-openEuler-aarch64-aclgraph.tar.gz < 7 hours ago 55.6 MB
llama-b8091-bin-310p-openEuler-x86.tar.gz < 7 hours ago 61.5 MB
llama-b8091-bin-310p-openEuler-aarch64.tar.gz < 7 hours ago 55.5 MB
cudart-llama-bin-win-cuda-13.1-x64.zip < 7 hours ago 402.6 MB
cudart-llama-bin-win-cuda-12.4-x64.zip < 7 hours ago 391.4 MB
b8091 source code.tar.gz < 13 hours ago 29.1 MB
b8091 source code.zip < 13 hours ago 30.1 MB
README.md < 13 hours ago 3.6 kB
Totals: 23 Items   2.4 GB 0
ggml webgpu: shader library organization (#19530) * Basic JIT compilation for mul_mat, get_rows, and scale (#17) * scale jit working * preliminary working jit for getrows and mulmat, needs refining * simplified mul_mat preprocessing switch statement * get_rows fixes, mul_mat refinement * formatted + last edits * removed some extraneous prints * fixed get_rows, fixed workgroup dispatch in mul_mat. no gibberish * small fix * some changes, working * get_rows and mul_mat jit fixed and working * Update formatting * formatting * Add header --------- Co-authored-by: Neha Abbas <nehaabbas@ReeseLevines-MacBook-Pro.local> Co-authored-by: Reese Levine <reeselevine1@gmail.com> * Start work on all-encompassing shader library * refactor argmax, set_rows * Refactor all but flashattention, mat mul * flashattention and matrix multiplication moved to new format * clean up preprocessing * Formatting * remove duplicate constants * Split large shaders into multiple static strings --------- Co-authored-by: neha-ha <137219201+neha-ha@users.noreply.github.com>

macOS/iOS:

Linux:

Windows:

openEuler:

Source: README.md, updated 2026-02-18