Download Latest Version llama-b7932-bin-910b-openEuler-x86-aclgraph.tar.gz (61.5 MB)
Email in envelope

Get an email when there's a new version of llama.cpp

Home / b7931
Name Modified Size InfoDownloads / Week
Parent folder
llama-b7931-xcframework.zip < 12 hours ago 175.3 MB
llama-b7931-bin-win-vulkan-x64.zip < 12 hours ago 47.3 MB
llama-b7931-bin-win-sycl-x64.zip < 12 hours ago 120.3 MB
llama-b7931-bin-win-opencl-adreno-arm64.zip < 12 hours ago 25.0 MB
llama-b7931-bin-win-hip-radeon-x64.zip < 12 hours ago 368.3 MB
llama-b7931-bin-win-cuda-13.1-x64.zip < 12 hours ago 147.8 MB
llama-b7931-bin-win-cuda-12.4-x64.zip < 12 hours ago 219.1 MB
llama-b7931-bin-win-cpu-x64.zip < 12 hours ago 30.7 MB
llama-b7931-bin-win-cpu-arm64.zip < 12 hours ago 24.2 MB
llama-b7931-bin-ubuntu-x64.tar.gz < 12 hours ago 24.5 MB
llama-b7931-bin-ubuntu-vulkan-x64.tar.gz < 12 hours ago 41.4 MB
llama-b7931-bin-ubuntu-s390x.tar.gz < 12 hours ago 25.3 MB
llama-b7931-bin-macos-x64.tar.gz < 12 hours ago 85.0 MB
llama-b7931-bin-macos-arm64.tar.gz < 12 hours ago 30.0 MB
llama-b7931-bin-910b-openEuler-x86-aclgraph.tar.gz < 12 hours ago 61.5 MB
llama-b7931-bin-910b-openEuler-aarch64-aclgraph.tar.gz < 12 hours ago 55.5 MB
llama-b7931-bin-310p-openEuler-x86.tar.gz < 12 hours ago 61.5 MB
llama-b7931-bin-310p-openEuler-aarch64.tar.gz < 12 hours ago 55.5 MB
cudart-llama-bin-win-cuda-13.1-x64.zip < 12 hours ago 402.6 MB
cudart-llama-bin-win-cuda-12.4-x64.zip < 12 hours ago 391.4 MB
b7931 source code.tar.gz < 17 hours ago 28.9 MB
b7931 source code.zip < 17 hours ago 29.9 MB
README.md < 17 hours ago 3.3 kB
Totals: 23 Items   2.5 GB 0
ggml-virtgpu: make the code thread safe (#19204) * ggml-virtgpu: regenerate_remoting.py: add the ability to deprecate a function * ggml-virtgpu: deprecate buffer_type is_host remoting not necessary * ggml-virtgpu: stop using static vars as cache The static init isn't thread safe. * ggml-virtgpu: protect the use of the shared memory to transfer data * ggml-virtgpu: make the remote calls thread-safe * ggml-virtgpu: backend: don't continue if couldn't allocate the tensor memory * ggml-virtgpu: add a cleanup function for consistency * ggml-virtgpu: backend: don't crash if buft->iface.get_max_size is missing * fix style and ordering * Remove the static variable in apir_device_get_count * ggml-virtgpu: improve the logging * fix review minor formatting changes

macOS/iOS: - macOS Apple Silicon (arm64) - macOS Intel (x64) - iOS XCFramework

Linux: - Ubuntu x64 (CPU) - Ubuntu x64 (Vulkan) - Ubuntu s390x (CPU)

Windows: - Windows x64 (CPU) - Windows arm64 (CPU) - Windows x64 (CUDA 12) - CUDA 12.4 DLLs - Windows x64 (CUDA 13) - CUDA 13.1 DLLs - Windows x64 (Vulkan) - Windows x64 (SYCL) - Windows x64 (HIP)

openEuler: - openEuler x86 (310p) - openEuler x86 (910b, ACL Graph) - openEuler aarch64 (310p) - openEuler aarch64 (910b, ACL Graph)

Source: README.md, updated 2026-02-04