Download Latest Version llama-b7524-bin-910b-openEuler-x86.tar.gz (48.1 MB)
Email in envelope

Get an email when there's a new version of llama.cpp

Home / b7519
Name Modified Size InfoDownloads / Week
Parent folder
llama-b7519-xcframework.zip < 18 hours ago 150.4 MB
llama-b7519-bin-win-vulkan-x64.zip < 18 hours ago 35.1 MB
llama-b7519-bin-win-sycl-x64.zip < 18 hours ago 109.2 MB
llama-b7519-bin-win-opencl-adreno-arm64.zip < 18 hours ago 16.9 MB
llama-b7519-bin-win-hip-radeon-x64.zip < 18 hours ago 347.8 MB
llama-b7519-bin-win-cuda-13.1-x64.zip < 18 hours ago 92.7 MB
llama-b7519-bin-win-cuda-12.4-x64.zip < 18 hours ago 204.0 MB
llama-b7519-bin-win-cpu-x64.zip < 18 hours ago 20.0 MB
llama-b7519-bin-win-cpu-arm64.zip < 18 hours ago 16.3 MB
llama-b7519-bin-ubuntu-x64.tar.gz < 18 hours ago 19.2 MB
llama-b7519-bin-ubuntu-vulkan-x64.tar.gz < 18 hours ago 34.7 MB
llama-b7519-bin-ubuntu-s390x.tar.gz < 18 hours ago 22.3 MB
llama-b7519-bin-macos-x64.tar.gz < 18 hours ago 43.0 MB
llama-b7519-bin-macos-arm64.tar.gz < 18 hours ago 16.7 MB
llama-b7519-bin-910b-openEuler-x86.tar.gz < 18 hours ago 48.1 MB
llama-b7519-bin-910b-openEuler-aarch64.tar.gz < 18 hours ago 43.9 MB
llama-b7519-bin-310p-openEuler-x86.tar.gz < 18 hours ago 48.1 MB
llama-b7519-bin-310p-openEuler-aarch64.tar.gz < 18 hours ago 43.9 MB
cudart-llama-bin-win-cuda-13.1-x64.zip < 18 hours ago 402.6 MB
cudart-llama-bin-win-cuda-12.4-x64.zip < 18 hours ago 391.4 MB
b7519 source code.tar.gz < 19 hours ago 28.6 MB
b7519 source code.zip < 19 hours ago 29.5 MB
README.md < 19 hours ago 4.8 kB
Totals: 23 Items   2.2 GB 0
ggml-hexagon: create generalized functions for cpu side op (#17500) * refactor: replace ggml_hexagon_mul_mat with template-based binary operation for improved flexibility * refactor: replace ggml_hexagon_mul_mat_id with template-based binary operation for improved flexibility * refactor: initialize buffer types and streamline dspqueue_buffers_init calls for clarity * add comment * refactor: remove redundant buffer checks in hexagon supported operations * wip * add missing include to fix weak symbol warning * add ggml_hexagon_op_generic * refactor: simplify tensor operation initialization and buffer management in hexagon implementation * refactor: streamline hexagon operation initialization and buffer management * refactor: update function signatures and streamline request handling in hexagon operations * wip * ggml-hexagon: clean up code formatting and improve unary operation handling * wip * rename * fix: add support for permuted F16 tensors and enhance quantization checks in matrix operations * refactor: replace ggml_hexagon_mul_mat with template-based binary operation for improved flexibility refactor: replace ggml_hexagon_mul_mat_id with template-based binary operation for improved flexibility refactor: initialize buffer types and streamline dspqueue_buffers_init calls for clarity refactor: remove redundant buffer checks in hexagon supported operations add missing include to fix weak symbol warning add ggml_hexagon_op_generic refactor: simplify tensor operation initialization and buffer management in hexagon implementation refactor: streamline hexagon operation initialization and buffer management refactor: update function signatures and streamline request handling in hexagon operations ggml-hexagon: clean up code formatting and improve unary operation handling fix: add support for permuted F16 tensors and enhance quantization checks in matrix operations # Conflicts: # ggml/src/ggml-hexagon/ggml-hexagon.cpp * hexagon: fix merge conflicts * hexagon: minor cleanup for buffer support checks * hexagon: factor out op_desc and the overal op logging * hexagon: further simplify and cleanup op dispatch logic * snapdragon: update adb scripts to use llama-cli and llama-completion * fix pipeline failure --------- Co-authored-by: Max Krasnyansky <maxk@qti.qualcomm.com>

macOS/iOS: - macOS Apple Silicon (arm64) - macOS Intel (x64) - iOS XCFramework

Linux: - Ubuntu x64 (CPU) - Ubuntu x64 (Vulkan) - Ubuntu s390x (CPU)

Windows: - Windows x64 (CPU) - Windows arm64 (CPU) - Windows x64 (CUDA 12) - CUDA 12.4 DLLs - Windows x64 (CUDA 13) - CUDA 13.1 DLLs - Windows x64 (Vulkan) - Windows x64 (SYCL) - Windows x64 (HIP)

openEuler: - openEuler x86 (310p) - openEuler x86 (910b) - openEuler aarch64 (310p) - openEuler aarch64 (910b)

Source: README.md, updated 2025-12-23