llama.cpp - Browse /b7519 at SourceForge.net


Name	Modified	Size	Downloads / Week
llama-b7519-xcframework.zip	< 18 hours ago	150.4 MB	0
llama-b7519-bin-win-vulkan-x64.zip	< 18 hours ago	35.1 MB	0
llama-b7519-bin-win-sycl-x64.zip	< 18 hours ago	109.2 MB	0
llama-b7519-bin-win-opencl-adreno-arm64.zip	< 18 hours ago	16.9 MB	0
llama-b7519-bin-win-hip-radeon-x64.zip	< 18 hours ago	347.8 MB	0
llama-b7519-bin-win-cuda-13.1-x64.zip	< 18 hours ago	92.7 MB	0
llama-b7519-bin-win-cuda-12.4-x64.zip	< 18 hours ago	204.0 MB	0
llama-b7519-bin-win-cpu-x64.zip	< 18 hours ago	20.0 MB	0
llama-b7519-bin-win-cpu-arm64.zip	< 18 hours ago	16.3 MB	0
llama-b7519-bin-ubuntu-x64.tar.gz	< 18 hours ago	19.2 MB	0
llama-b7519-bin-ubuntu-vulkan-x64.tar.gz	< 18 hours ago	34.7 MB	0
llama-b7519-bin-ubuntu-s390x.tar.gz	< 18 hours ago	22.3 MB	0
llama-b7519-bin-macos-x64.tar.gz	< 18 hours ago	43.0 MB	0
llama-b7519-bin-macos-arm64.tar.gz	< 18 hours ago	16.7 MB	0
llama-b7519-bin-910b-openEuler-x86.tar.gz	< 18 hours ago	48.1 MB	0
llama-b7519-bin-910b-openEuler-aarch64.tar.gz	< 18 hours ago	43.9 MB	0
llama-b7519-bin-310p-openEuler-x86.tar.gz	< 18 hours ago	48.1 MB	0
llama-b7519-bin-310p-openEuler-aarch64.tar.gz	< 18 hours ago	43.9 MB	0
cudart-llama-bin-win-cuda-13.1-x64.zip	< 18 hours ago	402.6 MB	0
cudart-llama-bin-win-cuda-12.4-x64.zip	< 18 hours ago	391.4 MB	0
b7519 source code.tar.gz	< 19 hours ago	28.6 MB	0
b7519 source code.zip	< 19 hours ago	29.5 MB	0
README.md	< 19 hours ago	4.8 kB	0
Totals: 23 Items		2.2 GB	0

ggml-hexagon: create generalized functions for cpu side op (#17500)

* refactor: replace ggml_hexagon_mul_mat with template-based binary operation for improved flexibility
* refactor: replace ggml_hexagon_mul_mat_id with template-based binary operation for improved flexibility
* refactor: initialize buffer types and streamline dspqueue_buffers_init calls for clarity
* add comment
* refactor: remove redundant buffer checks in hexagon supported operations
* wip
* add missing include to fix weak symbol warning
* add ggml_hexagon_op_generic
* refactor: simplify tensor operation initialization and buffer management in hexagon implementation
* refactor: streamline hexagon operation initialization and buffer management
* refactor: update function signatures and streamline request handling in hexagon operations
* wip
* ggml-hexagon: clean up code formatting and improve unary operation handling
* wip
* rename
* fix: add support for permuted F16 tensors and enhance quantization checks in matrix operations
* (repeat of the entries above) Conflicts: ggml/src/ggml-hexagon/ggml-hexagon.cpp
* hexagon: fix merge conflicts
* hexagon: minor cleanup for buffer support checks
* hexagon: factor out op_desc and the overall op logging
* hexagon: further simplify and cleanup op dispatch logic
* snapdragon: update adb scripts to use llama-cli and llama-completion
* fix pipeline failure

Co-authored-by: Max Krasnyansky <maxk@qti.qualcomm.com>
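The refactor entries above describe replacing per-operation functions with a single template-based one. As a rough illustration of that general pattern only, here is a minimal C++ sketch. It is not the code from #17500: the names `AddOp`, `MulOp`, and `binary_op` are hypothetical, and the sketch works on plain element-wise vectors rather than ggml tensors or Hexagon request buffers.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical operation tags (illustration only, not ggml-hexagon identifiers).
// Each tag supplies the per-element behaviour that differs between operations.
struct AddOp {
    static float apply(float a, float b) { return a + b; }
};

struct MulOp {
    static float apply(float a, float b) { return a * b; }
};

// One generic driver holds the logic shared by all binary operations
// (size check, output allocation, iteration). The operation itself is
// selected at compile time through the template parameter.
template <typename Op>
std::vector<float> binary_op(const std::vector<float>& a,
                             const std::vector<float>& b) {
    assert(a.size() == b.size());  // shared precondition for every binary op
    std::vector<float> out(a.size());
    for (std::size_t i = 0; i < a.size(); ++i) {
        out[i] = Op::apply(a[i], b[i]);
    }
    return out;
}
```

With this shape, adding another binary operation means adding one small tag type instead of copying the whole driver, which is the kind of flexibility the commit entries refer to.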

macOS/iOS:
- macOS Apple Silicon (arm64)
- macOS Intel (x64)
- iOS XCFramework

Linux:
- Ubuntu x64 (CPU)
- Ubuntu x64 (Vulkan)
- Ubuntu s390x (CPU)

Windows:
- Windows x64 (CPU)
- Windows arm64 (CPU)
- Windows x64 (CUDA 12) - CUDA 12.4 DLLs
- Windows x64 (CUDA 13) - CUDA 13.1 DLLs
- Windows x64 (Vulkan)
- Windows x64 (SYCL)
- Windows x64 (HIP)

openEuler:
- openEuler x86 (310p)
- openEuler x86 (910b)
- openEuler aarch64 (310p)
- openEuler aarch64 (910b)

Source: README.md, updated 2025-12-23

llama.cpp Files

Port of Facebook's LLaMA model in C/C++
