llama.cpp - Browse /b9717 at SourceForge.net

The interactive file manager requires Javascript. Please enable it or use sftp or scp.
You may still browse the files here.

Name	Modified	Size	InfoDownloads / Week
Parent folder
llama-b9717-xcframework.zip	< 9 hours ago	206.3 MB	0
llama-b9717-ui.tar.gz	< 9 hours ago	2.7 MB	0
llama-b9717-bin-win-vulkan-x64.zip	< 9 hours ago	38.9 MB	0
llama-b9717-bin-win-sycl-x64.zip	< 9 hours ago	114.3 MB	0
llama-b9717-bin-win-openvino-2026.2-x64.zip	< 9 hours ago	10.5 MB	0
llama-b9717-bin-win-opencl-adreno-arm64.zip	< 9 hours ago	11.7 MB	0
llama-b9717-bin-win-hip-radeon-x64.zip	< 9 hours ago	321.4 MB	0
llama-b9717-bin-win-cuda-13.3-x64.zip	< 9 hours ago	159.5 MB	0
llama-b9717-bin-win-cuda-12.4-x64.zip	< 9 hours ago	261.6 MB	0
llama-b9717-bin-win-cpu-x64.zip	< 9 hours ago	17.3 MB	0
llama-b9717-bin-win-cpu-arm64.zip	< 9 hours ago	11.2 MB	0
llama-b9717-bin-ubuntu-x64.tar.gz	< 9 hours ago	15.6 MB	0
llama-b9717-bin-ubuntu-vulkan-x64.tar.gz	< 9 hours ago	38.5 MB	0
llama-b9717-bin-ubuntu-vulkan-arm64.tar.gz	< 9 hours ago	31.9 MB	0
llama-b9717-bin-ubuntu-sycl-fp32-x64.tar.gz	< 9 hours ago	47.5 MB	0
llama-b9717-bin-ubuntu-sycl-fp16-x64.tar.gz	< 9 hours ago	47.7 MB	0
llama-b9717-bin-ubuntu-s390x.tar.gz	< 9 hours ago	14.7 MB	0
llama-b9717-bin-ubuntu-rocm-7.2-x64.tar.gz	< 9 hours ago	131.3 MB	0
llama-b9717-bin-ubuntu-openvino-2026.2-x64.tar.gz	< 9 hours ago	14.1 MB	0
llama-b9717-bin-ubuntu-arm64.tar.gz	< 9 hours ago	12.6 MB	0
llama-b9717-bin-macos-x64.tar.gz	< 9 hours ago	11.2 MB	0
llama-b9717-bin-macos-arm64.tar.gz	< 9 hours ago	10.9 MB	0
llama-b9717-bin-android-arm64.tar.gz	< 9 hours ago	76.7 MB	0
cudart-llama-bin-win-cuda-13.3-x64.zip	< 9 hours ago	391.0 MB	0
cudart-llama-bin-win-cuda-12.4-x64.zip	< 9 hours ago	391.4 MB	0
b9717 source code.tar.gz	< 9 hours ago	35.0 MB	0
b9717 source code.zip	< 9 hours ago	36.5 MB	0
README.md	< 9 hours ago	3.9 kB	0
Totals: 28 Items		2.5 GB	0

ggml-cpu: support K tails in power10 Q8/Q4 MMA matmul (#24753) * ggml-cpu: support K tails in Power10 MMA Q8/Q4 matmul This patch removes the requirement that K be divisible by kc in the tinyBlas_Q0_PPC tiled matmul path. Process the final K panel using its actual depth and pass the reduced panel size through packing and kernel execution. This allows more workloads to use the MMA kernel and reduces fallback to mnpack. * Apply suggestion from @taronaeo Co-authored-by: Aaron Teo <taronaeo@gmail.com> --------- Co-authored-by: Aaron Teo <taronaeo@gmail.com>

macOS/iOS: