
v3.18.0
Name    Modified    Size
node-llama-cpp-electron-example.Linux.3.18.0.x64.tar.gz 2026-03-15 301.6 MB
node-llama-cpp-electron-example.Linux.3.18.0.arm64.tar.gz 2026-03-15 149.4 MB
node-llama-cpp-electron-example.Linux.3.18.0.arm64.deb 2026-03-15 122.0 MB
node-llama-cpp-electron-example.Linux.3.18.0.amd64.deb 2026-03-15 256.0 MB
node-llama-cpp-electron-example.Linux.3.18.0.amd64.snap 2026-03-15 268.5 MB
node-llama-cpp-electron-example.Linux.3.18.0.x86_64.AppImage 2026-03-15 300.4 MB
node-llama-cpp-electron-example.Linux.3.18.0.arm64.AppImage 2026-03-15 157.3 MB
node-llama-cpp-electron-example.macOS.3.18.0.x64.zip 2026-03-15 160.0 MB
node-llama-cpp-electron-example.macOS.3.18.0.arm64.zip 2026-03-15 147.7 MB
node-llama-cpp-electron-example.macOS.3.18.0.x64.dmg 2026-03-15 165.7 MB
node-llama-cpp-electron-example.macOS.3.18.0.arm64.dmg 2026-03-15 153.1 MB
node-llama-cpp-electron-example.Windows.3.18.0.x64.exe 2026-03-15 370.4 MB
node-llama-cpp-electron-example.Windows.3.18.0.arm64.exe 2026-03-15 134.5 MB
node-llama-cpp-electron-example.Windows.3.18.0.exe 2026-03-15 504.2 MB
README.md 2026-03-15 2.8 kB
v3.18.0 source code.tar.gz 2026-03-15 21.9 MB
v3.18.0 source code.zip 2026-03-15 22.3 MB
Totals: 17 items, 3.2 GB

3.18.0 (2026-03-15)

Features

  • automatic checkpoints for models that need them (#573) (c641959)
  • QwenChatWrapper: Qwen 3.5 support (see the usage sketch after this list) (#573) (c641959)
  • inspect gpu command: detect and report missing prebuilt binary modules and the use of a custom npm registry (#573) (c641959)
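
The Qwen 3.5 support lands in the existing QwenChatWrapper. Below is a minimal usage sketch, not an official snippet from this release: node-llama-cpp normally picks the right chat wrapper automatically from the model's metadata, and the model path here is only a placeholder.

    import {getLlama, LlamaChatSession, QwenChatWrapper} from "node-llama-cpp";

    // Load a local GGUF file; the path is a placeholder, not a file shipped with this release.
    const llama = await getLlama();
    const model = await llama.loadModel({modelPath: "path/to/qwen3.5-instruct.Q4_K_M.gguf"});
    const context = await model.createContext();

    // The chat wrapper is normally resolved automatically from the model metadata;
    // passing QwenChatWrapper explicitly only makes the choice visible here.
    const session = new LlamaChatSession({
        contextSequence: context.getSequence(),
        chatWrapper: new QwenChatWrapper()
    });

    console.log(await session.prompt("Summarize this release in one sentence."));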

Bug Fixes

  • resolveModelFile: deduplicate concurrent downloads of the same model (see the sketch after this list) (#570) (cc105b9)
  • correct Vulkan URL casing in documentation links (#568) (5a44506)
  • Qwen 3.5 memory estimation (#573) (c641959)
  • grammar use with HarmonyChatWrapper (#573) (c641959)
  • add Mistral think segment detection (#573) (c641959)
  • compress excessively long segments from the current response on context shift instead of throwing an error (#573) (c641959)
  • default thinking budget to 75% of the context size to prevent low-quality responses (#573) (c641959)
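
For the resolveModelFile fix, here is a rough sketch of the scenario it addresses: two parts of an app resolving the same model URI at the same time should now share a single download rather than starting two. The Hugging Face URI and the models directory below are placeholders, not files tied to this release.

    import path from "path";
    import {fileURLToPath} from "url";
    import {resolveModelFile} from "node-llama-cpp";

    const modelsDir = path.join(path.dirname(fileURLToPath(import.meta.url)), "models");
    const modelUri = "hf:example-user/example-model-GGUF/model.Q4_K_M.gguf"; // placeholder URI

    // Two concurrent resolutions of the same URI; with the dedup fix they should
    // share one download instead of racing each other, and both return the same path.
    const [pathA, pathB] = await Promise.all([
        resolveModelFile(modelUri, modelsDir),
        resolveModelFile(modelUri, modelsDir)
    ]);

    console.log(pathA === pathB); // expected: true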

Shipped with llama.cpp release b8352

To use the latest llama.cpp release available, run npx -n node-llama-cpp source download --release latest. (learn more)

Source: README.md, updated 2026-03-15