Wllama version 3.0 is out - with multimodal and tool calling support 🚀🚀
V3.0 is a major architectural overhaul that replaces the custom wllama core with server-context, the inference component from llama-server. Key highlights:
- 🔥 Full OAI-compatible API: `createChatCompletion`, `createCompletion`, `createEmbedding`
- 🖼️ Multimodal support (vision/audio inputs)
- 🔨 Native tool calling support
- 🥷 Jinja-based chat template parsing (same as llama-server)
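As a rough illustration of the OAI-compatible direction, the new `createChatCompletion` API accepts OpenAI-style message arrays. The snippet below sketches that message shape in TypeScript; the surrounding `Wllama` calls and option names are assumptions shown only as comments, so check the release guide linked below for the actual signatures.

```typescript
// OpenAI-style chat message shape, as accepted by OAI-compatible
// chat completion APIs (field names follow the OpenAI convention).
type ChatMessage = {
  role: 'system' | 'user' | 'assistant';
  content: string;
};

const messages: ChatMessage[] = [
  { role: 'system', content: 'You are a helpful assistant.' },
  { role: 'user', content: 'Hello!' },
];

// Assumed usage (method names taken from the release notes above;
// constructor arguments, model URL, and options are illustrative):
//
//   const wllama = new Wllama(CONFIG_PATHS);
//   await wllama.loadModelFromUrl('https://example.com/model.gguf');
//   const reply = await wllama.createChatCompletion(messages, { nPredict: 128 });

console.log(messages.length);
```

Because the message format matches OpenAI's, existing prompt-building code written against the OpenAI Chat Completions schema should carry over with little change.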
View the full release notes here: https://github.com/ngxson/wllama/blob/master/guides/intro-v3.md
What's Changed
- Reuse llama-server source code (v3.0.0 - huge breaking changes ahead!) by @ngxson in https://github.com/ngxson/wllama/pull/213
Full Changelog: https://github.com/ngxson/wllama/compare/2.4.0...3.0.0