| Name | Modified | Size |
|---|---|---|
| Ollama.dmg | 2025-10-29 | 48.4 MB |
| OllamaSetup.exe | 2025-10-29 | 1.2 GB |
| ollama-linux-arm64.tgz | 2025-10-29 | 2.0 GB |
| ollama-darwin.tgz | 2025-10-29 | 25.6 MB |
| Ollama-darwin.zip | 2025-10-29 | 48.3 MB |
| ollama-linux-amd64-rocm.tgz | 2025-10-29 | 1.3 GB |
| ollama-linux-amd64.tgz | 2025-10-29 | 1.9 GB |
| ollama-linux-arm64-jetpack5.tgz | 2025-10-29 | 461.6 MB |
| ollama-linux-arm64-jetpack6.tgz | 2025-10-29 | 365.4 MB |
| ollama-windows-amd64-rocm.zip | 2025-10-29 | 356.9 MB |
| ollama-windows-amd64.zip | 2025-10-29 | 1.9 GB |
| ollama-windows-arm64.zip | 2025-10-29 | 22.2 MB |
| sha256sum.txt | 2025-10-29 | 1.1 kB |
| README.md | 2025-10-29 | 2.3 kB |
| v0.12.7 source code.tar.gz | 2025-10-29 | 20.3 MB |
| v0.12.7 source code.zip | 2025-10-29 | 20.8 MB |

Totals: 16 items, 9.6 GB
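The `sha256sum.txt` file above lists digests for verifying downloaded archives. The flow is the standard `sha256sum -c` check; as a self-contained sketch with a sample file standing in for a release archive:

```shell
# Demonstrate the verification flow used with sha256sum.txt:
# create a sample file, record its digest, then check it.
echo "sample release artifact" > artifact.tgz
sha256sum artifact.tgz > sha256sum.txt
sha256sum -c sha256sum.txt   # prints "artifact.tgz: OK"
```

With a real download, place the archive next to the published `sha256sum.txt` and run only the final `sha256sum -c` step.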
## New models
- Qwen3-VL: now available in all parameter sizes, ranging from 2B to 235B
- MiniMax-M2: a 230B-parameter model built for coding and agentic workflows, available on Ollama's cloud
## Add files and adjust thinking levels in Ollama's new app
Ollama's new app now includes a way to attach one or more files when prompting a model.
For better responses, thinking levels can now be adjusted for the gpt-oss models.
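Thinking levels are exposed through the API as well. A minimal sketch of a request body, assuming the `/api/generate` endpoint's `think` option and the `"low"`/`"medium"`/`"high"` values used for the gpt-oss models (model tag and prompt are illustrative):

```json
{
  "model": "gpt-oss:20b",
  "prompt": "Explain RAID levels briefly.",
  "think": "high"
}
```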
## New API documentation
Documentation for Ollama's API is now available at https://docs.ollama.com/api
## What's Changed
- Model load failures now include more information on Windows
- Fixed embedding results being incorrect when running `embeddinggemma`
- Fixed `gemma3n` on the Vulkan backend
- Increased the time allocated for ROCm to discover devices
- Fixed a truncation error when generating embeddings
- Fixed the request status code when running cloud models
- The OpenAI-compatible `/v1/embeddings` endpoint now supports the `encoding_format` parameter
- Ollama will now parse tool calls that don't conform to `{"name": name, "arguments": args}` (thanks @rick-github!)
- Fixed prompt processing reporting in the llama runner
- Increased speed when scheduling models
- Fixed an issue where `FROM <model>` would not inherit `RENDERER` or `PARSER` commands
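The `FROM <model>` inheritance fix can be illustrated with a minimal Modelfile sketch (the parent model name here is hypothetical): a child Modelfile that only adjusts a parameter now picks up the parent's `RENDERER` and `PARSER` commands rather than dropping them.

```
FROM qwen3-vl
# RENDERER and PARSER defined by the parent model are now
# inherited automatically; they no longer need restating here.
PARAMETER temperature 0.7
```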
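On the `encoding_format` change: the parameter follows the OpenAI embeddings API convention, where `"float"` (the default) returns plain JSON arrays and `"base64"` returns each embedding as a base64 string of packed little-endian float32 values. A minimal decoding sketch, assuming that packing:

```python
import base64
import struct

def decode_embedding(b64: str) -> list[float]:
    """Decode a base64-encoded embedding: packed little-endian
    float32 values, as returned with encoding_format="base64"."""
    raw = base64.b64decode(b64)
    return list(struct.unpack(f"<{len(raw) // 4}f", raw))

# Round-trip check with values exactly representable in float32.
vec = [0.5, -1.25, 3.0]
encoded = base64.b64encode(struct.pack(f"<{len(vec)}f", *vec)).decode()
print(decode_embedding(encoded))  # → [0.5, -1.25, 3.0]
```

The base64 form is mainly useful for reducing response size when requesting many large embeddings at once.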
## New Contributors
- @npardal made their first contribution in https://github.com/ollama/ollama/pull/12715
Full Changelog: https://github.com/ollama/ollama/compare/v0.12.6...v0.12.7-rc0