Ollama Files

Get up and running with Llama 2 and other large language models

This is an exact mirror of the Ollama project, hosted at https://github.com/jmorganca/ollama. SourceForge is not affiliated with Ollama. For more information, see the SourceForge Open Source Mirror Directory.

The interactive file manager requires Javascript. Please enable it or use sftp or scp.
You may still browse the files here.

Name	Modified	Size	InfoDownloads / Week
Parent folder
Ollama.dmg	2025-12-08	59.6 MB	5
OllamaSetup.exe	2025-12-08	1.2 GB	27
ollama-linux-arm64.tgz	2025-12-08	2.1 GB	3
ollama-linux-arm64-jetpack6.tgz	2025-12-08	369.6 MB	2
ollama-linux-arm64-jetpack5.tgz	2025-12-08	468.2 MB	0
ollama-linux-amd64.tgz	2025-12-08	2.0 GB	0
ollama-linux-amd64-rocm.tgz	2025-12-08	1.3 GB	0
ollama-darwin.tgz	2025-12-08	27.4 MB	0
ollama-windows-arm64.zip	2025-12-08	22.3 MB	0
ollama-windows-amd64.zip	2025-12-08	2.0 GB	2
ollama-windows-amd64-rocm.zip	2025-12-08	376.4 MB	27
Ollama-darwin.zip	2025-12-08	59.6 MB	0
sha256sum.txt	2025-12-08	1.1 kB	0
README.md	2025-12-08	861 Bytes	1
v0.13.2 source code.tar.gz	2025-12-08	20.9 MB	0
v0.13.2 source code.zip	2025-12-08	21.6 MB	0
Totals: 16 Items		9.9 GB	67

New models

Qwen3-Next: The first installment in the Qwen3-Next series with strong performance in terms of both parameter efficiency and inference speed.

What's Changed

Flash attention is now enabled by default for vision models such as mistral-3, gemma3, qwen3-vl and more. This improves memory utilization and performance when providing images as input.
Fixed GPU detection on multi-GPU CUDA machines
Fixed issue where deepseek-v3.1 would always think even with thinking is disabled in Ollama's app

New Contributors

@chengcheng84 made their first contribution in https://github.com/ollama/ollama/pull/13265
@nathan-hook made their first contribution in https://github.com/ollama/ollama/pull/13256

Full Changelog: https://github.com/ollama/ollama/compare/v0.13.1...v0.13.2

Source: README.md, updated 2025-12-08

Other Useful Business Software

Orchestrate Your AI Agents with Zenflow Icon

Orchestrate Your AI Agents with Zenflow

The multi-agent workflow engine for modern teams. Zenflow executes coding, testing, and verification with deep repo awareness

Zenflow orchestrates AI agents like a real engineering system. With parallel execution, spec-driven workflows, and deep multi-repo understanding, agents plan, implement, test, and verify end-to-end. Upgrade to AI workflows that work the way your team does.

Try free now

Our Free Plans just got better! | Auth0 Icon

Our Free Plans just got better! | Auth0

With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now

Recommended Projects

Chinese-LLaMA-Alpaca 2
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
fullmoon
Chat with private and local large language models
llama.cpp
Port of Facebook's LLaMA model in C/C++
LLM CLI
Access large language models from the command-line
GPT4All
Run Local LLMs on Any Device. Open-source