Download Latest Version ollama-linux-arm64-jetpack6.tgz (374.9 MB)
Email in envelope

Get an email when there's a new version of Ollama

Home / v0.13.2
Name Modified Size InfoDownloads / Week
Parent folder
Ollama.dmg 2025-12-08 59.6 MB
OllamaSetup.exe 2025-12-08 1.2 GB
ollama-linux-arm64.tgz 2025-12-08 2.1 GB
ollama-linux-arm64-jetpack6.tgz 2025-12-08 369.6 MB
ollama-linux-arm64-jetpack5.tgz 2025-12-08 468.2 MB
ollama-linux-amd64.tgz 2025-12-08 2.0 GB
ollama-linux-amd64-rocm.tgz 2025-12-08 1.3 GB
ollama-darwin.tgz 2025-12-08 27.4 MB
ollama-windows-arm64.zip 2025-12-08 22.3 MB
ollama-windows-amd64.zip 2025-12-08 2.0 GB
ollama-windows-amd64-rocm.zip 2025-12-08 376.4 MB
Ollama-darwin.zip 2025-12-08 59.6 MB
sha256sum.txt 2025-12-08 1.1 kB
README.md 2025-12-08 861 Bytes
v0.13.2 source code.tar.gz 2025-12-08 20.9 MB
v0.13.2 source code.zip 2025-12-08 21.6 MB
Totals: 16 Items   9.9 GB 67

New models

  • Qwen3-Next: The first installment in the Qwen3-Next series with strong performance in terms of both parameter efficiency and inference speed.

What's Changed

  • Flash attention is now enabled by default for vision models such as mistral-3, gemma3, qwen3-vl and more. This improves memory utilization and performance when providing images as input.
  • Fixed GPU detection on multi-GPU CUDA machines
  • Fixed issue where deepseek-v3.1 would always think even with thinking is disabled in Ollama's app

New Contributors

Full Changelog: https://github.com/ollama/ollama/compare/v0.13.1...v0.13.2

Source: README.md, updated 2025-12-08