| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| Parent folder | |||
| Ollama.dmg | 2025-12-08 | 59.6 MB | |
| OllamaSetup.exe | 2025-12-08 | 1.2 GB | |
| ollama-linux-arm64.tgz | 2025-12-08 | 2.1 GB | |
| ollama-linux-arm64-jetpack6.tgz | 2025-12-08 | 369.6 MB | |
| ollama-linux-arm64-jetpack5.tgz | 2025-12-08 | 468.2 MB | |
| ollama-linux-amd64.tgz | 2025-12-08 | 2.0 GB | |
| ollama-linux-amd64-rocm.tgz | 2025-12-08 | 1.3 GB | |
| ollama-darwin.tgz | 2025-12-08 | 27.4 MB | |
| ollama-windows-arm64.zip | 2025-12-08 | 22.3 MB | |
| ollama-windows-amd64.zip | 2025-12-08 | 2.0 GB | |
| ollama-windows-amd64-rocm.zip | 2025-12-08 | 376.4 MB | |
| Ollama-darwin.zip | 2025-12-08 | 59.6 MB | |
| sha256sum.txt | 2025-12-08 | 1.1 kB | |
| README.md | 2025-12-08 | 861 Bytes | |
| v0.13.2 source code.tar.gz | 2025-12-08 | 20.9 MB | |
| v0.13.2 source code.zip | 2025-12-08 | 21.6 MB | |
| Totals: 16 Items | 9.9 GB | 67 | |
New models
- Qwen3-Next: The first installment in the Qwen3-Next series with strong performance in terms of both parameter efficiency and inference speed.
What's Changed
- Flash attention is now enabled by default for vision models such as
mistral-3,gemma3,qwen3-vland more. This improves memory utilization and performance when providing images as input. - Fixed GPU detection on multi-GPU CUDA machines
- Fixed issue where
deepseek-v3.1would always think even with thinking is disabled in Ollama's app
New Contributors
- @chengcheng84 made their first contribution in https://github.com/ollama/ollama/pull/13265
- @nathan-hook made their first contribution in https://github.com/ollama/ollama/pull/13256
Full Changelog: https://github.com/ollama/ollama/compare/v0.13.1...v0.13.2