Download Latest Version v0.5.10 source code.tar.gz (1.1 MB)
Email in envelope

Get an email when there's a new version of whichllm

Home / v0.5.7
Name Modified Size InfoDownloads / Week
Parent folder
README.md 2026-05-19 567 Bytes
v0.5.7 source code.tar.gz 2026-05-19 1.1 MB
v0.5.7 source code.zip 2026-05-19 1.2 MB
Totals: 3 Items   2.3 MB 0

What's Changed

  • Detect DGX Spark / NVIDIA GB10 as a shared-memory NVIDIA GPU when NVIDIA reports memory.total as unavailable.
  • Fix whichllm run crashes for large Transformers models by providing an offload_folder.
  • Respect XDG_CACHE_HOME for cache paths, while ignoring relative values per the XDG spec.
  • Treat Apple Silicon as shared memory in fit detection.
  • Inline LiveBench fallback data and speed up benchmark score fetching.

Validation

  • ruff format --check .
  • ruff check .
  • pytest -q -s
  • python -m build
  • twine check dist/*
Source: README.md, updated 2026-05-19