Download Latest Version v0.5.10 source code.tar.gz (1.1 MB)
Email in envelope

Get an email when there's a new version of whichllm

Home / v0.5.6
Name Modified Size InfoDownloads / Week
Parent folder
README.md 2026-05-17 550 Bytes
v0.5.6 source code.tar.gz 2026-05-17 1.1 MB
v0.5.6 source code.zip 2026-05-17 1.1 MB
Totals: 3 Items   2.3 MB 0

What's Changed

  • Add speed estimate confidence metadata and estimated tok/s ranges.
  • Improve MoE speed estimates using active parameters and bandwidth-scaled read floors.
  • Add Windows AMD/Intel GPU detection fallback through Win32_VideoController and registry memory reads.
  • Treat Ryzen AI / Radeon 890M-class Windows iGPUs as shared-memory AMD GPUs.
  • Avoid summing dedicated GPU VRAM with shared-memory iGPU system RAM as one full-GPU target.

Validation

  • ruff format --check .
  • ruff check .
  • pytest -q -s
  • python -m build
Source: README.md, updated 2026-05-17