whichllm v0.5.9
Name                        Modified     Size
README.md                   2026-06-10   1.1 kB
v0.5.9 source code.tar.gz   2026-06-10   1.1 MB
v0.5.9 source code.zip      2026-06-10   1.2 MB
Totals: 3 items, 2.3 MB

Highlights

  • GPU bandwidth detection now falls back to the bundled TechPowerUp database (2,824 GPUs) when a card is missing from the curated catalog. Uncatalogued cards no longer show BW: N/A with 0.0 tok/s estimates and oversized recommendations, and a laptop card can never inherit its desktop sibling's bandwidth. (#74, #98)
  • Fixed AMD discrete GPU detection on Linux, including RX 6750 XT and the compound lspci name path. (#61)
  • Artificial Analysis Intelligence Index is fetched live again after the site's App Router migration. Live scores overlay the curated snapshot, so coverage can only grow. (#87)
  • Added MXFP4 and NVFP4 quantization support. These repos were previously labeled FP16, overestimating VRAM by about 3.5x. (#27)
  • Added Apple M5-family simulation entries and Kepler-era Quadro catalog coverage.
  • Community GGUF repos without base_model metadata now match official benchmark scores by name.
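
The bandwidth fallback described in the first highlight can be pictured with a small sketch. This is not whichllm's code: the table names, the card names, the bandwidth values, and the matching rule are all hypothetical placeholders chosen for illustration. It shows a two-tier lookup (curated catalog first, then a larger fallback table) with a guard that stops a laptop card from matching a desktop entry.

```python
from __future__ import annotations

# Hypothetical placeholder tables (GB/s). Not whichllm's real catalog data.
CURATED_GBPS = {"examplecard 8000": 900.0}
FALLBACK_GBPS = {"examplecard 9000": 500.0}

# Assumed set of name tokens that mark a mobile part.
MOBILE_TOKENS = {"laptop", "mobile", "max-q"}


def normalize(name: str) -> str:
    """Lowercase the name and collapse runs of whitespace."""
    return " ".join(name.lower().split())


def is_mobile(name: str) -> bool:
    """Return True if any token in the name marks a laptop part."""
    return any(token in MOBILE_TOKENS for token in normalize(name).split())


def lookup_bandwidth(name: str) -> float | None:
    """Check the curated table first, then the fallback table.

    A loose substring match is accepted only when both names are in the
    same class (mobile or desktop), so a laptop card never picks up the
    bandwidth of its desktop sibling.
    """
    key = normalize(name)
    for table in (CURATED_GBPS, FALLBACK_GBPS):
        if key in table:
            return table[key]
        for candidate, bandwidth in table.items():
            if candidate in key and is_mobile(candidate) == is_mobile(key):
                return bandwidth
    return None
```

With these placeholder tables, `lookup_bandwidth("ExampleCard 9000")` is served by the fallback table, while `lookup_bandwidth("ExampleCard 9000 Laptop GPU")` returns `None` instead of borrowing the desktop figure.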
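
To see why labeling an MXFP4 or NVFP4 repo as FP16 inflates the VRAM estimate by roughly the factor quoted above, here is a minimal weights-only calculation. It is a sketch under stated assumptions, not whichllm's estimator: it assumes MXFP4 stores 4-bit elements with one 8-bit scale per 32-element block, NVFP4 stores 4-bit elements with one 8-bit scale per 16-element block, and it ignores KV cache, activations, and runtime overhead. The function names are hypothetical.

```python
def bits_per_weight(element_bits: int, scale_bits: int, block_size: int) -> float:
    """Effective bits per weight for a block-scaled format."""
    return element_bits + scale_bits / block_size


# Assumed effective bit widths; see the lead-in for the block layouts used.
BITS = {
    "FP16": 16.0,
    "MXFP4": bits_per_weight(4, 8, 32),  # 4.25 bits per weight
    "NVFP4": bits_per_weight(4, 8, 16),  # 4.5 bits per weight
}


def weight_gb(params_billion: float, fmt: str) -> float:
    """Weights-only size in GB (10^9 bytes) for a model of the given size."""
    return params_billion * 1e9 * BITS[fmt] / 8 / 1e9


def overestimate_factor(fmt: str) -> float:
    """How much larger the estimate is if the repo is mislabeled as FP16."""
    return BITS["FP16"] / BITS[fmt]
```

Under these assumptions, a hypothetical 20B-parameter model needs 40 GB of weights at FP16 but about 10.6 GB at MXFP4, and the mislabeling factor comes out near 3.8x for MXFP4 and 3.6x for NVFP4, in line with the "about 3.5x" figure.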

QA

  • CI lint: passed
  • CI tests: Python 3.11, 3.12, and 3.13 passed
  • Local: 329 tests passed; sdist and wheel built successfully
  • Real hardware smoke test on Apple M2
Source: README.md, updated 2026-06-10