| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| Parent folder | |||
| README.md | 2026-05-19 | 567 Bytes | |
| v0.5.7 source code.tar.gz | 2026-05-19 | 1.1 MB | |
| v0.5.7 source code.zip | 2026-05-19 | 1.2 MB | |
| Totals: 3 Items | 2.3 MB | 0 | |
What's Changed
- Detect DGX Spark / NVIDIA GB10 as a shared-memory NVIDIA GPU when NVIDIA reports
memory.totalas unavailable. - Fix
whichllm runcrashes for large Transformers models by providing anoffload_folder. - Respect
XDG_CACHE_HOMEfor cache paths, while ignoring relative values per the XDG spec. - Treat Apple Silicon as shared memory in fit detection.
- Inline LiveBench fallback data and speed up benchmark score fetching.
Validation
ruff format --check .ruff check .pytest -q -spython -m buildtwine check dist/*