whichllm Files

Find the local LLM that actually runs and performs best

This is an exact mirror of the whichllm project, hosted at https://github.com/Andyyyy64/whichllm. SourceForge is not affiliated with whichllm.

The interactive file manager requires Javascript. Please enable it or use sftp or scp.
You may still browse the files here.

Name	Modified	Size	InfoDownloads / Week
Parent folder
README.md	2026-05-19	567 Bytes	0
v0.5.7 source code.tar.gz	2026-05-19	1.1 MB	0
v0.5.7 source code.zip	2026-05-19	1.2 MB	0
Totals: 3 Items		2.3 MB	0

What's Changed

Detect DGX Spark / NVIDIA GB10 as a shared-memory NVIDIA GPU when NVIDIA reports memory.total as unavailable.
Fix whichllm run crashes for large Transformers models by providing an offload_folder.
Respect XDG_CACHE_HOME for cache paths, while ignoring relative values per the XDG spec.
Treat Apple Silicon as shared memory in fit detection.
Inline LiveBench fallback data and speed up benchmark score fetching.

Validation

ruff format --check .
ruff check .
pytest -q -s
python -m build
twine check dist/*

Source: README.md, updated 2026-05-19

Other Useful Business Software

Ship Agents Faster

Transform your applications and workflows into powerful agentic systems at global scale.

Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.

Get Started Free

Enterprise-grade ITSM, for every business

Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.

Try it Free

Ship Agents Faster

Transform your applications and workflows into powerful agentic systems at global scale.

Get Started Free

Recommended Projects

llmfit
157 models, 30 providers, one command to find what runs on hardware
Gemma Chat
Local AI chat + coding agent for Apple Silicon, powered by Gemma 4
ds4.c
DeepSeek 4 Flash local inference engine for Metal
clinfo
clinfo - openCL INFOrmation Simple linux script that provides information regarding cpu-gpu components by querying devices and comparing them against known data. CPUs : Intel(>Netburst), GPUs : Amd (>3xxx), Nvidia(>8)
OpenMonoAgent
Terminal-native coding agent powered by local LLMs

whichllm Files

Find the local LLM that actually runs and performs best

Get an email when there's a new version of whichllm

What's Changed

Validation