whichllm Files

Find the local LLM that actually runs and performs best

This is an exact mirror of the whichllm project, hosted at https://github.com/Andyyyy64/whichllm. SourceForge is not affiliated with whichllm.


Name	Modified	Size	Downloads / Week
README.md	2026-05-17	550 Bytes	0
v0.5.6 source code.tar.gz	2026-05-17	1.1 MB	0
v0.5.6 source code.zip	2026-05-17	1.1 MB	0
Totals: 3 Items		2.3 MB	0

What's Changed

Add speed estimate confidence metadata and estimated tok/s ranges.
Improve MoE speed estimates using active parameters and bandwidth-scaled read floors.
Add Windows AMD/Intel GPU detection fallback through Win32_VideoController and registry memory reads.
Treat Ryzen AI / Radeon 890M-class Windows iGPUs as shared-memory AMD GPUs.
Avoid summing dedicated GPU VRAM with shared-memory iGPU system RAM as one full-GPU target.
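
To make two of the items above concrete, here is a minimal Python sketch. It is not whichllm's code. It assumes decode speed is limited by memory bandwidth, so a MoE model is costed by its active parameters rather than its total, and it assumes one possible rule for keeping dedicated VRAM and shared-memory iGPU RAM apart. Every name (`SpeedEstimate`, `estimate_decode_speed`, `Gpu`, `full_gpu_target_gb`), the `efficiency` and `spread` defaults, and the confidence thresholds are hypothetical. The sketch does not model the bandwidth-scaled read floors mentioned in the notes.

```python
from dataclasses import dataclass


@dataclass
class SpeedEstimate:
    low_tok_s: float
    mid_tok_s: float
    high_tok_s: float
    confidence: str  # hypothetical labels: "high", "medium", "low"


def estimate_decode_speed(active_params_b, bytes_per_param, bandwidth_gb_s,
                          efficiency=0.6, spread=0.25):
    """Bandwidth-bound estimate: each generated token reads the active weights once.

    active_params_b is in billions of parameters. For a MoE model, pass the
    active (per-token) parameter count, not the total.
    """
    if min(active_params_b, bytes_per_param, bandwidth_gb_s) <= 0:
        raise ValueError("parameters, bytes per parameter, and bandwidth must be positive")
    if not 0 < efficiency <= 1 or not 0 <= spread < 1:
        raise ValueError("efficiency must be in (0, 1] and spread in [0, 1)")
    gb_per_token = active_params_b * bytes_per_param  # billions of params * bytes = GB
    mid = bandwidth_gb_s * efficiency / gb_per_token
    # Assumed rule: a wider spread means less trust in the point estimate.
    confidence = "high" if spread <= 0.15 else "medium" if spread <= 0.3 else "low"
    return SpeedEstimate(mid * (1 - spread), mid, mid * (1 + spread), confidence)


@dataclass
class Gpu:
    name: str
    memory_gb: float
    shared_memory: bool  # True for iGPUs that borrow system RAM


def full_gpu_target_gb(gpus):
    """Return the memory budget for a full-GPU target without mixing pools.

    Assumed rule: sum dedicated VRAM only; fall back to the largest
    shared-memory iGPU when no dedicated GPU is present.
    """
    dedicated = sum(g.memory_gb for g in gpus if not g.shared_memory)
    if dedicated > 0:
        return dedicated
    return max((g.memory_gb for g in gpus if g.shared_memory), default=0.0)


if __name__ == "__main__":
    # Illustrative numbers only: 3B active parameters at about 4 bits each.
    print(estimate_decode_speed(3.0, 0.5, 300.0))
    print(full_gpu_target_gb([Gpu("dGPU", 8.0, False), Gpu("iGPU", 16.0, True)]))
```

Reporting a low/mid/high range with a confidence label, rather than a single number, keeps the uncertainty of a bandwidth-only model visible to the reader.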

Validation

ruff format --check .
ruff check .
pytest -q -s
python -m build

Source: README.md, updated 2026-05-17
