whichllm Files

Find the local LLM that actually runs and performs best

This is an exact mirror of the whichllm project, hosted at https://github.com/Andyyyy64/whichllm. SourceForge is not affiliated with whichllm.

The interactive file manager requires Javascript. Please enable it or use sftp or scp.
You may still browse the files here.

Name	Modified	Size	InfoDownloads / Week
Parent folder
README.md	2026-06-10	1.1 kB	0
v0.5.9 source code.tar.gz	2026-06-10	1.1 MB	0
v0.5.9 source code.zip	2026-06-10	1.2 MB	0
Totals: 3 Items		2.3 MB	0

Highlights

GPU bandwidth detection now falls back to the bundled TechPowerUp database (2,824 GPUs) when a card is missing from the curated catalog. Uncatalogued cards no longer show BW: N/A with 0.0 tok/s estimates and oversized recommendations, and a laptop card can never inherit its desktop sibling's bandwidth. (#74, [#98])
Fixed AMD discrete GPU detection on Linux, including RX 6750 XT and the compound lspci name path. (#61)
Artificial Analysis Intelligence Index is fetched live again after the site's App Router migration. Live scores overlay the curated snapshot, so coverage can only grow. (#87)
Added MXFP4 and NVFP4 quantization support. These repos were previously labeled FP16, overestimating VRAM by about 3.5x. (#27)
Added Apple M5-family simulation entries and Kepler-era Quadro catalog coverage.
Community GGUF repos without base_model metadata now match official benchmark scores by name.

QA

CI lint: passed
CI tests: Python 3.11, 3.12, and 3.13 passed
Local: 329 tests passed; sdist and wheel built successfully
Real hardware smoke test on Apple M2

Source: README.md, updated 2026-06-10

Other Useful Business Software

$300 Free Credits for Your Google Cloud Projects Icon

$300 Free Credits for Your Google Cloud Projects

Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial

Our Free Plans just got better! | Auth0 Icon

Our Free Plans just got better! | Auth0

With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now

$300 Free Credits for Your Google Cloud Projects

Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial

Recommended Projects

llmfit
157 models, 30 providers, one command to find what runs on hardware
Gemma Chat
Local AI chat + coding agent for Apple Silicon, powered by Gemma 4
ds4.c
DeepSeek 4 Flash local inference engine for Metal
clinfo
clinfo - openCL INFOrmation Simple linux script that provides information regarding cpu-gpu components by querying devices and comparing them against known data. CPUs : Intel(>Netburst), GPUs : Amd (>3xxx), Nvidia(>8)
OpenMonoAgent
Terminal-native coding agent powered by local LLMs