LMCache - Browse /v0.4.4 at SourceForge.net

The interactive file manager requires Javascript. Please enable it or use sftp or scp.
You may still browse the files here.

Name	Modified	Size	InfoDownloads / Week
Parent folder
lmcache-0.4.4-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl	2026-04-23	11.7 MB	0
lmcache-0.4.4-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl	2026-04-23	11.7 MB	1
lmcache-0.4.4-cp312-cp312-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl	2026-04-23	11.8 MB	0
lmcache-0.4.4-cp313-cp313-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl	2026-04-23	11.8 MB	0
lmcache-0.4.4.tar.gz	2026-04-23	3.6 MB	0
lmcache_cli-0.4.5.dev0-py3-none-any.whl	2026-04-23	1.3 MB	0
README.md	2026-04-22	8.7 kB	0
v0.4.4 source code.tar.gz	2026-04-22	3.6 MB	1
v0.4.4 source code.zip	2026-04-22	4.2 MB	0
Totals: 9 Items		59.6 MB	2

What's Changed

Refactor remote plugin to accept multiply connector by @maobaolong in https://github.com/LMCache/LMCache/pull/2666
[MP]feat: support different kv cache shape and dtype across layers by @liuyumoye in https://github.com/LMCache/LMCache/pull/2926
[Chore][CI]: K3 base CI image 12.9 CUDA by @sammshen in https://github.com/LMCache/LMCache/pull/2975
fix: use pin=False in _allocate_and_put to prevent pd_buffer leak by @ningziwen in https://github.com/LMCache/LMCache/pull/2847
feat(disk): support multi-path local disk backend for multi-device I/O by @glimchb in https://github.com/LMCache/LMCache/pull/2801
[Chore][CI] Upgrade CI base image to CUDA 13.0 by @sammshen in https://github.com/LMCache/LMCache/pull/2981
[doc] document long-doc-permutator workload in cli bench by @deng451e in https://github.com/LMCache/LMCache/pull/2963
[MP][Bugfix] Fix deadlock caused by cuda launch host func by @ApostaC in https://github.com/LMCache/LMCache/pull/2952
[BugFix]: Fix typo bug by @princepride in https://github.com/LMCache/LMCache/pull/2980
[CI] Pin cu128 nightly wheel for blend ci test by @deng451e in https://github.com/LMCache/LMCache/pull/2987
[MP][optimize] optimize save when mla enabled by @chunxiaozheng in https://github.com/LMCache/LMCache/pull/2935
[hotfix] fix prometheus version for UT failure by @ApostaC in https://github.com/LMCache/LMCache/pull/3000
Update LMCache Office Hours to Wednesday by @nijaba in https://github.com/LMCache/LMCache/pull/2990
[fix] Limit proxy in-flight requests to prevent PD buffer deadlock by @deng451e in https://github.com/LMCache/LMCache/pull/2957
[MP] Lazy start heartbeat thread when first req coming by @maobaolong in https://github.com/LMCache/LMCache/pull/2943
[Operator] Add L2 RESP (Redis/Valkey) adapter support by @royyhuang in https://github.com/LMCache/LMCache/pull/2967
[Feat][RawBlock] Add TP>1 support and compact batched retrieval path by @DongDongJu in https://github.com/LMCache/LMCache/pull/2948
[MP] Introduce a simple way to register_gauge metrics. by @maobaolong in https://github.com/LMCache/LMCache/pull/2906
[Build] Add lmcache-cli lightweight wheel by @deng451e in https://github.com/LMCache/LMCache/pull/2959
Copy a snapshot of lmcache_mp_connector.py for vllm 0.18.0 by @maobaolong in https://github.com/LMCache/LMCache/pull/2887
[MP] Add a new argument to specify whether retain_in_l1 by @maobaolong in https://github.com/LMCache/LMCache/pull/2813
[Chore][CI] Skip k3 builds when only docs/trivial files changed by @sammshen in https://github.com/LMCache/LMCache/pull/2993
[ops][refactor] Add full list of Python fallbacks to run without compiled CUDA extensions by @hlin99 in https://github.com/LMCache/LMCache/pull/2591
[Feat] L0 Subscriber by @Oasis-Git in https://github.com/LMCache/LMCache/pull/2974
refactor: extract PathSharder module for shared multi-path selection by @glimchb in https://github.com/LMCache/LMCache/pull/2982
refactor(mp): replace job_id with request_id in query_prefetch_status by @yoo-kumaneko in https://github.com/LMCache/LMCache/pull/2996
[MP] Support lazy import built-in l2 adapter by @maobaolong in https://github.com/LMCache/LMCache/pull/2905
[MP][Optimize] Skip locked keys during LRU eviction to improve eviction efficiency by @chunxiaozheng in https://github.com/LMCache/LMCache/pull/2978
fix: add controller config validation and clear error messages (#2907) by @ianliuy in https://github.com/LMCache/LMCache/pull/3003
feat: add chunk hashes logger to MP server for offline data analysis by @yoo-kumaneko in https://github.com/LMCache/LMCache/pull/2928
[Chore][CI]: K3 MP output token quantity tolerance by @sammshen in https://github.com/LMCache/LMCache/pull/3030
feat(tools): add LRU cache simulator for lookup-hash JSONL logs by @yoo-kumaneko in https://github.com/LMCache/LMCache/pull/3021
[Feat] L1 Subscriber by @Oasis-Git in https://github.com/LMCache/LMCache/pull/2986
[Feat] Add cache_salt parameter to MP adapter interfaces by @royyhuang in https://github.com/LMCache/LMCache/pull/3029
[Feat] Add is_user_level property and cache_salt param to EvictionPolicy by @royyhuang in https://github.com/LMCache/LMCache/pull/3032
[Feat][DAX] Optimize staged batched restore path and document modification by @DongDongJu in https://github.com/LMCache/LMCache/pull/2904
[Chore] Remove v0 code by @sammshen in https://github.com/LMCache/LMCache/pull/2968
[Chore] add coding standard and PR review instructions by @ApostaC in https://github.com/LMCache/LMCache/pull/3039
[Observability] Per-request root OTel span and SpanRegistry for MP server tracing by @deng451e in https://github.com/LMCache/LMCache/pull/3033
feat(pd_backend): add pd_skip_proxy_notification to skip ZMQ proxy notification by @ningziwen in https://github.com/LMCache/LMCache/pull/2874
[Bugfix] fix some memory leak in cache_engine and eic connector by @liubj77 in https://github.com/LMCache/LMCache/pull/2544
[Hotfix][CI] Unblock CI: pandas auto-heal + CUDA 12 build toolchain by @sammshen in https://github.com/LMCache/LMCache/pull/3055
[Hotfix][CI] Pin vLLM nightly to cu130 index to match CUDA 13 base image by @ApostaC in https://github.com/LMCache/LMCache/pull/3061
[Docs] Mirror lmcache/ layout in docs/design/ for discoverability by @ApostaC in https://github.com/LMCache/LMCache/pull/3040
Add scheduler instance_id and model_name to L0 KV lifecycle tracking by @Oasis-Git in https://github.com/LMCache/LMCache/pull/3043
chore: expose package version via init.py by @hlin99 in https://github.com/LMCache/LMCache/pull/3034
Fix: Safely handle layerwise cache shape dimensions in remote backend by @hlin99 in https://github.com/LMCache/LMCache/pull/2751
[Core] Add persistence interfaces and nixl persistence by @YaoJiayi in https://github.com/LMCache/LMCache/pull/2938
[Misc] Reduce the logs generated by lazy memory allocator by @ApostaC in https://github.com/LMCache/LMCache/pull/3068
[MP][Feat] Add cache_salt to ObjectKey for cache isolation by @royyhuang in https://github.com/LMCache/LMCache/pull/3042
[ROCm] Make bare-host ROCm install self-sufficient by @Shaoting-Feng in https://github.com/LMCache/LMCache/pull/3070
[MP] Add tracing functionality for storage manager by @ApostaC in https://github.com/LMCache/LMCache/pull/3063
[MP][optimize] unified touch all keys in end session request by @chunxiaozheng in https://github.com/LMCache/LMCache/pull/3020
[step3] remove unnecessary code in mp adapter by @chunxiaozheng in https://github.com/LMCache/LMCache/pull/2994
fix(mp): correct store cached requests in lmcache_mp_connector by @maobaolong in https://github.com/LMCache/LMCache/pull/3012
[refactor]: Replace use_cufile with use_gds/gds_backend config flags by @glimchb in https://github.com/LMCache/LMCache/pull/2858
[CI] Add cu13.0 wheel + container builds and nightly wheel releases by @deng451e in https://github.com/LMCache/LMCache/pull/3069
[CI] Run the same test set on AMD as on NVIDIA by @Shaoting-Feng in https://github.com/LMCache/LMCache/pull/3071
[ROCm][MP] Fix HIP invalid-argument on lazy host buffer past 2 GB by @Shaoting-Feng in https://github.com/LMCache/LMCache/pull/3079
[CLI] Refactor query command by @deng451e in https://github.com/LMCache/LMCache/pull/2995
[CI] add missing egress endpoints to nightly Docker build by @deng451e in https://github.com/LMCache/LMCache/pull/3087
[Hotfix][CI] Fail-fast when vLLM CLI import chain is broken post-install by @sammshen in https://github.com/LMCache/LMCache/pull/3093
[CLI][fix] lazy torch import in init.py to unblock CLI-only installs by @deng451e in https://github.com/LMCache/LMCache/pull/3086
[CLI] Introduce lmcache trace CLI by @ApostaC in https://github.com/LMCache/LMCache/pull/3075
[Chore][Docs]: daily drift check — multi-process mode by @ApostaC in https://github.com/LMCache/LMCache/pull/3076
[Fix][CI] fix nightly wheel versioning and build reliability by @deng451e in https://github.com/LMCache/LMCache/pull/3097
[Hotfix][CI] Replace vllm main.py patch with sitecustomize.py by @sammshen in https://github.com/LMCache/LMCache/pull/3100
[CI] fix blend-server venv by @deng451e in https://github.com/LMCache/LMCache/pull/3099
[MP] Introduce MP runtime plugin framework by @maobaolong in https://github.com/LMCache/LMCache/pull/2956