LMCache - Browse /v0.3.7 at SourceForge.net

Name	Modified	Size	Downloads / Week
lmcache-0.3.7-cp310-cp310-manylinux_2_24_x86_64.manylinux_2_28_x86_64.whl	2025-09-29	3.8 MB	0
lmcache-0.3.7-cp311-cp311-manylinux_2_24_x86_64.manylinux_2_28_x86_64.whl	2025-09-29	3.8 MB	0
lmcache-0.3.7-cp312-cp312-manylinux_2_24_x86_64.manylinux_2_28_x86_64.whl	2025-09-29	3.8 MB	0
lmcache-0.3.7-cp313-cp313-manylinux_2_24_x86_64.manylinux_2_28_x86_64.whl	2025-09-29	3.8 MB	0
lmcache-0.3.7.tar.gz	2025-09-29	1.1 MB	0
README.md	2025-09-29	7.7 kB	0
v0.3.7 source code.tar.gz	2025-09-29	1.0 MB	0
v0.3.7 source code.zip	2025-09-29	1.3 MB	2
Totals: 8 Items		18.8 MB	2

Big contributions coming soon:
- Hetero TP by @novahow
- More CacheBlend support by @junl666

What's Changed

[Core] Refactor the storage manager to support using CPU & GPU backends at the same time by @ApostaC in https://github.com/LMCache/LMCache/pull/1578
Fix a hardcoded device assignment when init FlashInferSparseBackend by @xleoken in https://github.com/LMCache/LMCache/pull/1571
Add sphinx-multiversion for tracing docs versions by @kobe0938 in https://github.com/LMCache/LMCache/pull/1599
[Doc] Fold the navigation bar for KV Cache management by @kobe0938 in https://github.com/LMCache/LMCache/pull/1598
Bump step-security/harden-runner from 2.12.2 to 2.13.1 by @dependabot[bot] in https://github.com/LMCache/LMCache/pull/1597
Bump pypa/gh-action-pypi-publish from 1.12.4 to 1.13.0 by @dependabot[bot] in https://github.com/LMCache/LMCache/pull/1541
Bump actions/setup-python from 5.6.0 to 6.0.0 by @dependabot[bot] in https://github.com/LMCache/LMCache/pull/1540
Bump actions/stale from 9.1.0 to 10.0.0 by @dependabot[bot] in https://github.com/LMCache/LMCache/pull/1539
Refactor lmcache_get_config to lmcache_get_or_create_config by @maobaolong in https://github.com/LMCache/LMCache/pull/1586
[Bugfix] Fix gds loading bytes comparison error by @kebe7jun in https://github.com/LMCache/LMCache/pull/1605
[feat] add continuous usage context by @royyhuang in https://github.com/LMCache/LMCache/pull/1612
Docs update for newcomers. by @KuntaiDu in https://github.com/LMCache/LMCache/pull/1595
[Refactor] Cleanup PD code and separate transfer code by @YaoJiayi in https://github.com/LMCache/LMCache/pull/1579
Fix root doc endpoint refer to latest release snapshot(0.3.6) instead of latest commit issue by @kobe0938 in https://github.com/LMCache/LMCache/pull/1625
Add .github/workflows/build_doc.yml to auto build by @kobe0938 in https://github.com/LMCache/LMCache/pull/1626
[Core] Implement nixl storage backend batched_get_non_blocking by @tshmilnvidia in https://github.com/LMCache/LMCache/pull/1559
Fix two bugs in async loading with MLA and TP > 1 by @yoo-kumaneko in https://github.com/LMCache/LMCache/pull/1619
[Misc] Hotfix: missing dependency: cpuinfo by @ApostaC in https://github.com/LMCache/LMCache/pull/1627
Update vLLM options for MLA test by @yoo-kumaneko in https://github.com/LMCache/LMCache/pull/1588
[Doc] Fix blending related docs by @YaoJiayi in https://github.com/LMCache/LMCache/pull/1624
support tmp path in FSConnector by @chunxiaozheng in https://github.com/LMCache/LMCache/pull/1603
[Test] Add a test for fs_connector by @maobaolong in https://github.com/LMCache/LMCache/pull/1594
[Chore]: Remove Python 3.9 as the minimum Python version by @hickeyma in https://github.com/LMCache/LMCache/pull/1618
[Core] Fix NixlStorageBackend import by @tshmilnvidia in https://github.com/LMCache/LMCache/pull/1631
[Bugfix] Fix formatting and testing issues in latest branch by @YaoJiayi in https://github.com/LMCache/LMCache/pull/1634
[feat]: working async s3 by @sammshen in https://github.com/LMCache/LMCache/pull/1614
[Docs] Fix the issue that left sidebar is unable to unfold by @KuntaiDu in https://github.com/LMCache/LMCache/pull/1637
Improve installation section to make it less confusing by @kobe0938 in https://github.com/LMCache/LMCache/pull/1641
[CD]: Lightweight Dockerfile by @sammshen in https://github.com/LMCache/LMCache/pull/1515
Set Qwen/Qwen3-8B as default model; Remove all hf tokens; Remove config file in minimal example by @kobe0938 in https://github.com/LMCache/LMCache/pull/1643
Replace default python package manager with uv to ensure fast & smooth experience + add vllm in minimal example to streamline user experience by @kobe0938 in https://github.com/LMCache/LMCache/pull/1642
[Bugfix] Fix unit from s to ms in disagg_proxy_server.py by @aztecher in https://github.com/LMCache/LMCache/pull/1652
[Improve] Add timeout for lookup client by @maobaolong in https://github.com/LMCache/LMCache/pull/1615
[Doc] Fix broken link to mooncake store by @aztecher in https://github.com/LMCache/LMCache/pull/1651
Add MRU policy to caching_policies doc by @xleoken in https://github.com/LMCache/LMCache/pull/1607
Refactoring the cache policy factory by @xleoken in https://github.com/LMCache/LMCache/pull/1602
[Core] P2P Backend by @YaoJiayi in https://github.com/LMCache/LMCache/pull/1610
[ROCM] [BUGFIX] : add missing hip files by @TaeSooRhee-moreh in https://github.com/LMCache/LMCache/pull/1654
Fixed the socket resource leakage issue of LMCacheLookupClient by @xleoken in https://github.com/LMCache/LMCache/pull/1613
[Improve] Support pass through the lmcache config from vllm extra config by @maobaolong in https://github.com/LMCache/LMCache/pull/1568
[feat][internal_api] Add inference info internal api by @maobaolong in https://github.com/LMCache/LMCache/pull/1653
[Test]: Enable unit tests to run on non-CUDA environment by @hickeyma in https://github.com/LMCache/LMCache/pull/1575
[Backend][BugFix]: Refactor StorageBackendInterface constructor to enable dynamic loading by @hickeyma in https://github.com/LMCache/LMCache/pull/1636
Improve the log message clarity when retrieve tokens by @xleoken in https://github.com/LMCache/LMCache/pull/1623
[CI/CD] Fix GDS test failure by adding use_direct_io = True by @ApostaC in https://github.com/LMCache/LMCache/pull/1659
Remove redundant connection check in remote_backend by @xleoken in https://github.com/LMCache/LMCache/pull/1667
[bugfix] fix broadcast and to_gpu error by @chunxiaozheng in https://github.com/LMCache/LMCache/pull/1675
[Improve][LocalCpu]Remove stream from local_cpu_backend.py by @maobaolong in https://github.com/LMCache/LMCache/pull/1673
Get a default value if old vllm doesn't have priority by @yoo-kumaneko in https://github.com/LMCache/LMCache/pull/1678
[BugFix][Testing]: Enable unit tests for GDS backend by @hickeyma in https://github.com/LMCache/LMCache/pull/1648
Initialize stack_frames to None to prevent NameError. by @xleoken in https://github.com/LMCache/LMCache/pull/1649
Remove redundancy. Also old version of vllm do not support size local. by @yoo-kumaneko in https://github.com/LMCache/LMCache/pull/1669
[patch]: add eagle decode layer by @sammshen in https://github.com/LMCache/LMCache/pull/1668
[patch]: async stress on cpu by @sammshen in https://github.com/LMCache/LMCache/pull/1665
Add metrics remote_put_task_num by @maobaolong in https://github.com/LMCache/LMCache/pull/1620
[core] support set limit of cache hit by @chunxiaozheng in https://github.com/LMCache/LMCache/pull/1666
[bugfix] fix memory leak in fs connector by @chunxiaozheng in https://github.com/LMCache/LMCache/pull/1656
[CD]: add flash infer by @sammshen in https://github.com/LMCache/LMCache/pull/1699
[Docs] Layerwise Docs by @sammshen in https://github.com/LMCache/LMCache/pull/1697
[Benchmark] configure user stacking at high qps in multi-round-qa by @yuezhu1 in https://github.com/LMCache/LMCache/pull/1609

New Contributors

@royyhuang made their first contribution in https://github.com/LMCache/LMCache/pull/1612
@TaeSooRhee-moreh made their first contribution in https://github.com/LMCache/LMCache/pull/1654
@yuezhu1 made their first contribution in https://github.com/LMCache/LMCache/pull/1609

Full Changelog: https://github.com/LMCache/LMCache/compare/v0.3.6...v0.3.7

Source: README.md, updated 2025-09-29

LMCache Files

Supercharge Your LLM with the Fastest KV Cache Layer
