PrimoCache (Romex Software)
About
LMCache is an open-source Knowledge Delivery Network (KDN) designed as a caching layer for large language model serving. It accelerates inference by reusing KV (key-value) caches across repeated or overlapping computations. It enables fast prompt caching: an LLM prefills recurring text only once and then reuses the stored KV caches, even in non-prefix positions, across multiple serving instances. This approach reduces time to first token, saves GPU cycles, and increases throughput in scenarios such as multi-round question answering or retrieval-augmented generation. LMCache supports KV cache offloading (moving cache from GPU to CPU or disk), cache sharing across instances, and disaggregated prefill, which separates the prefill and decoding phases for resource efficiency. It is compatible with inference engines like vLLM and TGI and supports compressed storage, blending techniques to merge caches, and multiple backend storage options.
|
About
Cache your frequently used applications, documents, and other data on faster storage devices and access them at up to RAM-like or SSD-like speeds. Make your computer more responsive for creating, gaming, and producing, with shorter boot and load times. Complete write requests very quickly by temporarily storing incoming data in RAM or SSD storage first and writing it back to the target disks later. Enable your computer to handle heavy or streaming write I/O while reducing writes and wear on disks. PrimoCache interoperates with almost all faster storage devices, including system memory, invisible memory, solid-state drives, and flash drives, to accelerate relatively slow storage. Set up caching and accelerate storage in just a few simple clicks. Special features such as multiple caching strategies, different writing modes, individual read/write space, and individual volume control make caching flexible enough for various scenarios.
|
|||||
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
|||||
Audience
AI engineers and infrastructure teams looking for a tool to lower latency, reduce compute cost, and scale throughput
|
Audience
Individuals in need of a software caching solution to accelerate storage
|
|||||
Support
Phone Support
24/7 Live Support
Online
|
Support
Phone Support
24/7 Live Support
Online
|
|||||
API
Offers API
|
API
Offers API
|
|||||
Pricing
Free
Free Version
Free Trial
|
Pricing
$29.95 per computer
Free Version
Free Trial
|
|||||
Training
Documentation
Webinars
Live Online
In Person
|
Training
Documentation
Webinars
Live Online
In Person
|
|||||
Company Information
LMCache
United States
lmcache.ai/
|
Company Information
Romex Software
Founded: 2008
China
www.romexsoftware.com/en-us/primo-cache/
|
|||||
Integrations
No info available.
|
Integrations
No info available.
|