Related Products
|
||||||
About
LMCache is an open source Knowledge Delivery Network (KDN) designed as a caching layer for large language model serving that accelerates inference by reusing KV (key-value) caches across repeated or overlapping computations. It enables fast prompt caching, allowing LLMs to “prefill” recurring text only once and then reuse those stored KV caches, even in non-prefix positions, across multiple serving instances. This approach reduces time to first token, saves GPU cycles, and increases throughput in scenarios such as multi-round question answering or retrieval augmented generation. LMCache supports KV cache offloading (moving cache from GPU to CPU or disk), cache sharing across instances, and disaggregated prefill, which separates the prefill and decoding phases for resource efficiency. It is compatible with inference engines like vLLM and TGI and supports compressed storage, blending techniques to merge caches, and multiple backend storage options.
|
About
RAGFlow is an open source Retrieval-Augmented Generation (RAG) engine that enhances information retrieval by combining Large Language Models (LLMs) with deep document understanding. It offers a streamlined RAG workflow suitable for businesses of any scale, providing truthful question-answering capabilities backed by well-founded citations from various complex formatted data. Key features include template-based chunking, compatibility with heterogeneous data sources, and automated RAG orchestration.
|
|||||
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
|||||
Audience
AI engineers and infrastructure teams looking for a tool to lower latency, reduce compute cost, and scale throughput
|
Audience
AI developers seeking a solution to integrate retrieval-augmented generation capabilities into their applications, enhancing accuracy and traceability in AI-driven tasks
|
|||||
Support
Phone Support
24/7 Live Support
Online
|
Support
Phone Support
24/7 Live Support
Online
|
|||||
API
Offers API
|
API
Offers API
|
|||||
Screenshots and Videos |
Screenshots and Videos |
|||||
Pricing
Free
Free Version
Free Trial
|
Pricing
Free
Free Version
Free Trial
|
|||||
Reviews/
|
Reviews/
|
|||||
Training
Documentation
Webinars
Live Online
In Person
|
Training
Documentation
Webinars
Live Online
In Person
|
|||||
Company InformationLMCache
United States
lmcache.ai/
|
Company InformationRAGFlow
ragflow.io
|
|||||
Alternatives |
Alternatives |
|||||
|
|
||||||
|
|
||||||
|
|
||||||
Categories |
Categories |
|||||
Integrations
Docker
Elestio
|
||||||
|
|
|