GPTCache

ChatGPT and various large language models (LLMs) boast incredible versatility, enabling the development of a wide range of applications. However, as your application grows in popularity and encounters higher traffic levels, the expenses related to LLM API calls can become substantial. Additionally, LLM services might exhibit slow response times, especially when dealing with a significant number of requests. To tackle this challenge, we have created GPTCache, a project dedicated to building a semantic cache for storing LLM responses. This project is undergoing swift development, and as such, the API may be subject to change at any time.

Features

GPTCache has been fully integrated with LangChain
A Library for Creating Semantic Cache for LLM Queries
You can quickly try GPTCache and put it into a production environment without heavy development
By default, only a limited number of libraries are installed to support the basic cache functionalities
Make sure that the Python version is 3.8.1 or higher
Examples included

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow GPTCache

GPTCache Web Site

Other Useful Business Software

Build Agents and Models on One Platform

Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.

Try It Free

Rate This Project

User Reviews

Be the first to post a review of GPTCache!

Additional Project Details

Operating Systems

Windows

Programming Language

Python

Related Categories

Python Artificial Intelligence Software

Registered

2023-05-29

Similar Business Software

StackAI

StackAI is an enterprise AI automation platform to build end-to-end internal tools and processes with AI agents in a fully compliant and secure way. Designed for large, regulated organizations, it enables teams to automate complex workflows across operations, compliance, finance, IT, and support...

See Software
Gaffa

Gaffa is a web scraping and browser automation API that gives developers full, real-browser control with a single API call no headless browsers, proxies, CAPTCHA handling, or scaling infrastructure to manage. JavaScript rendering is handled by default, so pages load exactly as they would for a...

See Software
Pipefy

Pipefy is the AI-driven Business Orchestration and Automation Technologies (BOAT) platform that delivers enterprise results in days, not months. Designed as a secure orchestration layer, Pipefy bridges the gap between rigid legacy systems (ERPs/CRMs) and agile business needs. It allows IT...

See Software
ONLYOFFICE Docs

ONLYOFFICE is an open-source project that offers cloud-based and self-hosted solutions for business of all sizes. The key product is ONLYOFFICE Docs, a secure office suite that seamlessly integrates into the most popular platforms, e.g. Odoo, Alfresco, Confluence, Pipedrive, Redmine, SuiteCRM...

See Software
Retool

Retool is the AI-native enterprise app development platform where teams build and ship production-ready apps — at AI speed, with enterprise governance built in. Describe what you need and get a working app, import React-based apps from Lovable, Replit, or Claude Code, or connect your AI agent...

See Software
Evertune

Evertune is the Generative Engine Optimization (GEO) platform for enterprise brands that need to know -- and improve -- how AI models represent them. When buyers use ChatGPT, Gemini, Perplexity or AI Overviews to research a category, your brand either shows up confidently or it doesn't show up...

See Software