Showing 87 open source projects for "cache"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 1
    requests-cache

    requests-cache

    Persistent HTTP cache for python requests

    requests-cache is a persistent HTTP cache that provides an easy way to get better performance with the Python requests library. Keep using the requests library you’re already familiar with. Add caching with a drop-in replacement for requests. The session, or install globally to add transparent caching to all request functions. Get sub-millisecond response times for cached responses.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Flask-Caching

    Flask-Caching

    A caching extension for Flask

    ...If, for example, a key does not exist in the cache, a new key-value entry will be created in the cache. Otherwise, the value (i.e. the cached result) of the key will be returned.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    R-KV

    R-KV

    Redundancy-aware KV Cache Compression for Reasoning Models

    R-KV is an open-source research project that focuses on improving the efficiency of large language model inference through key-value cache compression techniques. Modern transformer models rely heavily on KV caches during autoregressive decoding, which store intermediate attention states to accelerate generation. However, these caches can consume large amounts of memory, especially in reasoning-oriented models with long context windows. R-KV introduces a method for compressing the KV cache during decoding, allowing models to maintain reasoning performance while reducing memory consumption and computational overhead. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    GPTCache

    GPTCache

    Semantic cache for LLMs. Fully integrated with LangChain

    ...Additionally, LLM services might exhibit slow response times, especially when dealing with a significant number of requests. To tackle this challenge, we have created GPTCache, a project dedicated to building a semantic cache for storing LLM responses. This project is undergoing swift development, and as such, the API may be subject to change at any time.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Build Securely on Azure with Proven Frameworks Icon
    Build Securely on Azure with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 5
    KVCache-Factory

    KVCache-Factory

    Unified KV Cache Compression Methods for Auto-Regressive Models

    KVCache-Factory is an open-source research framework designed to explore and implement unified key-value cache compression techniques for autoregressive transformer models. In large language models, the key-value cache stores intermediate attention states that enable efficient token generation during inference, but these caches can consume large amounts of GPU memory when handling long contexts. KVCache-Factory provides a platform for implementing and evaluating multiple compression strategies that reduce memory usage while preserving model performance. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    CAG

    CAG

    Cache-Augmented Generation: A Simple, Efficient Alternative to RAG

    CAG, or Cache-Augmented Generation, is an experimental framework that explores an alternative architecture for integrating external knowledge into large language model responses. Traditional retrieval-augmented generation systems rely on real-time retrieval of documents from databases or vector stores during inference. CAG proposes a different approach by preloading relevant knowledge into the model’s context window and precomputing the model’s key-value cache before queries are processed. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    LMCache

    LMCache

    Supercharge Your LLM with the Fastest KV Cache Layer

    ...These capabilities aim to lower latency, cut GPU cycles, and stabilize performance for production workloads with overlapping prompts or retrieval-augmented contexts. The end result is a cache fabric for LLMs that complements engines rather than replacing them.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Django RQ

    Django RQ

    A simple app that provides django integration for RQ

    ...You can provide your own singleton Redis connection object to this function so that it will not create a new connection object for each queue definition. If you have django-redis or django-redis-cache installed, you can instruct django_rq to use the same connection information from your Redis cache. This has two advantages, it's DRY and it takes advantage of any optimization that may be going on in your cache setup (like using connection pooling or Hiredis.)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    python-binance

    python-binance

    Binance Exchange API python implementation for automated trading

    ...No need to generate timestamps yourself, the wrapper does it for you. Response exception handling. Websocket handling with reconnection and multiplexed connections. Symbol Depth Cache. Historical Kline/Candle fetching function. Withdraw functionality. Deposit addresses. Margin Trading. Futures Trading. Vanilla Options. Support other domains (.us, .jp, etc). The breaking changes include the migration from wapi to sapi endpoints which are related to the wallet endpoints detailed in the Binance Docs.
    Downloads: 2 This Week
    Last Update:
    See Project
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Access competitive interest rates on your digital assets.

    Generate interest, borrow against your crypto, and trade a range of cryptocurrencies — all in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 10
    django-prometheus

    django-prometheus

    Export Django monitoring metrics for Prometheus.io

    Export Django monitoring metrics for Prometheus.io. This library provides Prometheus metrics for Django-related operations. Prometheus uses Histogram based grouping for monitoring latencies. You can define custom buckets for latency, adding more buckets decreases performance but increases accuracy. SQLite, MySQL, and PostgreSQL databases can be monitored. Just replace the ENGINE property of your database, replacing django.db.backends with django_prometheus.db.backends. You may want to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    DjangoBlog

    DjangoBlog

    A blog system based on python3.8 and Django3.0

    ...Markdown supporting. Sidebar feature, new articles, most readings, tags, etc. OAuth Login supported, including Google, GitHub, Facebook, Weibo, QQ. Memcache supported, with cache auto refresh. Simple SEO Features, notify Google and Baidu when there was a new article or other things. Simple picture bed feature integrated. django-compressor integrated, auto-compressed css, js. Website exception email notification. When there is an unhandle exception, system will send an email notification. Wechat official account feature integrated. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Flet

    Flet

    Flet enables developers to easily build realtime web and mobile apps

    ...An internal tool or a dashboard for your team, weekend project, data entry form, kiosk app or high-fidelity prototype - Flet is an ideal framework to quickly hack great-looking interactive apps to serve a group of users. No more complex architecture with JavaScript frontend, REST API backend, database, cache, etc. With Flet you just write a monolith stateful app in Python only and get a multi-user, real-time Single-Page Application (SPA). To start developing with Flet, you just need your favorite IDE or text editor. With no SDKs, no thousands of dependencies, no complex tooling, Flet has a built-in web server with assets hosting and desktop clients.
    Downloads: 59 This Week
    Last Update:
    See Project
  • 13
    PSCoverDL

    PSCoverDL

    Small tool to download PS1/PS2 covers for DuckStation and PCSX2

    ...The tool supports multiple platforms and includes features such as multithreading for faster downloads and configuration files for customization. Its graphical interface simplifies the process, making it accessible even to non-technical users. It integrates directly with emulator cache files, ensuring accurate matching between games and covers. The system is designed for efficiency, reducing the manual effort required to curate large collections. Overall, pscoverdl complements the ps2-covers repository by providing automation and usability.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 14
    mac-cleanup-py

    mac-cleanup-py

    Python cleanup script for macOS

    mac-cleanup-py is a powerful cleanup script for macOS. This project is a rewrite of the original mac-cleanup-sh rewritten in Python.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    FastDeploy

    FastDeploy

    High-performance Inference and Deployment Toolkit for LLMs and VLMs

    ...The platform enables developers to deploy trained models quickly using optimized inference pipelines that support GPUs, specialized AI accelerators, and other hardware architectures. FastDeploy includes advanced acceleration technologies such as speculative decoding, multi-token prediction, and efficient KV cache management to improve throughput and latency during inference. It also offers compatibility with OpenAI-style APIs and vLLM-like interfaces, allowing developers to integrate deployed models easily into existing applications and services.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Werkzeug

    Werkzeug

    The comprehensive WSGI web application library

    ...Includes a routing system for matching URLs to endpoints and generating URLs for endpoints, with an extensible system for capturing variables from URLs. Includes HTTP utilities to handle entity tags, cache control, dates, user agents, cookies, files, and more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Django

    Django

    The Web framework for perfectionists with deadlines

    Django is a high-level, free and open-source Python web framework founded on the Model–Template–View (MTV) pattern, designed to facilitate rapid development of secure, maintainable, and scalable database-driven websites. First, read docs/intro/install.txt for instructions on installing Django. Next, work through the tutorials in order (docs/intro/tutorial01.txt, docs/intro/tutorial02.txt, etc.). If you want to set up an actual deployment server, read docs/howto/deployment/index.txt for...
    Downloads: 16 This Week
    Last Update:
    See Project
  • 18
    django-health-check

    django-health-check

    a pluggable app that runs a full check on the deployment

    The primary intended use case is to monitor conditions via HTTP(S), with responses available in HTML and JSON formats. When you get back a response that includes one or more problems, you can then decide the appropriate course of action, which could include generating notifications and/or automating the replacement of a failing node with a new one. If you are monitoring health in a high-availability environment with a load balancer that returns responses from multiple nodes, please note that...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 19
    HunyuanWorld-Voyager

    HunyuanWorld-Voyager

    RGBD video generation model conditioned on camera input

    HunyuanWorld-Voyager is a next-generation video diffusion framework developed by Tencent-Hunyuan for generating world-consistent 3D scene videos from a single input image. By leveraging user-defined camera paths, it enables immersive scene exploration and supports controllable video synthesis with high realism. The system jointly produces aligned RGB and depth video sequences, making it directly applicable to 3D reconstruction tasks. At its core, Voyager integrates a world-consistent video...
    Downloads: 24 This Week
    Last Update:
    See Project
  • 20
    WordOps

    WordOps

    Install and manage a high performance WordPress stack

    An essential toolset that eases WordPress site and server administration.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Ansible AUR

    Ansible AUR

    Ansible module to manage packages from the AUR

    ansible-aur is an Ansible module designed to manage packages from the Arch User Repository (AUR). It supports various AUR helpers such as yay, paru, pacaur, trizen, pikaur, and aurman, automatically selecting the first available helper or falling back to makepkg if none are found. This module facilitates the installation and upgrading of AUR packages within Ansible playbooks, streamlining package management for Arch Linux systems.​
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    ModelScope

    ModelScope

    Bring the notion of Model-as-a-Service to life

    ModelScope is built upon the notion of “Model-as-a-Service” (MaaS). It seeks to bring together most advanced machine learning models from the AI community, and streamlines the process of leveraging AI models in real-world applications. The core ModelScope library open-sourced in this repository provides the interfaces and implementations that allow developers to perform model inference, training and evaluation. In particular, with rich layers of API abstraction, the ModelScope library offers...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 23
    ContextForge MCP Gateway

    ContextForge MCP Gateway

    A Model Context Protocol (MCP) Gateway & Registry

    MCP Context Forge is a feature-rich gateway and registry that federates Model Context Protocol (MCP) servers and traditional REST services behind a single, governed endpoint. It exposes an MCP-compliant interface to clients while handling discovery, authentication, rate limiting, retries, and observability on the server side. The gateway scales horizontally, supports multi-cluster deployments on Kubernetes, and uses Redis for federation and caching across instances. Operators can define...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 24
    mac code

    mac code

    Claude Code, but it runs on your Mac for free

    mac code is a local AI coding agent designed to run large language models directly on Apple Silicon machines without relying on cloud services, effectively transforming a Mac into a self-contained AI development environment. The project focuses on enabling models that traditionally exceed available RAM to run efficiently by streaming model weights from SSD storage, thereby overcoming hardware limitations through innovative memory management techniques. It operates as a CLI-based assistant...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    tiny-llm

    tiny-llm

    A course of learning LLM inference serving on Apple Silicon

    tiny-llm is an educational open-source project designed to teach system engineers how large language model inference and serving systems work by building them from scratch. The project is structured as a guided course that walks developers through the process of implementing the core components required to run a modern language model, including attention mechanisms, token generation, and optimization techniques. Rather than relying on high-level machine learning frameworks, the codebase uses...
    Downloads: 2 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • Next
MongoDB Logo MongoDB