Showing 17 open source projects for "cache memory simulator"

View related business solutions
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 1
    R-KV

    R-KV

    Redundancy-aware KV Cache Compression for Reasoning Models

    R-KV is an open-source research project that focuses on improving the efficiency of large language model inference through key-value cache compression techniques. Modern transformer models rely heavily on KV caches during autoregressive decoding, which store intermediate attention states to accelerate generation. However, these caches can consume large amounts of memory, especially in reasoning-oriented models with long context windows. R-KV introduces a method for compressing the KV cache during decoding, allowing models to maintain reasoning performance while reducing memory consumption and computational overhead. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    KVCache-Factory

    KVCache-Factory

    Unified KV Cache Compression Methods for Auto-Regressive Models

    KVCache-Factory is an open-source research framework designed to explore and implement unified key-value cache compression techniques for autoregressive transformer models. In large language models, the key-value cache stores intermediate attention states that enable efficient token generation during inference, but these caches can consume large amounts of GPU memory when handling long contexts. KVCache-Factory provides a platform for implementing and evaluating multiple compression strategies that reduce memory usage while preserving model performance. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    LMCache

    LMCache

    Supercharge Your LLM with the Fastest KV Cache Layer

    ...These capabilities aim to lower latency, cut GPU cycles, and stabilize performance for production workloads with overlapping prompts or retrieval-augmented contexts. The end result is a cache fabric for LLMs that complements engines rather than replacing them.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    django-health-check

    django-health-check

    a pluggable app that runs a full check on the deployment

    The primary intended use case is to monitor conditions via HTTP(S), with responses available in HTML and JSON formats. When you get back a response that includes one or more problems, you can then decide the appropriate course of action, which could include generating notifications and/or automating the replacement of a failing node with a new one. If you are monitoring health in a high-availability environment with a load balancer that returns responses from multiple nodes, please note that...
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    TensorRT LLM

    TensorRT LLM

    TensorRT LLM provides users with an easy-to-use Python API

    TensorRT-LLM is an open-source high-performance inference library specifically designed to optimize and accelerate large language model deployment on NVIDIA GPUs. It provides a Python-based API built on top of PyTorch that allows developers to define, customize, and deploy LLMs efficiently across a variety of hardware configurations, from single GPUs to large multi-node clusters. The library focuses on maximizing throughput and minimizing latency through advanced techniques such as...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    LingBot-World

    LingBot-World

    Advancing Open-source World Models

    LingBot-World is an open-source, high-fidelity world simulator designed to advance the state of world models through video generation. Built on top of Wan2.2, it enables realistic, dynamic environment simulation across diverse styles, including real-world, scientific, and stylized domains. LingBot-World supports long-term temporal consistency, maintaining coherent scenes and interactions over minute-level horizons. With real-time interactivity and sub-second latency at 16 FPS, it is...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 7
    Zappa - Serverless Python

    Zappa - Serverless Python

    Serverless Python

    ...With Zappa, each request is given its own virtual HTTP "server" by Amazon API Gateway. AWS handles the horizontal scaling automatically, so no requests ever time out. Each request then calls your application from a memory cache in AWS Lambda and returns the response via Python's WSGI interface. After your app returns, the "server" dies.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Zappa

    Zappa

    Serverless Python

    ...With Zappa, each request is given its own virtual HTTP "server" by Amazon API Gateway. AWS handles the horizontal scaling automatically, so no requests ever time out. Each request then calls your application from a memory cache in AWS Lambda and returns the response via Python's WSGI interface. After your app returns, the "server" dies.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    gitfs

    gitfs

    Version controlled file system

    gitfs is a FUSE file system that fully integrates with git. You can mount a remote repository's branch locally, and any subsequent changes made to the files will be automatically committed to the remote. gitfs was developed by the awesome engineering team at Presslabs, a Managed WordPress Hosting provider. gitfs was designed to bring the full powers of git to everyone, no matter how little they know about versioning. A user can mount any repository and all their changes will be automatically...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 10

    N.Tesla

    Numerical Transient Simulator for Power System

    ...N.Tesla is an open source project to provide students and researchers of EE with a tool to implement new models for power system study and research. N.Tesla is short for Numerical TransiEnt SimuLAtor for power system and is in memory of Nikola Tesla, one the greatest scientists in alternating current power system. If you have any questions, please contact the author by send email to [lichgang AT gmail DOT com] or leave feedback in the Discussion forum. Mailing List for N.Tesla: http://www.freelists.org/list/ntesla Mailing List for N.Tesla Developer: http://www.freelists.org/list/ntesla-dev
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    ppc_sim_booke

    e500v2 simulator

    An experimental ISS for freescale's e500v2 core. It is meant to simulate only cpu and memory subsystem and is purely intended for academic/learning purposes only.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    pykb

    Python Killboard Platform for EVE Online

    ...Aggregates exist along with code to maintain them to enable high speed reports for summarized data. A point calculation system is included as is support for parsing German and Russian kill mail text. Caching exists in two forms, a basic in memory dict based caching system suitable for single process tasks and integration with uwsgi's powerful and fast built in cache. Utility modules are provided for adding in killmails, backfilling killmails and dumping them via the command line. These also serve as reusable code for web apps or at least as usage examples. A sample web app using pykb is included.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Infinispan

    Infinispan

    High performance distributed in-memory key/value store

    Infinispan is an open source, Java based data grid platform. ***IMPORTANT*** Starting with Infinispan 5.0.0.FINAL, Infinispan releases are no longer hosted in Sourceforge. They can now be located in www.jboss.org/infinispan/downloads
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Lioness (Languages Interop Framework)
    Framework for making Windows applications that are one .exe file in AutoHotKey_L,C++,C#, VB.NET,Java,Groovy,Common Lisp,Nemerle,Ruby,Python,PHP,Lua,Tcl,Perl,Jint,S#,WSH VBScript,HTML/JavaScript/CSS,COM, PowerShell without compiling . For .NET 4.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Roxbird Download Manager
    RoxBird is a advanced graphical download manager. It can accelerate your download speed upto 40% than your normal speed. Among others its key-features include file-browser, Pre-viewer, recursive downloading,Scheduling & Queues, Write-cache, and more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    CDNsim is a GNU/LINUX simulation tool for CDNs, written in C++ (core) and python (GUI wizard). It models: redirection policies, cache policies, TCP/IP, batch simulations, statistics extraction and more. CDNsim is uses the OMNet++ library
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17

    Video DeDup

    Find duplicate videos by content

    ...Parse a video directory to create one image every n seconds, then identify duplicate images and show possible video duplicates for manual analysis. . 1f_parse scan video source folder, replicate with ffmpeg an image folder, then for each image create a fingerprint. . CompareV2 perform the fingerprints comparison and works in memory with constant memory usage. It is folder agnostic and parse fingerprints ordered by source file name. Since 201906 in free Pascal for x3 speed and less memory usage in multi-threading. . 3h_analyse is folder agnostic by first loading in memory current source and image folder. Then a lot of options and cache mechanism enable to get correct performance. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB