Showing 456 open source projects for "memory"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 1
    Basic Memory

    Basic Memory

    Persistent AI memory using local Markdown knowledge graphs

    Basic Memory creates a semantic knowledge graph by linking related ideas, making it easier to retrieve, expand, and connect information over time. With a local-first design, your data stays private and portable, while optional cloud sync enables cross-device access. It combines simplicity with powerful indexing and search, giving you a flexible way to build long-term memory for projects, research, and workflows.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 2
    Kernel Memory

    Kernel Memory

    Research project. A Memory solution for users, teams, and applications

    Kernel Memory is an open-source reference architecture developed by Microsoft to help developers build memory systems for AI applications powered by large language models. The project focuses on enabling applications to store, index, and retrieve information so that AI systems can incorporate external knowledge when generating responses. It supports scenarios such as document ingestion, semantic search, and retrieval-augmented generation, allowing language models to answer questions using contextual information from private or enterprise datasets. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Hindsight

    Hindsight

    Hindsight: Agent Memory That Learns

    Hindsight is an advanced, open-source memory system for AI agents designed to enable long-term learning, reasoning, and consistency across interactions by treating memory as a first-class component of intelligence rather than a simple retrieval layer. It addresses one of the core limitations of modern AI agents, which is their inability to retain and meaningfully use past experiences over time, by introducing a structured, biomimetic memory architecture inspired by how human memory works. ...
    Downloads: 40 This Week
    Last Update:
    See Project
  • 4
    FlashAttention

    FlashAttention

    Fast and memory-efficient exact attention

    FlashAttention is a high-performance deep learning optimization library that reimplements the attention mechanism used in transformer models to be significantly faster and more memory-efficient than standard implementations. It achieves this by using IO-aware algorithms that minimize memory reads and writes, reducing the quadratic memory overhead typically associated with attention operations. The project provides implementations of FlashAttention, FlashAttention-2, and newer iterations optimized for modern GPU architectures such as NVIDIA Hopper and AMD accelerators. ...
    Downloads: 54 This Week
    Last Update:
    See Project
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 5
    whisper.cpp

    whisper.cpp

    Port of OpenAI's Whisper model in C/C++

    ...The command downloads the base.en model converted to custom ggml format and runs the inference on all .wav samples in the folder samples. whisper.cpp supports integer quantization of the Whisper ggml models. Quantized models require less memory and disk space and depending on the hardware can be processed more efficiently.
    Downloads: 438 This Week
    Last Update:
    See Project
  • 6
    MemMachine

    MemMachine

    Universal memory layer for AI Agents

    MemMachine is a universal memory layer designed for AI agents that provides persistent, rich memory storage and retrieval capabilities so autonomous agent systems can recall context, personal preferences, and long-term interaction history across sessions, models, and use cases. Unlike ephemeral LLM prompt state, MemMachine supports distinct memory types—short-term conversational context, long-term persistent knowledge, and profile memory for personalized facts—persisted in optimized stores (e.g., graph databases for episodic lines of reasoning and SQL for user facts) to support robust, context-aware intelligence in agents. ...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 7
    Beads

    Beads

    A memory upgrade for your coding agent

    ...This approach helps coding agents — and human collaborators — track which tasks depend on others, what has been done, and where workflows branch or reunify without losing important data. By leveraging Git as the storage backbone, the project ensures that memory is persistent, diffable, and sharable, with the ability to roll back, branch, or merge memory states just like source code.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 8
    OpenMemory

    OpenMemory

    Local long-term memory engine for AI apps with persistent storage

    OpenMemory is a self-hosted memory engine designed to provide long-term, persistent storage for AI and LLM-powered applications. It enables developers to give otherwise stateless models a structured memory layer that can store, retrieve, and manage contextual information over time. OpenMemory is built around a hierarchical memory architecture that organizes data into semantic sectors and connects them through a graph-based structure for efficient retrieval.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 9
    ReMe

    ReMe

    Memory Management Kit for Agents

    ReMe is a memory management kit for AI agents that gives them structured, persistent memory capabilities, enabling agents to extract, store, and reuse information across sessions, tasks, and interactions. It is designed to support long-running agent workflows where context matters and working memory alone isn’t enough, helping agents remember user preferences, task histories, and relevant past observations.
    Downloads: 5 This Week
    Last Update:
    See Project
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 10
    SimpleMem

    SimpleMem

    SimpleMem: Efficient Lifelong Memory for LLM Agents

    SimpleMem is a lightweight memory-augmented model framework that helps developers build AI applications that retain long-term context and recall relevant information without overloading model context windows. It provides easy-to-use APIs for storing structured memory entries, querying those memories using semantic search, and retrieving context to augment prompt inputs for downstream processing.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 11
    MemOS

    MemOS

    AI memory OS for LLM and Agent systems

    MemOS is an experimental operating system and runtime built around the concept of memory-centric computing, where memory objects are first-class citizens and program execution is organized around efficient, persistent memory access rather than traditional process and file system boundaries. The project explores rethinking system abstractions by tightly coupling computation with memory objects so that programs can operate on large datasets without expensive serialization or context switching. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12
    MemPalace

    MemPalace

    The highest-scoring AI memory system ever benchmarked

    MemPalace is an open-source AI memory system designed to solve one of the most persistent limitations of large language models: the loss of context between sessions. Instead of relying on summarization or selective extraction like most memory tools, it takes a radically different approach by storing conversations in their entirety and making them retrievable through structured organization and semantic search.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 13
    ChatGLM-6B

    ChatGLM-6B

    ChatGLM-6B: An Open Bilingual Dialogue Language Model

    ChatGLM-6B is an open bilingual (Chinese + English) conversational language model based on the GLM architecture, with approximately 6.2 billion parameters. The project provides inference code, demos (command line, web, API), quantization support for lower memory deployment, and tools for finetuning (e.g., via P-Tuning v2). It is optimized for dialogue and question answering with a balance between performance and deployability in consumer hardware settings. Support for quantized inference (INT4, INT8) to reduce GPU memory requirements. Automatic mode switching between precision/memory tradeoffs (full/quantized).
    Downloads: 11 This Week
    Last Update:
    See Project
  • 14
    Hermes Desktop

    Hermes Desktop

    Desktop Companion for Hermes Agent

    ...It provides an intuitive graphical interface that eliminates the need to manage Hermes through command-line tools. Users can connect to local or remote Hermes instances while accessing chat, memory management, skills, tools, profiles, and scheduling from a single workspace. The platform supports multiple AI providers, including OpenAI, Anthropic, Google Gemini, Grok, OpenRouter, and local model endpoints. Its real-time streaming chat interface includes markdown rendering, syntax highlighting, tool progress tracking, and token usage monitoring. ...
    Downloads: 79 This Week
    Last Update:
    See Project
  • 15
    MemoryOS

    MemoryOS

    MemoryOS is designed to provide a memory operating system

    MemoryOS is an open-source framework designed to provide a structured memory management system for AI agents and large language model applications. The project addresses one of the major limitations of modern language models: their inability to maintain long-term context beyond the limits of their prompt window. MemoryOS introduces a hierarchical memory architecture inspired by operating system memory management principles, allowing agents to store, update, retrieve, and generate information from multiple layers of memory.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    AirLLM

    AirLLM

    AirLLM 70B inference with single 4GB GPU

    AirLLM is an open source Python library that enables extremely large language models to run on consumer hardware with very limited GPU memory. The project addresses one of the main barriers to local LLM experimentation by introducing a memory-efficient inference technique that loads model layers sequentially rather than storing the entire model in GPU memory. This layer-wise inference approach allows models with tens of billions of parameters to run on devices with only a few gigabytes of VRAM. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 17
    TrustClaw

    TrustClaw

    A self-hostable personal AI agent with vector memory

    TrustClaw is a self-hostable personal AI agent that connects conversational access, persistent memory, and external tool use into one assistant workflow. It is built for users who want their own AI agent rather than a fully hosted platform, making it easier to run, inspect, and adapt the system for personal or technical needs. The project uses vector memory so the assistant can retain and retrieve useful context across interactions.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 18
    EverMemOS

    EverMemOS

    Long-term memory OS for AI with structured recall and context awarenes

    EverMemOS is an open-source memory operating system built to give AI agents long-term, structured memory. It captures conversations, transforms them into organized memory units, and enables agents to recall past interactions with context and meaning. Instead of treating each prompt independently, it builds evolving user profiles, tracks preferences, and connects related events into coherent narratives.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    memsearch

    memsearch

    A Markdown-first memory system, a standalone library for any AI agent

    memsearch is a markdown-first memory system designed to provide long-term memory capabilities for AI agents through structured storage and semantic retrieval. It enables agents to store, organize, and retrieve information using embeddings and hybrid search techniques, ensuring that relevant context is always available. The system supports advanced features such as reranking and progressive disclosure, which help prioritize the most useful information for a given query. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 20
    MemU

    MemU

    MemU is an open-source memory framework for AI companions

    MemU is an agentic memory layer for LLM applications, specifically designed for AI companions. Transform your memory into an intelligent file system that automatically organizes, connects, and evolves with your memories. Simple, fast, and reliable memory infrastructure for AI applications. Powerful tools and dedicated support to scale your AI applications with confidence.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 21
    Hermes Agent

    Hermes Agent

    The agent that grows with you

    Hermes Agent is a fully open-source autonomous AI agent designed to run persistently on your own machine or server, becoming more capable the longer it operates by learning from experience and building reusable procedural skills. Rather than functioning as a stateless chatbot, it maintains long-term memory across sessions and can generate searchable “Skill Documents” that capture how it solved complex tasks so it doesn’t start from scratch each time. The agent interfaces with messaging platforms like Telegram, Discord, Slack, and WhatsApp through a single gateway process, and also offers an interactive terminal user interface with history, autocomplete, and streamable tool output. ...
    Downloads: 117 This Week
    Last Update:
    See Project
  • 22
    Claude-Mem

    Claude-Mem

    Claude Code plugin that automatically captures everything Claude does

    Claude-Mem is a persistent memory compression system built specifically for Claude Code to preserve context across coding sessions. It automatically captures Claude’s tool usage, observations, and decisions, then compresses them into semantic memories that carry forward into future sessions. By enabling long-term continuity, Claude-Mem helps Claude “remember” project history, past fixes, and prior reasoning even after restarts or reconnects.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 23
    agentmemory

    agentmemory

    #1 Persistent memory for AI coding agents

    agentmemory is a persistent memory server for AI coding agents that captures project context, past decisions, bugs, preferences, and implementation details so users do not have to re-explain the same information in every session. It works across tools such as Claude Code, Cursor, Gemini CLI, Codex CLI, OpenCode, Cline, and any client that supports MCP or REST APIs.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 24
    GBrain

    GBrain

    Garry's Opinionated OpenClaw/Hermes Agent Brain

    GBrain is an open-source AI memory system designed to give autonomous agents persistent, structured, and scalable long-term memory across interactions and workflows. It operates by transforming large collections of markdown documents, personal notes, and external data into a searchable knowledge base backed by PostgreSQL and vector embeddings, enabling both semantic and keyword-based retrieval.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 25
    bitsandbytes

    bitsandbytes

    Accessible large language models via k-bit quantization for PyTorch

    bitsandbytes is an open-source library designed to make training and inference of large neural networks more efficient by dramatically reducing memory usage. Built primarily for the PyTorch ecosystem, the library introduces advanced quantization techniques that allow models to operate using reduced numerical precision while maintaining high accuracy. These optimizations enable large language models and other deep learning architectures to run on hardware with limited memory resources, including consumer-grade GPUs. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
Auth0 Logo