memory free download - SourceForge

Showing 456 open source projects for "memory"


Filters: Artificial Intelligence, Linux

1

Basic Memory

Persistent AI memory using local Markdown knowledge graphs

Basic Memory creates a semantic knowledge graph by linking related ideas, making it easier to retrieve, expand, and connect information over time. With a local-first design, your data stays private and portable, while optional cloud sync enables cross-device access. It combines simplicity with powerful indexing and search, giving you a flexible way to build long-term memory for projects, research, and workflows.

Downloads: 9 This Week

Last Update: 2 days ago
See Project
2

Kernel Memory

Research project. A Memory solution for users, teams, and applications

Kernel Memory is an open-source reference architecture developed by Microsoft to help developers build memory systems for AI applications powered by large language models. The project focuses on enabling applications to store, index, and retrieve information so that AI systems can incorporate external knowledge when generating responses. It supports scenarios such as document ingestion, semantic search, and retrieval-augmented generation, allowing language models to answer questions using contextual information from private or enterprise datasets. ...

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
3

Hindsight

Hindsight: Agent Memory That Learns

Hindsight is an advanced, open-source memory system for AI agents designed to enable long-term learning, reasoning, and consistency across interactions by treating memory as a first-class component of intelligence rather than a simple retrieval layer. It addresses one of the core limitations of modern AI agents, which is their inability to retain and meaningfully use past experiences over time, by introducing a structured, biomimetic memory architecture inspired by how human memory works. ...

Downloads: 40 This Week

Last Update: 3 days ago
See Project
4

FlashAttention

Fast and memory-efficient exact attention

FlashAttention is a high-performance deep learning optimization library that reimplements the attention mechanism used in transformer models to be significantly faster and more memory-efficient than standard implementations. It achieves this by using IO-aware algorithms that minimize memory reads and writes, reducing the quadratic memory overhead typically associated with attention operations. The project provides implementations of FlashAttention, FlashAttention-2, and newer iterations optimized for modern GPU architectures such as NVIDIA Hopper and AMD accelerators. ...

Downloads: 54 This Week

Last Update: 4 days ago
See Project
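The IO-aware idea in the FlashAttention description can be illustrated with a small, pure-Python sketch of block-wise attention using a running ("online") softmax. This is not the library's API or its GPU kernels. The function names, the single-query simplification, and the block size are assumptions chosen for illustration. It only shows why exact attention can be computed without holding all scores in memory at once.

```python
import math


def naive_attention(q, keys, values):
    """Reference: materialize every score, then softmax, then weighted sum."""
    scale = 1.0 / math.sqrt(len(q))
    scores = [scale * sum(a * b for a, b in zip(q, k)) for k in keys]
    top = max(scores)
    weights = [math.exp(s - top) for s in scores]
    total = sum(weights)
    dim = len(values[0])
    return [sum(w * v[j] for w, v in zip(weights, values)) / total
            for j in range(dim)]


def tiled_attention(q, keys, values, block_size=2):
    """Visit keys/values one block at a time, keeping only a running max,
    a running normalizer, and a running unnormalized output."""
    scale = 1.0 / math.sqrt(len(q))
    dim = len(values[0])
    running_max = float("-inf")
    running_sum = 0.0
    acc = [0.0] * dim
    for start in range(0, len(keys), block_size):
        k_blk = keys[start:start + block_size]
        v_blk = values[start:start + block_size]
        scores = [scale * sum(a * b for a, b in zip(q, k)) for k in k_blk]
        new_max = max(running_max, max(scores))
        # Rescale what was accumulated so far to the new maximum.
        correction = math.exp(running_max - new_max)
        running_sum *= correction
        acc = [a * correction for a in acc]
        for s, v in zip(scores, v_blk):
            w = math.exp(s - new_max)
            running_sum += w
            for j in range(dim):
                acc[j] += w * v[j]
        running_max = new_max
    return [a / running_sum for a in acc]
```

Both functions return the same result up to floating-point rounding. The tiled version never stores more than `block_size` scores at a time, which is the property the real kernels exploit to reduce memory reads and writes.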
5

whisper.cpp

Port of OpenAI's Whisper model in C/C++

...The command downloads the base.en model converted to custom ggml format and runs the inference on all .wav samples in the folder samples. whisper.cpp supports integer quantization of the Whisper ggml models. Quantized models require less memory and disk space and depending on the hardware can be processed more efficiently.

Downloads: 438 This Week

Last Update: 2026-06-01
See Project
6

MemMachine

Universal memory layer for AI Agents

MemMachine is a universal memory layer designed for AI agents that provides persistent, rich memory storage and retrieval capabilities so autonomous agent systems can recall context, personal preferences, and long-term interaction history across sessions, models, and use cases. Unlike ephemeral LLM prompt state, MemMachine supports distinct memory types—short-term conversational context, long-term persistent knowledge, and profile memory for personalized facts—persisted in optimized stores (e.g., graph databases for episodic lines of reasoning and SQL for user facts) to support robust, context-aware intelligence in agents. ...

Downloads: 7 This Week

Last Update: 2026-05-18
See Project
7

Beads

A memory upgrade for your coding agent

...This approach helps coding agents — and human collaborators — track which tasks depend on others, what has been done, and where workflows branch or reunify without losing important data. By leveraging Git as the storage backbone, the project ensures that memory is persistent, diffable, and sharable, with the ability to roll back, branch, or merge memory states just like source code.

Downloads: 11 This Week

Last Update: 2026-05-09
See Project
8

OpenMemory

Local long-term memory engine for AI apps with persistent storage

OpenMemory is a self-hosted memory engine designed to provide long-term, persistent storage for AI and LLM-powered applications. It enables developers to give otherwise stateless models a structured memory layer that can store, retrieve, and manage contextual information over time. OpenMemory is built around a hierarchical memory architecture that organizes data into semantic sectors and connects them through a graph-based structure for efficient retrieval.

Downloads: 5 This Week

Last Update: 2026-03-18
See Project
9

ReMe

Memory Management Kit for Agents

ReMe is a memory management kit for AI agents that gives them structured, persistent memory capabilities, enabling agents to extract, store, and reuse information across sessions, tasks, and interactions. It is designed to support long-running agent workflows where context matters and working memory alone isn’t enough, helping agents remember user preferences, task histories, and relevant past observations.

Downloads: 5 This Week

Last Update: 2026-06-03
See Project
10

SimpleMem

SimpleMem: Efficient Lifelong Memory for LLM Agents

SimpleMem is a lightweight memory-augmented model framework that helps developers build AI applications that retain long-term context and recall relevant information without overloading model context windows. It provides easy-to-use APIs for storing structured memory entries, querying those memories using semantic search, and retrieving context to augment prompt inputs for downstream processing.

Downloads: 5 This Week

Last Update: 2026-05-21
See Project
11

MemOS

AI memory OS for LLM and Agent systems

MemOS is an experimental operating system and runtime built around the concept of memory-centric computing, where memory objects are first-class citizens and program execution is organized around efficient, persistent memory access rather than traditional process and file system boundaries. The project explores rethinking system abstractions by tightly coupling computation with memory objects so that programs can operate on large datasets without expensive serialization or context switching. ...

Downloads: 2 This Week

Last Update: 3 days ago
See Project
12

MemPalace

The highest-scoring AI memory system ever benchmarked

MemPalace is an open-source AI memory system designed to solve one of the most persistent limitations of large language models: the loss of context between sessions. Instead of relying on summarization or selective extraction like most memory tools, it takes a radically different approach by storing conversations in their entirety and making them retrievable through structured organization and semantic search.

Downloads: 11 This Week

Last Update: 2026-06-06
See Project
13

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model

ChatGLM-6B is an open bilingual (Chinese + English) conversational language model based on the GLM architecture, with approximately 6.2 billion parameters. The project provides inference code, demos (command line, web, API), quantization support for lower-memory deployment, and tools for fine-tuning (e.g., via P-Tuning v2). It is optimized for dialogue and question answering, balancing performance against deployability on consumer hardware. It supports quantized inference (INT4, INT8) to reduce GPU memory requirements and switches automatically between full-precision and quantized modes to trade precision against memory.

Downloads: 11 This Week

Last Update: 2025-09-26
See Project
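To make the memory effect of quantization concrete, here is a rough back-of-the-envelope calculation based on the parameter count quoted in the ChatGLM-6B description. The FP16 baseline is an assumption, and the figures cover weights only. They ignore activations, caches, and any per-block quantization metadata, so real requirements will be higher.

```python
def weight_memory_gb(num_params, bits_per_param):
    """Weights-only footprint in gigabytes (1 GB = 10**9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


# "approximately 6.2 billion parameters" per the project description
PARAMS = 6.2e9

for name, bits in [("FP16", 16), ("INT8", 8), ("INT4", 4)]:
    print(f"{name}: {weight_memory_gb(PARAMS, bits):.1f} GB")
```

Under these assumptions the weights need roughly 12.4 GB at FP16, 6.2 GB at INT8, and 3.1 GB at INT4.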
14

Hermes Desktop

Desktop Companion for Hermes Agent

...It provides an intuitive graphical interface that eliminates the need to manage Hermes through command-line tools. Users can connect to local or remote Hermes instances while accessing chat, memory management, skills, tools, profiles, and scheduling from a single workspace. The platform supports multiple AI providers, including OpenAI, Anthropic, Google Gemini, Grok, OpenRouter, and local model endpoints. Its real-time streaming chat interface includes markdown rendering, syntax highlighting, tool progress tracking, and token usage monitoring. ...

Downloads: 79 This Week

Last Update: 2 days ago
See Project
15

MemoryOS

MemoryOS is designed to provide a memory operating system

MemoryOS is an open-source framework designed to provide a structured memory management system for AI agents and large language model applications. The project addresses one of the major limitations of modern language models: their inability to maintain long-term context beyond the limits of their prompt window. MemoryOS introduces a hierarchical memory architecture inspired by operating system memory management principles, allowing agents to store, update, retrieve, and generate information from multiple layers of memory.

Downloads: 0 This Week

Last Update: 2026-03-09
See Project
16

AirLLM

AirLLM 70B inference with single 4GB GPU

AirLLM is an open source Python library that enables extremely large language models to run on consumer hardware with very limited GPU memory. The project addresses one of the main barriers to local LLM experimentation by introducing a memory-efficient inference technique that loads model layers sequentially rather than storing the entire model in GPU memory. This layer-wise inference approach allows models with tens of billions of parameters to run on devices with only a few gigabytes of VRAM. ...

Downloads: 4 This Week

Last Update: 2026-03-10
See Project
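The layer-wise approach in the AirLLM description can be sketched with a toy model whose layers live on disk and are loaded one at a time. This is not AirLLM's API. The JSON file layout, the function names, and the tiny ReLU layers are all hypothetical and stand in for real model shards and GPU transfers.

```python
import json
import os
import tempfile


def save_layers(layers, directory):
    """Write each layer to its own file and return the paths in order."""
    paths = []
    for i, layer in enumerate(layers):
        path = os.path.join(directory, f"layer_{i}.json")
        with open(path, "w") as f:
            json.dump(layer, f)
        paths.append(path)
    return paths


def apply_layer(layer, x):
    """Toy layer: y = relu(W x + b)."""
    return [max(0.0, sum(w * xi for w, xi in zip(row, x)) + b)
            for row, b in zip(layer["weight"], layer["bias"])]


def layerwise_forward(paths, x):
    """Run the model while holding only one layer in memory at a time."""
    for path in paths:
        with open(path) as f:
            layer = json.load(f)  # only this layer is resident
        x = apply_layer(layer, x)
        del layer  # drop the reference before loading the next layer
    return x


if __name__ == "__main__":
    toy_layers = [
        {"weight": [[1, 0], [0, 1]], "bias": [0, 0]},
        {"weight": [[1, 1]], "bias": [-1]},
    ]
    with tempfile.TemporaryDirectory() as tmp:
        print(layerwise_forward(save_layers(toy_layers, tmp), [2, 3]))
```

Peak memory is bounded by the largest single layer plus the activations, at the cost of re-reading weights from storage on every forward pass.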
17

TrustClaw

A self-hostable personal AI agent with vector memory

TrustClaw is a self-hostable personal AI agent that connects conversational access, persistent memory, and external tool use into one assistant workflow. It is built for users who want their own AI agent rather than a fully hosted platform, making it easier to run, inspect, and adapt the system for personal or technical needs. The project uses vector memory so the assistant can retain and retrieve useful context across interactions.

Downloads: 7 This Week

Last Update: 2026-06-06
See Project
18

EverMemOS

Long-term memory OS for AI with structured recall and context awareness

EverMemOS is an open-source memory operating system built to give AI agents long-term, structured memory. It captures conversations, transforms them into organized memory units, and enables agents to recall past interactions with context and meaning. Instead of treating each prompt independently, it builds evolving user profiles, tracks preferences, and connects related events into coherent narratives.

Downloads: 1 This Week

Last Update: 2026-06-06
See Project
19

memsearch

A Markdown-first memory system, a standalone library for any AI agent

memsearch is a markdown-first memory system designed to provide long-term memory capabilities for AI agents through structured storage and semantic retrieval. It enables agents to store, organize, and retrieve information using embeddings and hybrid search techniques, ensuring that relevant context is always available. The system supports advanced features such as reranking and progressive disclosure, which help prioritize the most useful information for a given query. ...

Downloads: 5 This Week

Last Update: 3 days ago
See Project
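Hybrid search, as mentioned in the memsearch description, generally means merging a keyword ranking with an embedding-based ranking. The listing does not say how memsearch combines them, so the sketch below uses reciprocal rank fusion as an assumed, commonly used merging rule. The function name, the constant `k=60`, and the document IDs are illustrative only.

```python
def reciprocal_rank_fusion(rankings, k=60):
    """Merge several ranked lists of document IDs into one list.

    Each document scores 1 / (k + rank) per list it appears in;
    ties are broken by document ID for determinism.
    """
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=lambda d: (-scores[d], d))


keyword_hits = ["a", "b", "c"]   # e.g. from a keyword index
semantic_hits = ["b", "c", "a"]  # e.g. from embedding similarity
print(reciprocal_rank_fusion([keyword_hits, semantic_hits]))
```

A reranking step, if used, would then reorder only the top few fused results with a more expensive scoring model.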
20

MemU

MemU is an open-source memory framework for AI companions

MemU is an agentic memory layer for LLM applications, designed specifically for AI companions. It treats memory as an intelligent file system that automatically organizes and connects memories and evolves with them. The project aims to provide simple, fast, and reliable memory infrastructure for AI applications, along with tools and dedicated support for scaling them.

Downloads: 8 This Week

Last Update: 2026-03-23
See Project
21

Hermes Agent

The agent that grows with you

Hermes Agent is a fully open-source autonomous AI agent designed to run persistently on your own machine or server, becoming more capable the longer it operates by learning from experience and building reusable procedural skills. Rather than functioning as a stateless chatbot, it maintains long-term memory across sessions and can generate searchable “Skill Documents” that capture how it solved complex tasks so it doesn’t start from scratch each time. The agent interfaces with messaging platforms like Telegram, Discord, Slack, and WhatsApp through a single gateway process, and also offers an interactive terminal user interface with history, autocomplete, and streamable tool output. ...

Downloads: 117 This Week

Last Update: 2026-06-06
See Project
22

Claude-Mem

Claude Code plugin that automatically captures everything Claude does

Claude-Mem is a persistent memory compression system built specifically for Claude Code to preserve context across coding sessions. It automatically captures Claude’s tool usage, observations, and decisions, then compresses them into semantic memories that carry forward into future sessions. By enabling long-term continuity, Claude-Mem helps Claude “remember” project history, past fixes, and prior reasoning even after restarts or reconnects.

Downloads: 6 This Week

Last Update: 2 days ago
See Project
23

agentmemory

#1 Persistent memory for AI coding agents

agentmemory is a persistent memory server for AI coding agents that captures project context, past decisions, bugs, preferences, and implementation details so users do not have to re-explain the same information in every session. It works across tools such as Claude Code, Cursor, Gemini CLI, Codex CLI, OpenCode, Cline, and any client that supports MCP or REST APIs.

Downloads: 11 This Week

Last Update: 2026-06-07
See Project
24

GBrain

Garry's Opinionated OpenClaw/Hermes Agent Brain

GBrain is an open-source AI memory system designed to give autonomous agents persistent, structured, and scalable long-term memory across interactions and workflows. It operates by transforming large collections of markdown documents, personal notes, and external data into a searchable knowledge base backed by PostgreSQL and vector embeddings, enabling both semantic and keyword-based retrieval.

Downloads: 3 This Week

Last Update: 2026-05-16
See Project
25

bitsandbytes

Accessible large language models via k-bit quantization for PyTorch

bitsandbytes is an open-source library designed to make training and inference of large neural networks more efficient by dramatically reducing memory usage. Built primarily for the PyTorch ecosystem, the library introduces advanced quantization techniques that allow models to operate using reduced numerical precision while maintaining high accuracy. These optimizations enable large language models and other deep learning architectures to run on hardware with limited memory resources, including consumer-grade GPUs. ...

Downloads: 4 This Week

Last Update: 2026-03-04
See Project
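As a rough illustration of how reduced-precision storage saves memory, the following sketch applies block-wise absmax quantization to 8-bit integer codes. It is a simplified stand-in and not bitsandbytes' implementation or API. The function names and the small block size are assumptions, and the library's actual k-bit schemes are more sophisticated.

```python
def quantize_blockwise(weights, block_size=4):
    """Return (integer codes in [-127, 127], one float scale per block)."""
    codes, scales = [], []
    for start in range(0, len(weights), block_size):
        block = weights[start:start + block_size]
        absmax = max(abs(w) for w in block)
        # An all-zero block gets an arbitrary nonzero scale.
        scale = absmax / 127.0 if absmax > 0 else 1.0
        scales.append(scale)
        codes.extend(max(-127, min(127, round(w / scale))) for w in block)
    return codes, scales


def dequantize_blockwise(codes, scales, block_size=4):
    """Recover approximate floats from the codes and per-block scales."""
    return [c * scales[i // block_size] for i, c in enumerate(codes)]


weights = [0.1, -0.5, 0.25, 1.0, -2.0, 0.0, 0.75, 0.3]
codes, scales = quantize_blockwise(weights)
restored = dequantize_blockwise(codes, scales)
print(max(abs(a - b) for a, b in zip(weights, restored)))
```

Each weight is stored as one small integer plus a shared scale per block, and the reconstruction error per weight is at most half of that block's scale.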