memory free download - SourceForge

Showing 456 open source projects for "memory"

Filters: Artificial Intelligence, Windows

1

Basic Memory

Persistent AI memory using local Markdown knowledge graphs

Basic Memory creates a semantic knowledge graph by linking related ideas, making it easier to retrieve, expand, and connect information over time. With a local-first design, your data stays private and portable, while optional cloud sync enables cross-device access. It combines simplicity with powerful indexing and search, giving you a flexible way to build long-term memory for projects, research, and workflows.

Downloads: 6 This Week

Last Update: 2 days ago
2

Kernel Memory

Research project. A Memory solution for users, teams, and applications

Kernel Memory is an open-source reference architecture developed by Microsoft to help developers build memory systems for AI applications powered by large language models. The project focuses on enabling applications to store, index, and retrieve information so that AI systems can incorporate external knowledge when generating responses. It supports scenarios such as document ingestion, semantic search, and retrieval-augmented generation, allowing language models to answer questions using contextual information from private or enterprise datasets. ...

Downloads: 0 This Week

Last Update: 2026-03-06
3

Hindsight

Hindsight: Agent Memory That Learns

Hindsight is an advanced, open-source memory system for AI agents designed to enable long-term learning, reasoning, and consistency across interactions by treating memory as a first-class component of intelligence rather than a simple retrieval layer. It addresses one of the core limitations of modern AI agents, which is their inability to retain and meaningfully use past experiences over time, by introducing a structured, biomimetic memory architecture inspired by how human memory works. ...

Downloads: 29 This Week

Last Update: 3 days ago
4

whisper.cpp

Port of OpenAI's Whisper model in C/C++

...The command downloads the base.en model converted to a custom ggml format and runs inference on all .wav samples in the samples folder. whisper.cpp supports integer quantization of the Whisper ggml models. Quantized models require less memory and disk space and, depending on the hardware, can be processed more efficiently.

Downloads: 420 This Week

Last Update: 2026-06-01
5

FlashAttention

Fast and memory-efficient exact attention

FlashAttention is a high-performance deep learning optimization library that reimplements the attention mechanism used in transformer models to be significantly faster and more memory-efficient than standard implementations. It achieves this by using IO-aware algorithms that minimize memory reads and writes, reducing the quadratic memory overhead typically associated with attention operations. The project provides implementations of FlashAttention, FlashAttention-2, and newer iterations optimized for modern GPU architectures such as NVIDIA Hopper and AMD accelerators. ...

Downloads: 37 This Week

Last Update: 4 days ago
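
The memory saving described for FlashAttention comes from never materializing the full score matrix. A minimal pure-Python sketch of that idea for a single query follows: keys and values are processed in blocks while a running maximum and running sum keep the softmax exact. This is an illustration only, not the library's GPU kernels; the function names, the block size, and the sample data are assumptions made for the example.

```python
import math

def _score(q, k):
    # Scaled dot product between one query and one key.
    return sum(a * b for a, b in zip(q, k)) / math.sqrt(len(q))

def naive_attention(q, keys, values):
    """Reference version: compute every score first, then softmax."""
    scores = [_score(q, k) for k in keys]
    m = max(scores)
    weights = [math.exp(s - m) for s in scores]
    total = sum(weights)
    dim = len(values[0])
    return [sum(w * v[d] for w, v in zip(weights, values)) / total
            for d in range(dim)]

def tiled_attention(q, keys, values, block_size=2):
    """Blockwise version: only one block of scores exists at a time."""
    running_max = float("-inf")
    running_sum = 0.0
    acc = [0.0] * len(values[0])
    for start in range(0, len(keys), block_size):
        k_blk = keys[start:start + block_size]
        v_blk = values[start:start + block_size]
        scores = [_score(q, k) for k in k_blk]
        new_max = max(running_max, max(scores))
        # Rescale what was accumulated under the old maximum.
        rescale = math.exp(running_max - new_max)
        running_sum *= rescale
        acc = [a * rescale for a in acc]
        for s, v in zip(scores, v_blk):
            w = math.exp(s - new_max)
            running_sum += w
            acc = [a + w * x for a, x in zip(acc, v)]
        running_max = new_max
    return [a / running_sum for a in acc]

# Hypothetical sample data: one query, five key/value pairs.
q = [0.1, -0.2, 0.3]
keys = [[0.5, 0.1, -0.3], [1.0, -1.0, 0.2], [0.0, 0.4, 0.9],
        [-0.7, 0.3, 0.3], [2.0, 0.5, -1.5]]
values = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5], [2.0, -1.0], [-1.0, 3.0]]

out_naive = naive_attention(q, keys, values)
out_tiled = tiled_attention(q, keys, values, block_size=2)
```

Both functions should agree up to floating-point rounding; the blockwise one just needs working memory proportional to the block size rather than to the number of keys.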
6

Cherry Studio

Cherry Studio is a desktop client that supports multiple LLMs

Cherry Studio is a cross-platform desktop client that integrates multiple large language model providers into a unified interface for creating and using AI assistants, supporting customization and multi-model conversations. Features include a Selection Assistant with smart content selection enhancement, Deep Research with advanced research capabilities, a Memory System with global context awareness, Document Preprocessing with improved document handling, and an MCP Marketplace for the Model Context Protocol ecosystem.

Downloads: 8,754 This Week

Last Update: 2026-06-07
7

Beads

A memory upgrade for your coding agent

...This approach helps coding agents — and human collaborators — track which tasks depend on others, what has been done, and where workflows branch or reunify without losing important data. By leveraging Git as the storage backbone, the project ensures that memory is persistent, diffable, and sharable, with the ability to roll back, branch, or merge memory states just like source code.

Downloads: 8 This Week

Last Update: 2026-05-09
8

MemMachine

Universal memory layer for AI Agents

MemMachine is a universal memory layer designed for AI agents that provides persistent, rich memory storage and retrieval capabilities so autonomous agent systems can recall context, personal preferences, and long-term interaction history across sessions, models, and use cases. Unlike ephemeral LLM prompt state, MemMachine supports distinct memory types—short-term conversational context, long-term persistent knowledge, and profile memory for personalized facts—persisted in optimized stores (e.g., graph databases for episodic lines of reasoning and SQL for user facts) to support robust, context-aware intelligence in agents. ...

Downloads: 4 This Week

Last Update: 2026-05-18
9

OpenMemory

Local long-term memory engine for AI apps with persistent storage

OpenMemory is a self-hosted memory engine designed to provide long-term, persistent storage for AI and LLM-powered applications. It enables developers to give otherwise stateless models a structured memory layer that can store, retrieve, and manage contextual information over time. OpenMemory is built around a hierarchical memory architecture that organizes data into semantic sectors and connects them through a graph-based structure for efficient retrieval.

Downloads: 4 This Week

Last Update: 2026-03-18
10

ReMe

Memory Management Kit for Agents

ReMe is a memory management kit for AI agents that gives them structured, persistent memory capabilities, enabling agents to extract, store, and reuse information across sessions, tasks, and interactions. It is designed to support long-running agent workflows where context matters and working memory alone isn’t enough, helping agents remember user preferences, task histories, and relevant past observations.

Downloads: 3 This Week

Last Update: 2026-06-03
11

SimpleMem

SimpleMem: Efficient Lifelong Memory for LLM Agents

SimpleMem is a lightweight memory-augmented model framework that helps developers build AI applications that retain long-term context and recall relevant information without overloading model context windows. It provides easy-to-use APIs for storing structured memory entries, querying those memories using semantic search, and retrieving context to augment prompt inputs for downstream processing.

Downloads: 4 This Week

Last Update: 2026-05-21
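
The store, query, and augment flow described for SimpleMem can be sketched in a few lines. The example below is a toy, not SimpleMem's API: the `ToyMemory` class and its methods are hypothetical, and word-count vectors with cosine similarity stand in for the embedding-based semantic search a real system would use.

```python
import math
from collections import Counter

class ToyMemory:
    """Hypothetical in-memory store; not the SimpleMem interface."""

    def __init__(self):
        self.entries = []  # list of (text, word-count vector)

    @staticmethod
    def _vec(text):
        # Stand-in for an embedding: lowercase word counts.
        return Counter(text.lower().split())

    @staticmethod
    def _cosine(a, b):
        dot = sum(a[t] * b[t] for t in a if t in b)
        na = math.sqrt(sum(c * c for c in a.values()))
        nb = math.sqrt(sum(c * c for c in b.values()))
        return dot / (na * nb) if na and nb else 0.0

    def add(self, text):
        self.entries.append((text, self._vec(text)))

    def search(self, query, top_k=1):
        qv = self._vec(query)
        ranked = sorted(self.entries,
                        key=lambda e: self._cosine(qv, e[1]),
                        reverse=True)
        return [text for text, _ in ranked[:top_k]]

memory = ToyMemory()
memory.add("user prefers dark mode in the editor")
memory.add("project deadline is friday")
memory.add("user lives in berlin")

question = "which editor theme does the user like"
hits = memory.search(question, top_k=1)

# Only the retrieved entries are added to the prompt, not the whole store.
prompt = "Context:\n" + "\n".join(hits) + "\n\nQuestion: " + question
```

Retrieving a small top-k slice is what keeps the prompt short even as the store grows.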
12

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model

ChatGLM-6B is an open bilingual (Chinese + English) conversational language model based on the GLM architecture, with approximately 6.2 billion parameters. The project provides inference code, demos (command line, web, API), quantization support for lower-memory deployment, and tools for finetuning (e.g., via P-Tuning v2). It is optimized for dialogue and question answering with a balance between performance and deployability in consumer hardware settings. It supports quantized inference (INT4, INT8) to reduce GPU memory requirements, and it can switch between full-precision and quantized modes to trade precision against memory.

Downloads: 12 This Week

Last Update: 2025-09-26
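
A rough sense of why quantization matters for ChatGLM-6B comes from multiplying the parameter count by the bits stored per parameter. The sketch below covers weights only and assumes that "full" precision means 16-bit weights; activations, caches, and quantization metadata are ignored, so real usage will be higher. The helper name is hypothetical.

```python
def weight_memory_gb(num_params, bits_per_param):
    """Weights-only estimate in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9

# About 6.2 billion parameters, as stated in the project description.
PARAMS = 6_200_000_000

# Assumption: the unquantized model stores 16 bits per weight.
estimates = {
    label: weight_memory_gb(PARAMS, bits)
    for label, bits in [("FP16", 16), ("INT8", 8), ("INT4", 4)]
}

for label, gb in estimates.items():
    print(f"{label}: ~{gb:.1f} GB")
# prints:
# FP16: ~12.4 GB
# INT8: ~6.2 GB
# INT4: ~3.1 GB
```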
13

MemPalace

The highest-scoring AI memory system ever benchmarked

MemPalace is an open-source AI memory system designed to solve one of the most persistent limitations of large language models: the loss of context between sessions. Instead of relying on summarization or selective extraction like most memory tools, it takes a radically different approach by storing conversations in their entirety and making them retrievable through structured organization and semantic search.

Downloads: 8 This Week

Last Update: 8 hours ago
14

Hermes Desktop

Desktop Companion for Hermes Agent

...It provides an intuitive graphical interface that eliminates the need to manage Hermes through command-line tools. Users can connect to local or remote Hermes instances while accessing chat, memory management, skills, tools, profiles, and scheduling from a single workspace. The platform supports multiple AI providers, including OpenAI, Anthropic, Google Gemini, Grok, OpenRouter, and local model endpoints. Its real-time streaming chat interface includes markdown rendering, syntax highlighting, tool progress tracking, and token usage monitoring. ...

Downloads: 75 This Week

Last Update: 2 days ago
15

MemOS

AI memory OS for LLM and Agent systems

MemOS is an experimental operating system and runtime built around the concept of memory-centric computing, where memory objects are first-class citizens and program execution is organized around efficient, persistent memory access rather than traditional process and file system boundaries. The project explores rethinking system abstractions by tightly coupling computation with memory objects so that programs can operate on large datasets without expensive serialization or context switching. ...

Downloads: 1 This Week

Last Update: 3 days ago
16

MemoryOS

MemoryOS is designed to provide a memory operating system

MemoryOS is an open-source framework designed to provide a structured memory management system for AI agents and large language model applications. The project addresses one of the major limitations of modern language models: their inability to maintain long-term context beyond the limits of their prompt window. MemoryOS introduces a hierarchical memory architecture inspired by operating system memory management principles, allowing agents to store, update, retrieve, and generate information from multiple layers of memory.

Downloads: 0 This Week

Last Update: 2026-03-09
17

AirLLM

AirLLM: 70B inference with a single 4GB GPU

AirLLM is an open source Python library that enables extremely large language models to run on consumer hardware with very limited GPU memory. The project addresses one of the main barriers to local LLM experimentation by introducing a memory-efficient inference technique that loads model layers sequentially rather than storing the entire model in GPU memory. This layer-wise inference approach allows models with tens of billions of parameters to run on devices with only a few gigabytes of VRAM. ...

Downloads: 4 This Week

Last Update: 2026-03-10
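
The layer-wise approach described for AirLLM can be illustrated with a toy model whose layers live on disk and are loaded one at a time during the forward pass. This is a sketch of the general idea only, not AirLLM's code or API: the JSON file layout, the tiny ReLU layers, and all function names are assumptions made for the example.

```python
import json
import os
import tempfile

def save_layers(layers, directory):
    """Write each layer to its own file, as a stand-in for model shards."""
    paths = []
    for i, layer in enumerate(layers):
        path = os.path.join(directory, f"layer_{i}.json")
        with open(path, "w") as f:
            json.dump(layer, f)
        paths.append(path)
    return paths

def apply_layer(layer, x):
    """Toy layer: y = relu(W x + b)."""
    return [max(0.0, sum(w * xi for w, xi in zip(row, x)) + b)
            for row, b in zip(layer["weight"], layer["bias"])]

def layerwise_forward(paths, x):
    """Load one layer, apply it, drop it, then move to the next."""
    for path in paths:
        with open(path) as f:
            layer = json.load(f)  # only this layer's weights are resident
        x = apply_layer(layer, x)
        del layer  # release the weights before loading the next layer
    return x

# Hypothetical two-layer model with 2x2 weights.
layers = [
    {"weight": [[1.0, 0.0], [0.0, 1.0]], "bias": [0.5, -0.5]},
    {"weight": [[2.0, 1.0], [-1.0, 1.0]], "bias": [0.0, 0.0]},
]

with tempfile.TemporaryDirectory() as tmp:
    paths = save_layers(layers, tmp)
    result = layerwise_forward(paths, [1.0, 2.0])

print(result)  # prints [4.5, 0.0]
```

Peak memory is bounded by the largest single layer plus the activations, at the cost of re-reading weights from storage on every pass.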
18

TrustClaw

A self-hostable personal AI agent with vector memory

TrustClaw is a self-hostable personal AI agent that connects conversational access, persistent memory, and external tool use into one assistant workflow. It is built for users who want their own AI agent rather than a fully hosted platform, making it easier to run, inspect, and adapt the system for personal or technical needs. The project uses vector memory so the assistant can retain and retrieve useful context across interactions.

Downloads: 7 This Week

Last Update: 2026-06-06
19

EverMemOS

Long-term memory OS for AI with structured recall and context awareness

EverMemOS is an open-source memory operating system built to give AI agents long-term, structured memory. It captures conversations, transforms them into organized memory units, and enables agents to recall past interactions with context and meaning. Instead of treating each prompt independently, it builds evolving user profiles, tracks preferences, and connects related events into coherent narratives.

Downloads: 1 This Week

Last Update: 2026-06-06
20

memsearch

A Markdown-first memory system, a standalone library for any AI agent

memsearch is a markdown-first memory system designed to provide long-term memory capabilities for AI agents through structured storage and semantic retrieval. It enables agents to store, organize, and retrieve information using embeddings and hybrid search techniques, ensuring that relevant context is always available. The system supports advanced features such as reranking and progressive disclosure, which help prioritize the most useful information for a given query. ...

Downloads: 6 This Week

Last Update: 4 days ago
21

Hermes Agent

The agent that grows with you

Hermes Agent is a fully open-source autonomous AI agent designed to run persistently on your own machine or server, becoming more capable the longer it operates by learning from experience and building reusable procedural skills. Rather than functioning as a stateless chatbot, it maintains long-term memory across sessions and can generate searchable “Skill Documents” that capture how it solved complex tasks so it doesn’t start from scratch each time. The agent interfaces with messaging platforms like Telegram, Discord, Slack, and WhatsApp through a single gateway process, and also offers an interactive terminal user interface with history, autocomplete, and streamable tool output. ...

Downloads: 134 This Week

Last Update: 2026-06-06
22

MemU

MemU is an open-source memory framework for AI companions

MemU is an agentic memory layer for LLM applications, specifically designed for AI companions. Transform your memory into an intelligent file system that automatically organizes, connects, and evolves with your memories. Simple, fast, and reliable memory infrastructure for AI applications. Powerful tools and dedicated support to scale your AI applications with confidence.

Downloads: 7 This Week

Last Update: 2026-03-23
23

Claude-Flow

The leading agent orchestration platform for Claude

...The platform supports both quick swarm tasks and persistent multi-agent sessions known as hives, facilitating distributed AI collaboration with persistent contextual memory. At its core, Claude-Flow integrates Dynamic Agent Architecture (DAA) for self-organizing agent management, neural pattern recognition accelerated by WebAssembly SIMD, and a SQLite-based memory system for context retention and knowledge persistence across tasks. It automates development workflows via pre- and post-operation hooks, providing seamless coordination, code formatting, validation, and performance optimization.

Downloads: 6 This Week

Last Update: 2 days ago
24

x-transformers

A simple but complete full-attention transformer

A simple but complete full-attention transformer with a set of promising experimental features from various papers. One of these papers proposes adding learned memory key/values prior to attending; its authors were able to remove feedforwards altogether and attain performance similar to the original transformers. The project's author has found that keeping the feedforwards and adding the memory key/values leads to even better performance. Another paper proposes adding learned tokens, akin to CLS tokens, named memory tokens, that are passed through the attention layers alongside the input tokens. ...

Downloads: 4 This Week

Last Update: 2026-02-12
25

bitsandbytes

Accessible large language models via k-bit quantization for PyTorch

bitsandbytes is an open-source library designed to make training and inference of large neural networks more efficient by dramatically reducing memory usage. Built primarily for the PyTorch ecosystem, the library introduces advanced quantization techniques that allow models to operate using reduced numerical precision while maintaining high accuracy. These optimizations enable large language models and other deep learning architectures to run on hardware with limited memory resources, including consumer-grade GPUs. ...

Downloads: 3 This Week

Last Update: 2026-03-04
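
Reduced-precision storage of the kind bitsandbytes builds on can be shown with a generic absmax scheme: scale a group of weights so the largest magnitude maps to 127, round to integers, and multiply back when the values are needed. This is a simplified sketch and not the bitsandbytes API or its actual algorithms; the function names and sample weights are hypothetical.

```python
def quantize_absmax_int8(values):
    """Map floats to integers in [-127, 127] using one shared scale."""
    absmax = max(abs(v) for v in values)
    scale = absmax / 127 if absmax else 1.0
    quantized = [round(v / scale) for v in values]
    return quantized, scale

def dequantize(quantized, scale):
    """Recover approximate floats from the stored integers."""
    return [q * scale for q in quantized]

# Hypothetical weight group.
weights = [0.5, -1.27, 0.0, 1.0]

q_weights, scale = quantize_absmax_int8(weights)
restored = dequantize(q_weights, scale)

# Rounding error per value is at most half a quantization step.
max_error = max(abs(w - r) for w, r in zip(weights, restored))
```

Each value then occupies one byte instead of two or four, plus one shared scale per group, and the rounding error stays within half a step of size `scale`.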