Page 3 | memory free download

Showing 439 open source projects for "memory"

1

OpenHuman

Your Personal AI super intelligence. Private, simple and powerful

...It focuses on a private, desktop-first experience with a friendly interface, onboarding flows, and a persistent assistant that can remember context over time. The project connects to common productivity tools, gathers fresh information from integrations, and organizes user knowledge into a local memory system. It also includes practical agent tools such as web search, web fetching, file access, coding utilities, voice input, text-to-speech, and model routing. Its goal is to make an AI assistant feel continuously useful across meetings, messages, documents, tasks, and personal workflows. Since it is still in early beta, it is best suited for technical users and early adopters who want to experiment with a customizable personal AI environment.

Downloads: 60 This Week

Last Update: 2 days ago
See Project
2

Moltis

A Rust-native claw you can trust

...The platform also includes long-term memory powered by hybrid vector and full-text search, allowing the assistant to retain context across sessions. With multi-channel access such as web UI, Telegram, and API endpoints, Moltis functions as a unified automation hub intended for developers and advanced users who want full control.
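As an illustration of what "hybrid vector and full-text search" can mean in practice, here is a minimal sketch of reciprocal rank fusion, one common way to merge a vector-similarity ranking with a keyword ranking. This is an assumption for illustration only: the listing does not say how Moltis combines its results, and the function name, the constant `k=60`, and the note IDs are all hypothetical.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Merge several ranked lists of document IDs into a single ranking.

    Each document earns 1 / (k + rank) from every list it appears in,
    so items ranked well by both searches rise to the top.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties are broken by ID to stay deterministic.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical results from a vector index and a full-text index.
vector_hits = ["note-7", "note-2", "note-9"]
keyword_hits = ["note-2", "note-4", "note-7"]
fused = reciprocal_rank_fusion([vector_hits, keyword_hits])
print(fused)
```

Documents found by both searches ("note-2" and "note-7") outrank those found by only one, which is the usual motivation for a hybrid memory lookup.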

Downloads: 13 This Week

Last Update: 2026-06-04
See Project
3

Zep

Zep: A long-term memory store for LLM / Chatbot applications

Easily add relevant documents, chat history memory, and rich user data to your LLM app's prompts. Zep understands chat messages, roles, and user metadata, not just text and embeddings. Zep Memory and VectorStore implementations ship with popular frameworks: LangChain, LangChain.js, LlamaIndex, and more. Automatically embed texts and messages using state-of-the-art open-source models or OpenAI, or bring your own vectors.

Downloads: 2 This Week

Last Update: 2025-09-11
See Project
4

OpenSquilla

Token-Efficient AI Agent with same budget, higher intelligence density

OpenSquilla is a token-efficient microkernel AI agent runtime designed for CLI, web UI, and chat-based workflows. It routes each turn through a shared loop that can select lower-cost models when appropriate while preserving tool dispatch, retries, memory, and decision logging. The project supports multiple LLM providers through a pluggable provider layer, making it adaptable to different model ecosystems. It includes persistent memory, built-in web search, on-device embeddings, and sandboxing for safer execution. OpenSquilla is designed for users who want stronger agent capabilities without wasting tokens on every interaction. ...

Downloads: 5 This Week

Last Update: 2026-06-03
See Project
5

DeepSeek-Reasonix

DeepSeek-native AI coding agent for your terminal

...Reasonix includes a coding mode with filesystem and shell tools, a lighter chat mode, one-shot task execution, health checks, session utilities, and project-scoped memory. It supports reviewed SEARCH/REPLACE edits, plan mode, MCP servers, web search, hooks, skills, semantic indexing, transcript replay, event logs, and cost or cache tracking. The project is especially useful for developers who want an open, terminal-first coding agent optimized for DeepSeek’s cache mechanics. It also includes a prerelease desktop client for users who prefer a GUI over the same agent loop.

Downloads: 45 This Week

Last Update: 2 days ago
See Project
6

Lossless Claw

LCM (Lossless Context Management) plugin for OpenClaw

...This structure enables agents to dynamically reconstruct detailed context by expanding summaries when needed, effectively simulating perfect long-term memory.

Downloads: 5 This Week

Last Update: 13 hours ago
See Project
7

Neuron AI

The PHP Agentic Framework to build production-ready AI driven apps

Neuron AI is a PHP agentic framework for building production-ready AI applications that connect models, memory, vector databases, and tools into working agents. It is designed for developers who want to create systems such as RAG pipelines, multi-agent workflows, and business process automations without having to hand-build every integration from scratch. The framework provides an Agent class that can be extended to inherit core capabilities like memory, tools, function calling, and retrieval-augmented generation. ...

Downloads: 5 This Week

Last Update: 3 days ago
See Project
8

Hermes Web UI

The best way to use Hermes Agent from the web or from your phone

...It offers a clean, multi-panel layout that includes chat interaction, session management, and workspace file browsing. The interface allows users to manage agent sessions, configure models, and interact with persistent memory systems directly from a web environment. It is built using simple technologies like Python and vanilla JavaScript, avoiding complex frontend frameworks. The UI supports real-time interaction, context tracking, and visualization of token usage. It connects to a self-hosted agent that continuously learns and evolves over time. The project emphasizes usability, accessibility, and seamless integration with existing workflows.

Downloads: 25 This Week

Last Update: 2 hours ago
See Project
9

ex-skill

Distill your ex into an AI Skill

...The system works by ingesting various forms of personal data such as chat logs, social media content, photos, and user-provided descriptions, then structuring this information into a layered representation that combines memory and persona modeling. It is designed to run within Claude Code environments, where users can generate, manage, and interact with these personalized AI entities through command-based interfaces. The project emphasizes emotional realism by reconstructing conversational tone, habits, and contextual memories, enabling interactions that feel consistent with the original person.

Downloads: 25 This Week

Last Update: 2026-04-10
See Project
10

PicoLM

Run a 1-billion parameter LLM on a $10 board with 256MB RAM

PicoLM is an open-source inference framework designed to run large language models on extremely constrained hardware environments such as inexpensive single-board computers and embedded systems. The project focuses on enabling efficient local inference by optimizing memory usage, computation, and system dependencies so that relatively large models can operate on devices with minimal RAM. It is written primarily in C and designed with a minimalist architecture that removes unnecessary dependencies and external libraries. The runtime is capable of running language models with billions of parameters on devices with only a few hundred megabytes of memory, which is significantly lower than typical LLM infrastructure requirements. ...

Downloads: 1 This Week

Last Update: 2026-03-09
See Project
11

OpenClaw

Your own personal AI assistant. Any OS. Any Platform.

...It lets you send instructions through familiar messaging platforms like WhatsApp, Telegram, Discord, Slack, Signal, iMessage, and more, and then interprets those instructions to carry out actions such as managing calendars, sending emails or messages, browsing the web, executing system commands, and coordinating workflows across services — all while maintaining long-term memory and context across sessions. Because it runs locally or on infrastructure you choose (like a personal computer, VPS, or Raspberry Pi), OpenClaw emphasizes data ownership, privacy, and full transparency into how your instructions are handled and what actions are taken, giving users autonomy over their AI workflows.

1 Review

Downloads: 156 This Week

Last Update: 3 days ago
See Project
12

NullClaw

Fastest, smallest, and fully autonomous AI assistant infrastructure

...At just 678 KB with ~1 MB peak RAM usage, it boots in under 2 milliseconds and runs on virtually any hardware, including low-cost ARM boards. Despite its size, it delivers a complete AI stack with 22+ model providers, 18+ communication channels, integrated tools, hybrid memory, and sandboxed runtime support. Its architecture is fully modular, using vtable interfaces that allow providers, channels, tools, memory backends, and runtimes to be swapped without code changes. NullClaw is secure by design, enforcing pairing-based authentication, strict sandboxing, encrypted secrets, resource limits, and workspace scoping by default. ...

Downloads: 17 This Week

Last Update: 2026-05-29
See Project
13

MimiClaw

Run OpenClaw on a $5 chip

MimiClaw (from the mimiclaw project) is an edge-AI personal assistant that runs directly on extremely low-cost hardware like an ESP32-S3 microcontroller without a full operating system, Node.js, or cloud backend. By running pure C on a bare-metal chip, MimiClaw brings AI interactions and persistent memory to a tiny USB-powered device you can carry in your pocket. You connect the device to Wi-Fi and chat with it using Telegram, making it a convenient always-on assistant for tasks like reminders, quick lookups, or custom AI interactions. Even though it’s running on minimal hardware, MimiClaw maintains local memory that persists across power cycles, enabling context continuity over time without relying on cloud services. ...

Downloads: 7 This Week

Last Update: 2026-03-17
See Project
14

MCP Server Qdrant

An official Qdrant Model Context Protocol (MCP) server implementation

The Qdrant MCP Server is an official Model Context Protocol server that integrates with the Qdrant vector search engine. It acts as a semantic memory layer, allowing for the storage and retrieval of vector-based data, enhancing the capabilities of AI applications requiring semantic search functionalities.

Downloads: 6 This Week

Last Update: 2025-12-10
See Project
15

llmfit

157 models, 30 providers, one command to find what runs on hardware

llmfit is a terminal-based utility that helps developers determine which large language models can realistically run on their local hardware by analyzing system resources and model requirements. The tool automatically detects CPU, RAM, GPU, and VRAM specifications, then ranks available models based on performance factors such as speed, quality, and memory fit. It provides both an interactive terminal user interface and a traditional CLI mode, enabling flexible workflows for different user preferences. llmfit also supports advanced configurations including multi-GPU setups, mixture-of-experts architectures, and dynamic quantization recommendations. By presenting clear performance estimates and compatibility guidance, the project reduces the trial-and-error typically involved in local LLM experimentation. ...
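The "memory fit" idea can be approximated with simple arithmetic: model weights take roughly parameters × bits per weight ÷ 8 bytes. The sketch below is a back-of-the-envelope estimate under that assumption, not llmfit's actual scoring; the function names and the 1.2× overhead factor (a stand-in for KV cache and runtime buffers) are hypothetical.

```python
GIB = 1024 ** 3


def estimate_weight_gib(params_billion, bits_per_weight):
    """Approximate size of the model weights alone, in GiB."""
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / GIB


def fits(params_billion, bits_per_weight, available_gib, overhead=1.2):
    """Rough check: do the weights plus an assumed overhead fit in memory?"""
    needed = estimate_weight_gib(params_billion, bits_per_weight) * overhead
    return needed <= available_gib


# A 7B-parameter model on a machine with 8 GiB of usable memory.
print(round(estimate_weight_gib(7, 4), 2), fits(7, 4, 8))    # 4-bit quantized
print(round(estimate_weight_gib(7, 16), 2), fits(7, 16, 8))  # 16-bit weights
```

Real requirements also depend on context length, batch size, and the inference runtime, so an estimate like this is only a first filter.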

Downloads: 31 This Week

Last Update: 5 days ago
See Project
16

MiroFish

A Simple and Universal Swarm Intelligence Engine

...The system extracts “seed” information from sources such as breaking news, policy documents, and market signals to construct a high-fidelity digital parallel world populated by thousands of virtual agents with independent memory and behavior rules. Users can inject variables or conditions into this simulated environment from a “god’s eye view,” enabling iterative prediction of future trends under different assumptions, which can be useful for decision support, scenario planning, or creative exploration. The engine includes both backend and frontend components, with configuration and deployment instructions for local and containerized setups, and is designed to produce detailed predictive reports based on interactions and emergent patterns within the simulated world.

Downloads: 340 This Week

Last Update: 2026-03-05
See Project
17

Personal AI Infrastructure

Agentic AI Infrastructure for magnifying HUMAN capabilities

...Its architecture supports long-term memory, verification of actions, and ongoing self-improvement, blurring the line between “assistant” and persistent, evolving collaborator.

Downloads: 3 This Week

Last Update: 2026-04-30
See Project
18

Claude Cognitive

Persistent context and multi-instance coordination

Claude Cognitive is an advanced memory and context-management extension designed to address the stateless limitations of Claude Code by giving the model a form of persistent “working memory” and multi-instance coordination. It introduces an attention-based context router that prioritizes files and content relevant to the current development discussion — tagging them as HOT, WARM, or COLD based on recency and keyword activation — so Claude Code doesn’t waste token budget rereading irrelevant code. ...
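A minimal sketch of how a HOT/WARM/COLD tier could be derived from recency and keyword activation is shown below. It only illustrates the idea described above; the function name, the turn-based thresholds, and the rule that any keyword match promotes a file to HOT are assumptions, not the project's actual router logic.

```python
def classify(turns_since_touched, keyword_hits, hot_within=2, warm_within=10):
    """Assign a context tier to a file.

    turns_since_touched: conversation turns since the file was last referenced.
    keyword_hits: how many of the file's keywords appear in the current message.
    The thresholds are hypothetical defaults chosen for illustration.
    """
    if keyword_hits > 0 or turns_since_touched <= hot_within:
        return "HOT"   # include the full file in context
    if turns_since_touched <= warm_within:
        return "WARM"  # include only a summary or header
    return "COLD"      # leave out of context entirely


# Hypothetical files: (turns since last touched, keyword matches this turn).
files = {"auth.py": (1, 0), "db.py": (6, 0), "legacy.py": (40, 0), "api.py": (40, 2)}
tiers = {name: classify(*state) for name, state in files.items()}
print(tiers)
```

Under these assumptions, a long-untouched file such as `api.py` is promoted back to HOT as soon as the discussion mentions one of its keywords.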

Downloads: 3 This Week

Last Update: 2026-01-28
See Project
19

bitnet.cpp

Official inference framework for 1-bit LLMs

bitnet.cpp is the official open-source inference framework and ecosystem designed to enable ultra-efficient execution of 1-bit large language models (LLMs), which quantize most model parameters to ternary values (-1, 0, +1) while maintaining competitive performance with full-precision counterparts. At its core is bitnet.cpp, a highly optimized C++ backend that supports fast, low-memory inference on both CPUs and GPUs, enabling models such as BitNet b1.58 to run without requiring enormous compute infrastructure. The project’s focus on extreme quantization dramatically reduces memory footprint and energy consumption compared with traditional 16-bit or 32-bit LLMs, making it practical to deploy advanced language understanding and generation models on everyday machines. ...
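To make the ternary idea concrete, the sketch below quantizes a small weight vector to values in {-1, 0, +1} using absmean scaling (divide by the mean absolute weight, round, then clip). It is a plain-Python illustration under that assumption, not bitnet.cpp's optimized kernels, and the function name is hypothetical. Three possible values carry log2(3) ≈ 1.58 bits of information, which is where the "b1.58" in the model name comes from.

```python
def ternarize(weights, eps=1e-8):
    """Quantize weights to {-1, 0, +1} with a single shared scale.

    The scale is the mean absolute weight; each weight is divided by it,
    rounded to the nearest integer, and clipped to the range [-1, 1].
    """
    scale = sum(abs(w) for w in weights) / len(weights)
    quantized = [max(-1, min(1, round(w / (scale + eps)))) for w in weights]
    return quantized, scale


weights = [0.9, -0.05, -1.2, 0.3]
q, scale = ternarize(weights)
print(q, scale)

# Approximate reconstruction: each ternary value times the shared scale.
approx = [v * scale for v in q]
```

Because every stored weight is one of three values, multiplications in a matrix product reduce to additions, subtractions, and skips, which is a large part of the memory and energy saving described above.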

Downloads: 4 This Week

Last Update: 2026-03-10
See Project
20

ncnn

High-performance neural network inference framework for mobile

ncnn is a high-performance neural network inference framework designed specifically for mobile platforms. It has no third-party dependencies, and the project describes it as faster than all other known open source frameworks on mobile phone CPUs. ncnn allows developers to easily deploy deep learning models to mobile platforms and build intelligent apps. It is cross-platform and supports most commonly used CNN networks, including...

Downloads: 64 This Week

Last Update: 2026-05-27
See Project
21

Claw Code

AI agent harness for AI coding agents

...It emphasizes harness engineering—how agents are structured, how they interact with tools, and how they maintain context during execution. The system is being actively expanded, with a Rust-based runtime in development to improve performance and memory safety. Overall, Claw Code serves as a research-driven platform for advancing agent-based software development systems.

Downloads: 18 This Week

Last Update: 7 days ago
See Project
22

NanoClaw

A lightweight alternative to Clawdbot / OpenClaw

...The project connects directly to WhatsApp, letting you deploy an assistant that can chat in a familiar interface while still supporting real agent behaviors instead of simple call-and-response prompts. It includes memory so the assistant can retain important context across interactions, enabling more consistent follow-through on ongoing tasks. It also supports scheduled jobs, making it suitable for recurring reminders, periodic automations, and timed workflows without needing an external orchestrator.

Downloads: 25 This Week

Last Update: 2026-05-18
See Project
23

R-KV

Redundancy-aware KV Cache Compression for Reasoning Models

...Modern transformer models rely heavily on KV caches during autoregressive decoding, which store intermediate attention states to accelerate generation. However, these caches can consume large amounts of memory, especially in reasoning-oriented models with long context windows. R-KV introduces a method for compressing the KV cache during decoding, allowing models to maintain reasoning performance while reducing memory consumption and computational overhead. The approach focuses on identifying which attention heads and cache components are most important for maintaining reasoning quality, allowing less critical information to be compressed or discarded. ...

Downloads: 0 This Week

Last Update: 2026-03-09
See Project
24

AgentScope

Build and run agents you can see, understand and trust

...It provides essential abstractions that evolve with advancing LLM capabilities, emphasizing reasoning, tool use, and flexible orchestration rather than rigid prompt constraints. With built-in support for ReAct agents, memory, planning, human-in-the-loop control, and real-time voice interaction, developers can create powerful agents in minutes. AgentScope integrates seamlessly with tools, long-term memory systems, MCP, A2A (Agent-to-Agent) protocols, and observability frameworks. It also supports reinforcement learning workflows for tuning agents and improving performance across complex tasks. ...

Downloads: 5 This Week

Last Update: 2026-06-05
See Project
25

LazyCodex

The one and only agent harness for complex codebases

LazyCodex is an agent harness for using Codex on complex software projects. It is designed to add structure around AI coding sessions through memory, planning, execution, verification, skills, hooks, routing, and diagnostics. The project helps developers move beyond one-off prompts by giving the agent a more organized workflow inside a codebase. It supports project memory so context can persist across sessions and decisions do not need to be repeatedly reintroduced. LazyCodex also emphasizes verified completion, which means the workflow is built around checking whether tasks are actually finished rather than only generating code. ...

Downloads: 2 This Week

Last Update: 11 hours ago
See Project