Showing 439 open source projects for "memory"

  • 1
    OpenHuman

    Your Personal AI super intelligence. Private, simple and powerful

    ...It focuses on a private, desktop-first experience with a friendly interface, onboarding flows, and a persistent assistant that can remember context over time. The project connects to common productivity tools, gathers fresh information from integrations, and organizes user knowledge into a local memory system. It also includes practical agent tools such as web search, web fetching, file access, coding utilities, voice input, text-to-speech, and model routing. Its goal is to make an AI assistant feel continuously useful across meetings, messages, documents, tasks, and personal workflows. Since it is still in early beta, it is best suited for technical users and early adopters who want to experiment with a customizable personal AI environment.
    Downloads: 60 This Week
  • 2
    Moltis

    A Rust-native claw you can trust

    ...The platform also includes long-term memory powered by hybrid vector and full-text search, allowing the assistant to retain context across sessions. With multi-channel access such as web UI, Telegram, and API endpoints, Moltis functions as a unified automation hub intended for developers and advanced users who want full control.
    Downloads: 13 This Week
  • 3
    Zep

    Zep: A long-term memory store for LLM / Chatbot applications

    Easily add relevant documents, chat history memory, and rich user data to your LLM app's prompts. Zep understands chat messages, roles, and user metadata, not just texts and embeddings. Zep Memory and VectorStore implementations are shipped with your favorite frameworks: LangChain, LangChain.js, LlamaIndex, and more. Automatically embed texts and messages using state-of-the-art open source models, OpenAI, or bring your own vectors.
    Downloads: 2 This Week
  • 4
    OpenSquilla

    Token-Efficient AI Agent with same budget, higher intelligence density

    OpenSquilla is a token-efficient microkernel AI agent runtime designed for CLI, web UI, and chat-based workflows. It routes each turn through a shared loop that can select lower-cost models when appropriate while preserving tool dispatch, retries, memory, and decision logging. The project supports multiple LLM providers through a pluggable provider layer, making it adaptable to different model ecosystems. It includes persistent memory, built-in web search, on-device embeddings, and sandboxing for safer execution. OpenSquilla is designed for users who want stronger agent capabilities without wasting tokens on every interaction. ...
    Downloads: 5 This Week
  • 5
    DeepSeek-Reasonix

    DeepSeek-native AI coding agent for your terminal

    ...Reasonix includes a coding mode with filesystem and shell tools, a lighter chat mode, one-shot task execution, health checks, session utilities, and project-scoped memory. It supports reviewed SEARCH/REPLACE edits, plan mode, MCP servers, web search, hooks, skills, semantic indexing, transcript replay, event logs, and cost or cache tracking. The project is especially useful for developers who want an open, terminal-first coding agent optimized for DeepSeek’s cache mechanics. It also includes a prerelease desktop client for users who prefer a GUI over the same agent loop.
    Downloads: 45 This Week
  • 6
    Lossless Claw

    LCM (Lossless Context Management) plugin for OpenClaw

    ...This structure enables agents to dynamically reconstruct detailed context by expanding summaries when needed, effectively simulating perfect long-term memory.
    Downloads: 5 This Week
  • 7
    Neuron AI

    The PHP Agentic Framework to build production-ready AI driven apps

    Neuron AI is a PHP agentic framework for building production-ready AI applications that connect models, memory, vector databases, and tools into working agents. It is designed for developers who want to create systems such as RAG pipelines, multi-agent workflows, and business process automations without having to hand-build every integration from scratch. The framework provides an Agent class that can be extended to inherit core capabilities like memory, tools, function calling, and retrieval-augmented generation. ...
    Downloads: 5 This Week
  • 8
    Hermes Web UI

    The best way to use Hermes Agent from the web or from your phone

    ...It offers a clean, multi-panel layout that includes chat interaction, session management, and workspace file browsing. The interface allows users to manage agent sessions, configure models, and interact with persistent memory systems directly from a web environment. It is built using simple technologies like Python and vanilla JavaScript, avoiding complex frontend frameworks. The UI supports real-time interaction, context tracking, and visualization of token usage. It connects to a self-hosted agent that continuously learns and evolves over time. The project emphasizes usability, accessibility, and seamless integration with existing workflows.
    Downloads: 25 This Week
  • 9
    ex-skill

    Distill your ex into an AI Skill

    ...The system works by ingesting various forms of personal data such as chat logs, social media content, photos, and user-provided descriptions, then structuring this information into a layered representation that combines memory and persona modeling. It is designed to run within Claude Code environments, where users can generate, manage, and interact with these personalized AI entities through command-based interfaces. The project emphasizes emotional realism by reconstructing conversational tone, habits, and contextual memories, enabling interactions that feel consistent with the original person.
    Downloads: 25 This Week
  • 10
    PicoLM

    Run a 1-billion parameter LLM on a $10 board with 256MB RAM

    PicoLM is an open-source inference framework designed to run large language models on extremely constrained hardware environments such as inexpensive single-board computers and embedded systems. The project focuses on enabling efficient local inference by optimizing memory usage, computation, and system dependencies so that relatively large models can operate on devices with minimal RAM. It is written primarily in C and designed with a minimalist architecture that removes unnecessary dependencies and external libraries. The runtime is capable of running language models with billions of parameters on devices with only a few hundred megabytes of memory, which is significantly lower than typical LLM infrastructure requirements. ...
    Downloads: 1 This Week
  • 11
    OpenClaw

    Your own personal AI assistant. Any OS. Any Platform.

    ...It lets you send instructions through familiar messaging platforms like WhatsApp, Telegram, Discord, Slack, Signal, iMessage, and more, and then interprets those instructions to carry out actions such as managing calendars, sending emails or messages, browsing the web, executing system commands, and coordinating workflows across services — all while maintaining long-term memory and context across sessions. Because it runs locally or on infrastructure you choose (like a personal computer, VPS, or Raspberry Pi), OpenClaw emphasizes data ownership, privacy, and full transparency into how your instructions are handled and what actions are taken, giving users autonomy over their AI workflows.
    Downloads: 156 This Week
  • 12
    NullClaw

    Fastest, smallest, and fully autonomous AI assistant infrastructure

    ...At just 678 KB with ~1 MB peak RAM usage, it boots in under 2 milliseconds and runs on virtually any hardware, including low-cost ARM boards. Despite its size, it delivers a complete AI stack with 22+ model providers, 18+ communication channels, integrated tools, hybrid memory, and sandboxed runtime support. Its architecture is fully modular, using vtable interfaces that allow providers, channels, tools, memory backends, and runtimes to be swapped without code changes. NullClaw is secure by design, enforcing pairing-based authentication, strict sandboxing, encrypted secrets, resource limits, and workspace scoping by default. ...
    Downloads: 17 This Week
  • 13
    MimiClaw

    Run OpenClaw on a $5 chip

    MimiClaw (from the mimiclaw project) is an edge-AI personal assistant that runs directly on extremely low-cost hardware like an ESP32-S3 microcontroller without a full operating system, Node.js, or cloud backend. By running pure C on a bare-metal chip, MimiClaw brings AI interactions and persistent memory to a tiny USB-powered device you can carry in your pocket. You connect the device to Wi-Fi and chat with it using Telegram, making it a convenient always-on assistant for tasks like reminders, quick lookups, or custom AI interactions. Even though it’s running on minimal hardware, MimiClaw maintains local memory that persists across power cycles, enabling context continuity over time without relying on cloud services. ...
    Downloads: 7 This Week
  • 14
    MCP Server Qdrant

    An official Qdrant Model Context Protocol (MCP) server implementation

    The Qdrant MCP Server is an official Model Context Protocol server that integrates with the Qdrant vector search engine. It acts as a semantic memory layer, allowing for the storage and retrieval of vector-based data, enhancing the capabilities of AI applications requiring semantic search functionalities.
    Downloads: 6 This Week
  • 15
    llmfit

    157 models, 30 providers, one command to find what runs on hardware

    llmfit is a terminal-based utility that helps developers determine which large language models can realistically run on their local hardware by analyzing system resources and model requirements. The tool automatically detects CPU, RAM, GPU, and VRAM specifications, then ranks available models based on performance factors such as speed, quality, and memory fit. It provides both an interactive terminal user interface and a traditional CLI mode, enabling flexible workflows for different user preferences. llmfit also supports advanced configurations including multi-GPU setups, mixture-of-experts architectures, and dynamic quantization recommendations. By presenting clear performance estimates and compatibility guidance, the project reduces the trial-and-error typically involved in local LLM experimentation. ...
    Downloads: 31 This Week
  • 16
    MiroFish

    A Simple and Universal Swarm Intelligence Engine

    ...The system extracts “seed” information from sources such as breaking news, policy documents, and market signals to construct a high-fidelity digital parallel world populated by thousands of virtual agents with independent memory and behavior rules. Users can inject variables or conditions into this simulated environment from a “god’s eye view,” enabling iterative prediction of future trends under different assumptions, which can be useful for decision support, scenario planning, or creative exploration. The engine includes both backend and frontend components, with configuration and deployment instructions for local and containerized setups, and is designed to produce detailed predictive reports based on interactions and emergent patterns within the simulated world.
    Downloads: 340 This Week
  • 17
    Personal AI Infrastructure

    Agentic AI Infrastructure for magnifying HUMAN capabilities

    ...Its architecture supports long-term memory, verification of actions, and ongoing self-improvement, blurring the line between “assistant” and persistent, evolving collaborator.
    Downloads: 3 This Week
  • 18
    Claude Cognitive

    Persistent context and multi-instance coordination

    Claude Cognitive is an advanced memory and context-management extension designed to address the stateless limitations of Claude Code by giving the model a form of persistent “working memory” and multi-instance coordination. It introduces an attention-based context router that prioritizes files and content relevant to the current development discussion — tagging them as HOT, WARM, or COLD based on recency and keyword activation — so Claude Code doesn’t waste token budget rereading irrelevant code. ...
    Downloads: 3 This Week
  • 19
    bitnet.cpp

    Official inference framework for 1-bit LLMs

    bitnet.cpp is the official open-source inference framework and ecosystem designed to enable ultra-efficient execution of 1-bit large language models (LLMs), which quantize most model parameters to ternary values (-1, 0, +1) while maintaining competitive performance with full-precision counterparts. At its core is bitnet.cpp, a highly optimized C++ backend that supports fast, low-memory inference on both CPUs and GPUs, enabling models such as BitNet b1.58 to run without requiring enormous compute infrastructure. The project’s focus on extreme quantization dramatically reduces memory footprint and energy consumption compared with traditional 16-bit or 32-bit LLMs, making it practical to deploy advanced language understanding and generation models on everyday machines. ...
    Downloads: 4 This Week
  • 20
    ncnn

    High-performance neural network inference framework for mobile

    ncnn is a high-performance neural network inference framework designed specifically for mobile platforms. It has no third-party dependencies and, according to the project, runs faster than all other known open source frameworks on mobile phone CPUs. ncnn allows developers to easily deploy deep learning models to mobile platforms and create intelligent apps. It is cross-platform and supports most commonly used CNN networks, including...
    Downloads: 64 This Week
  • 21
    Claw Code

    AI agent harness for AI coding agents

    ...It emphasizes harness engineering—how agents are structured, how they interact with tools, and how they maintain context during execution. The system is being actively expanded, with a Rust-based runtime in development to improve performance and memory safety. Overall, Claw Code serves as a research-driven platform for advancing agent-based software development systems.
    Downloads: 18 This Week
  • 22
    NanoClaw

    A lightweight alternative to Clawdbot / OpenClaw

    ...The project connects directly to WhatsApp, letting you deploy an assistant that can chat in a familiar interface while still supporting real agent behaviors instead of simple call-and-response prompts. It includes memory so the assistant can retain important context across interactions, enabling more consistent follow-through on ongoing tasks. It also supports scheduled jobs, making it suitable for recurring reminders, periodic automations, and timed workflows without needing an external orchestrator.
    Downloads: 25 This Week
  • 23
    R-KV

    Redundancy-aware KV Cache Compression for Reasoning Models

    ...Modern transformer models rely heavily on KV caches during autoregressive decoding, which store intermediate attention states to accelerate generation. However, these caches can consume large amounts of memory, especially in reasoning-oriented models with long context windows. R-KV introduces a method for compressing the KV cache during decoding, allowing models to maintain reasoning performance while reducing memory consumption and computational overhead. The approach focuses on identifying which attention heads and cache components are most important for maintaining reasoning quality, allowing less critical information to be compressed or discarded. ...
    Downloads: 0 This Week
  • 24
    AgentScope

    Build and run agents you can see, understand and trust

    ...It provides essential abstractions that evolve with advancing LLM capabilities, emphasizing reasoning, tool use, and flexible orchestration rather than rigid prompt constraints. With built-in support for ReAct agents, memory, planning, human-in-the-loop control, and real-time voice interaction, developers can create powerful agents in minutes. AgentScope integrates seamlessly with tools, long-term memory systems, MCP, A2A (Agent-to-Agent) protocols, and observability frameworks. It also supports reinforcement learning workflows for tuning agents and improving performance across complex tasks. ...
    Downloads: 5 This Week
  • 25
    LazyCodex

    The one and only agent harness for complex codebases

    LazyCodex is an agent harness for using Codex on complex software projects. It is designed to add structure around AI coding sessions through memory, planning, execution, verification, skills, hooks, routing, and diagnostics. The project helps developers move beyond one-off prompts by giving the agent a more organized workflow inside a codebase. It supports project memory so context can persist across sessions and decisions do not need to be repeatedly reintroduced. LazyCodex also emphasizes verified completion, which means the workflow is built around checking whether tasks are actually finished rather than only generating code. ...
    Downloads: 2 This Week