Page 4 | memory free download

Showing 459 open source projects for "memory"

View related business solutions

Artificial Intelligence Linux Clear Filters & Widen Search

$300 Free Credits for Your Google Cloud Projects
Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial
Build Agents and Models on One Platform
Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.

Try It Free
1

Qwen-Agent

Agent framework and applications built upon Qwen>=3.0

Qwen-Agent is a framework for building applications / agents using Qwen models (version 3.0+). It provides components for instruction following, tool usage (function calling), planning, memory, RAG (retrieval augmented generation), code interpreter, etc. It ships with example applications (Browser Assistant, Code Interpreter, Custom Assistant), supports GUI front-ends, backends, server setups. Agent workflow can maintain context / memory to perform multi-turn or more complex logic over time. It acts as the backend for Qwen Chat among other use cases. ...

Downloads: 0 This Week

Last Update: 2025-09-23
See Project
2

Build Your Own OpenClaw

A step-by-step guide to build your own AI agent

Build Your Own OpenClaw is a step-by-step educational framework that teaches developers how to construct a fully functional AI agent system from scratch, gradually evolving from a simple chat loop into a multi-agent, production-ready architecture. The project is structured into 18 progressive stages, each introducing a new concept such as tool usage, memory persistence, event-driven design, and multi-agent coordination, with each step including both explanatory documentation and runnable code. It begins with foundational concepts like conversational loops and tool integration, then expands into more advanced capabilities such as dynamic skill loading, web interaction, and context management. ...

Downloads: 1 This Week

Last Update: 2026-06-03
See Project
3

OpenAI CS Agents Demo

Demo of a customer service use case implemented with the OpenAI Agents

...It also demonstrates guardrails to validate or constrain responses, memory usage to maintain context, and tracing to help debugging of workflows.

Downloads: 1 This Week

Last Update: 2025-12-11
See Project
4

Transformer Engine

A library for accelerating Transformer models on NVIDIA GPUs

...As the number of parameters in Transformer models continues to grow, training and inference for architectures such as BERT, GPT, and T5 become very memory and compute-intensive. Most deep learning frameworks train with FP32 by default. This is not essential, however, to achieve full accuracy for many deep learning models.

Downloads: 6 This Week

Last Update: 2026-06-09
See Project
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
5

yourself-skill

Instead of distilling others, it is better to distil yourself

...It encourages systems to maintain awareness of user preferences, goals, and communication styles. The project emphasizes building more human-aligned interactions by incorporating memory and contextual reasoning. It can be integrated into broader AI systems to improve personalization and continuity across sessions. The design focuses on enhancing user experience through adaptive responses. It is particularly useful for conversational agents and assistants. Overall, it contributes to more context-aware and user-centered AI systems.

Downloads: 2 This Week

Last Update: 2026-04-30
See Project
6

MiMoCode

Where Models and Agents Co-Evolve

...The tool includes multiple agent modes, including a build mode for development, a plan mode for read-only analysis, and a compose mode for structured workflows. Its persistent memory system stores project notes, checkpoints, scratch notes, and task progress so the assistant can resume work with context. It also supports subagents, goal checking, voice input, MCP connections, and custom provider configuration. MiMo-Code is useful for developers who want an autonomous coding assistant that combines terminal workflows, long-running task management, and project-aware memory.

Downloads: 0 This Week

Last Update: 2 days ago
See Project
7

OpenSquilla

Token-Efficient AI Agent with same budget, higher intelligence density

OpenSquilla is a token-efficient microkernel AI agent runtime designed for CLI, web UI, and chat-based workflows. It routes each turn through a shared loop that can select lower-cost models when appropriate while preserving tool dispatch, retries, memory, and decision logging. The project supports multiple LLM providers through a pluggable provider layer, making it adaptable to different model ecosystems. It includes persistent memory, built-in web search, on-device embeddings, and sandboxing for safer execution. OpenSquilla is designed for users who want stronger agent capabilities without wasting tokens on every interaction. ...

Downloads: 0 This Week

Last Update: 2026-06-03
See Project
8

ECC

The agent harness performance optimization system

ECC is an agent harness performance optimization system for AI coding tools such as Claude Code, Codex, Opencode, and similar environments. It packages rules, skills, instincts, memory behavior, security practices, and research-first development patterns into a structured framework. The project is designed to make coding agents more reliable by improving how they plan, inspect context, make changes, review work, and avoid unnecessary mistakes. ECC includes installation guidance and language-specific rule folders for applying the system across different development setups. ...

Downloads: 4 This Week

Last Update: 2026-06-10
See Project
9

Team9

Team9 is a collaborative workspace for AI agents

...It builds on agent frameworks like OpenClaw and introduces a managed environment where agents can be assigned roles, share context, and execute tasks collaboratively. The system emphasizes a “local-first” architecture, allowing agents to run on user-controlled infrastructure while maintaining persistent memory and data privacy. It includes orchestration mechanisms that allow agents to operate continuously through scheduled tasks, event-driven triggers, and long-running processes. The platform also integrates messaging gateways and communication channels, enabling agents to interact with users and systems in real time. Its design reflects a shift toward treating AI agents as operational units within organizations rather than isolated tools.

Downloads: 4 This Week

Last Update: 2026-06-10
See Project
Atera - an All-in-one platform for IT management
Ideal for IT departments and MSPs (managed service providers)

Your IT essentials, integrated & elevated. Take your IT management from automated to autonomous, download Atera's agent to start your free trial!

Try Atera now
10

Reflexion

Reflexion: Language Agents with Verbal Reinforcement Learning

...Instead of relying solely on a single-pass response, Reflexion enables agents to evaluate their own outputs, identify errors, and refine their reasoning over multiple iterations, leading to more accurate and reliable results. The framework introduces a mechanism where agents maintain a memory of past attempts and use that memory to guide future decisions, effectively simulating a learning process without requiring traditional model retraining. This approach is particularly useful for complex reasoning tasks, coding challenges, and decision-making scenarios where initial outputs may be incomplete or incorrect. Reflexion also emphasizes transparency by making intermediate reasoning steps explicit, allowing developers to inspect how conclusions are reached and where improvements occur.

Downloads: 0 This Week

Last Update: 2026-03-18
See Project
11

Gollama

Go manage your Ollama models

...Beyond standard model management, Gollama can display metadata such as size, quantization level, model family, and modification date, which helps users compare models quickly. One of its more distinctive capabilities is a VRAM estimation system that can calculate memory requirements, estimate context limits, and help users choose quantization settings that fit available hardware.

Downloads: 4 This Week

Last Update: 4 days ago
See Project
12

Cloudflare Agents

Build and deploy AI Agents on Cloudflare

...The project includes SDKs, templates, and deployment tooling that simplify the process of connecting agents to external APIs, storage systems, and workflows. Its architecture emphasizes persistent memory, enabling agents to maintain context across sessions and interactions. Developers can orchestrate complex behaviors using workflows and durable objects, making it suitable for production-grade autonomous systems. Overall, Cloudflare Agents aims to streamline the development of scalable AI automation that operates close to users for improved performance.

Downloads: 4 This Week

Last Update: 23 hours ago
See Project
13

Claude Cognitive

Persistent context and multi-instance coordination

Claude Cognitive is an advanced memory and context-management extension designed to address the stateless limitations of Claude Code by giving the model a form of persistent “working memory” and multi-instance coordination. It introduces an attention-based context router that prioritizes files and content relevant to the current development discussion — tagging them as HOT, WARM, or COLD based on recency and keyword activation — so Claude Code doesn’t waste token budget rereading irrelevant code. ...

Downloads: 0 This Week

Last Update: 2026-01-28
See Project
14

CodeGeeX2

CodeGeeX2: A More Powerful Multilingual Code Generation Model

...With improved inference efficiency, quantization options, and multi-query/flash attention, CodeGeeX2 achieves faster generation speeds and lightweight deployment, requiring as little as 6GB GPU memory at INT4 precision. Its backend powers the CodeGeeX IDE plugins for VS Code, JetBrains, and other editors, offering developers interactive AI assistance with features like infilling and cross-file completion.

Downloads: 10 This Week

Last Update: 7 days ago
See Project
15

claude-obsidian

Claude + Obsidian knowledge companion

...The system follows the LLM Wiki pattern, where information is stored as persistent markdown files that grow richer over time through cross-referencing and synthesis. It includes features such as contradiction detection, orphaned note identification, and automatic indexing. A persistent memory layer ensures continuity across sessions, eliminating the need for repeated context. It also performs autonomous research to fill knowledge gaps and expand the knowledge base. Overall, it turns note-taking into an active, compounding intelligence system.

Downloads: 2 This Week

Last Update: 2026-05-28
See Project
16

LLM Telegram Bot

A Telegram bot for Large Language Models

...The project is designed to provide a customizable AI assistant that can operate within Telegram conversations, supporting dynamic responses based on user input and configurable parameters. It includes features such as conversation memory, allowing the bot to maintain context across multiple messages and provide more coherent responses. The system supports multiple modes or personas, enabling users to switch between different conversational styles or use cases. It also allows fine-tuning of generation parameters such as temperature and token limits, giving users control over response behavior. ...

Downloads: 2 This Week

Last Update: 2026-04-20
See Project
17

METATRON

AI-powered penetration testing assistant using local LLM on linux

...It provides a structured system for task delegation, communication, and collaboration between agents. The framework emphasizes scalability, allowing multiple agents to work together on large or complex problems. It includes mechanisms for managing context, memory, and execution flow across tasks. METATRON is particularly useful for building advanced AI systems that require coordination rather than isolated responses. Its architecture supports modular expansion and integration with different models. Overall, it enables the creation of collaborative AI ecosystems.

Downloads: 1 This Week

Last Update: 2026-04-30
See Project
18

Memvid

Video-based AI memory library. Store millions of text chunks in MP4

Memvid encodes text chunks as QR codes within MP4 frames to build a portable “video memory” for AI systems. This innovative approach uses standard video containers and offers millisecond-level semantic search across large corpora with dramatically less storage than vector DBs. It's self-contained—no DB needed—and supports features like PDF indexing, chat integration, and cloud dashboards.

Downloads: 1 This Week

Last Update: 2026-05-27
See Project
19

PicoClaw

Ultra-Efficient AI Assistant in Go

PicoClaw is an ultra-lightweight, open-source personal AI assistant written in Go, architected from the ground up to operate with extremely low memory usage (under 10 MB) and fast boot times, making it suitable for inexpensive hardware platforms and embedded devices. Inspired by earlier AI assistant projects like “nanobot,” it was refactored to emphasize resource efficiency while still supporting meaningful AI-driven interactions such as conversational workflows, planning tasks, and automation. ...

Downloads: 6 This Week

Last Update: 2026-05-29
See Project
20

SAM 2

The repository provides code for running inference with SAM 2

...It retains the core promptable interface—accepting points, boxes, or masks—but incorporates architectural and training enhancements to produce higher-fidelity masks, better boundary adherence, and robustness to complex scenes. The updated model is optimized for faster inference and lower memory use, enabling real-time interactivity even on larger images or constrained hardware. SAM2 comes with pretrained weights and easy-to-use APIs, enabling developers and researchers to integrate promptable segmentation into annotation tools, vision pipelines, or downstream tasks. The project also includes scripts and notebooks to compare SAM2 against SAM on edge cases, benchmarks showing improvements, and evaluation suites to measure mask quality metrics like IoU and boundary error.

Downloads: 6 This Week

Last Update: 2025-10-06
See Project
21

Model Explorer

A modern model graph visualizer and debugger

Model Explorer is a visual tool for exploring, debugging, and optimizing ML models deployed on edge devices. Developed by Google AI Edge, it offers a browser-based interface to inspect layer-wise performance, memory usage, and inference timing of TensorFlow Lite and other supported models. It’s a powerful utility for developers optimizing models for constrained environments.

Downloads: 0 This Week

Last Update: 2026-02-09
See Project
22

Pro Workflow

Claude Code learns from your corrections: self-correcting memory

Pro Workflow is a productivity framework for Claude Code that introduces self-improving workflows through memory, context engineering, and structured agent orchestration. The system learns from user corrections over time, storing feedback and refining its behavior across sessions to improve accuracy and efficiency. It supports advanced development setups such as parallel worktrees, enabling multiple tasks to be handled simultaneously without interference.

Downloads: 0 This Week

Last Update: 2026-05-09
See Project
23

OpenSage

An agent framework that enables AI to create their own agent

...Unlike traditional agent frameworks that require developers to manually define workflows, tools, and structures, OpenSage introduces a system where large language models can dynamically generate their own agent architectures, including sub-agents, toolchains, and execution strategies. The framework is built around the concept of an Agent Development Kit (ADK), providing structured components for memory, reasoning, and task decomposition while allowing agents to iteratively improve their own design. A key innovation is its hierarchical and graph-based memory system, which enables agents to store, retrieve, and organize information across complex workflows with improved efficiency and contextual awareness.

Downloads: 0 This Week

Last Update: 2026-04-07
See Project
24

FlexLLMGen

Running large language models on a single GPU

...The system focuses on high-throughput generation workloads where large batches of text must be processed quickly, such as large-scale data extraction or document analysis tasks. Instead of requiring expensive multi-GPU systems, the framework uses techniques such as memory offloading, compression, and optimized batching to run large models on commodity hardware. The architecture distributes computation and memory usage across the GPU, CPU, and disk in order to maximize the number of tokens processed during inference. This design allows organizations to deploy powerful language models for high-volume tasks without the infrastructure costs typically associated with large-scale AI systems. ...

Downloads: 0 This Week

Last Update: 2026-03-10
See Project
25

uzu

A high-performance inference engine for AI models

...The engine implements a hybrid architecture in which model layers can be executed either as custom GPU kernels or through Apple’s MPSGraph API, allowing it to balance performance and compatibility depending on the workload. By utilizing Apple’s unified memory architecture, uzu reduces memory copying overhead and improves inference throughput for local AI workloads. The system includes a simple high-level API that enables developers to run models, create inference sessions, and generate outputs with minimal configuration.

Downloads: 0 This Week

Last Update: 2026-06-08
See Project