Page 2 | memory free download

Showing 248 open source projects for "memory"

1

Koila

Prevent PyTorch's `CUDA error: out of memory` in just 1 line of code

Koila is a lightweight Python library designed to help developers avoid memory errors when training deep learning models with PyTorch. The library introduces a lazy evaluation mechanism that delays computation until it is actually required, allowing the framework to better estimate the memory requirements of a model before execution. By building a computational graph first and executing operations only when necessary, Koila reduces the risk of running out of GPU memory during the forward pass of neural network training. ...

Downloads: 0 This Week

Last Update: 2026-05-26
See Project
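
The lazy evaluation idea in the Koila description can be illustrated with a minimal sketch. It is not Koila's API: `LazyTensor`, `lazy_add`, and the 4-bytes-per-element figure are hypothetical names and assumptions chosen for illustration, and plain Python lists stand in for tensors. The point is only that a result's memory footprint can be estimated from its shape before any computation runs.

```python
from math import prod


class LazyTensor:
    """Hypothetical stand-in for a lazily evaluated tensor (not Koila's API)."""

    def __init__(self, shape, compute, bytes_per_element=4):
        self.shape = tuple(shape)
        self._compute = compute  # zero-argument function that produces the data
        self.bytes_per_element = bytes_per_element  # assumed float32-sized elements
        self._value = None
        self.evaluated = False

    def estimated_bytes(self):
        # Memory needed for the result, known from the shape alone.
        return prod(self.shape) * self.bytes_per_element

    def run(self):
        # Execute the deferred computation once and cache the result.
        if not self.evaluated:
            self._value = self._compute()
            self.evaluated = True
        return self._value


def lazy_add(a, b):
    """Record an elementwise addition without performing it."""
    if a.shape != b.shape:
        raise ValueError("shapes must match")
    return LazyTensor(a.shape, lambda: [x + y for x, y in zip(a.run(), b.run())])


a = LazyTensor((3,), lambda: [1.0, 2.0, 3.0])
b = LazyTensor((3,), lambda: [4.0, 5.0, 6.0])
c = lazy_add(a, b)

print(c.estimated_bytes())  # size estimate is available before any computation
print(c.evaluated)          # nothing has been computed yet
print(c.run())              # the addition happens only here
```

A real implementation would use the estimate to decide, for example, whether to split a batch before running it; that decision logic is omitted here.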
2

xLSTM

Neural Network architecture based on ideas of the original LSTM

xLSTM is an open-source machine learning architecture that reimagines the classic Long Short-Term Memory (LSTM) network for modern large-scale language modeling and sequence processing tasks. The project introduces a new recurrent neural network design that incorporates exponential gating mechanisms and enhanced memory structures to overcome limitations of traditional LSTM models. By introducing innovations such as matrix-based memory and improved normalization techniques, xLSTM improves the ability of recurrent networks to capture long-range dependencies in sequential data. ...

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
3

CoPaw

Your Personal AI Assistant; easy to install, deploy on local or cloud

...It includes a browser-based Console for chatting, configuring models, managing memory, and extending capabilities with custom skills. With built-in cron scheduling, heartbeat check-ins, and extensible skill loading, CoPaw grows with your workflow over time. Easy installation options—including pip, one-line scripts, Docker, and cloud deployment—make it accessible for both developers and non-technical users.

1 Review

Downloads: 27 This Week

Last Update: 4 days ago
See Project
4

Memobase

Fast backend for long-term AI user memory via structured profiles

Memobase is an open source backend system that enables long-term user memory functionality for AI applications by capturing and structuring information about users across interactions. Its design centers on creating user profiles and recording event timelines, allowing AI systems to remember, understand, and evolve in their behaviour toward individual users over time. Instead of relying purely on traditional embedding-based retrieval or RAG systems, Memobase uses profile and timeline structures to deliver memory that reflects user context efficiently and meaningfully. ...

Downloads: 1 This Week

Last Update: 2 hours ago
See Project
5

OpenAI Agents (Python)

A lightweight, powerful framework for multi-agent workflows

openai-agents-python is a library developed by OpenAI to simplify the process of creating and running agents that interact with tools and APIs using OpenAI models. It provides abstractions for tool usage, memory management, and agent workflows, enabling developers to define function-calling agents that reason through multi-step tasks. Ideal for building custom AI workflows, the library supports dynamic tool definitions and contextual memory handling.

Downloads: 3 This Week

Last Update: 4 days ago
See Project
6

Agno

Lightweight framework for building Agents with memory, knowledge, etc.

Agno is a modular, open-source artificial general intelligence (AGI) research platform that allows developers to build, evaluate, and experiment with cognitive architectures in a composable way. It provides a flexible framework for modeling reasoning, memory, decision-making, and planning, aimed at long-term AI research beyond narrow learning. Agno embraces multi-agent environments and symbolic reasoning as part of its core design, enabling experiments with structured knowledge, goal-oriented behaviors, and meta-learning. It’s designed for researchers seeking an extensible platform to explore AGI components without being tied to black-box models.

Downloads: 6 This Week

Last Update: 3 days ago
See Project
7

Memary

The Open Source Memory Layer For Autonomous Agents

Memary is a journaling and personal memory management application that helps users record and retrieve past experiences. It focuses on simplicity, ease of use, and structured data storage for personal reflections and knowledge tracking.

Downloads: 3 This Week

Last Update: 2025-02-20
See Project
8

Qwen-Agent

Agent framework and applications built upon Qwen>=3.0

Qwen-Agent is a framework for building applications and agents using Qwen models (version 3.0+). It provides components for instruction following, tool usage (function calling), planning, memory, RAG (retrieval-augmented generation), and a code interpreter. It ships with example applications (Browser Assistant, Code Interpreter, Custom Assistant) and supports GUI front ends, back ends, and server setups. Agent workflows can maintain context and memory to carry out multi-turn or more complex logic over time. It acts as the backend for Qwen Chat, among other use cases. ...

Downloads: 3 This Week

Last Update: 2025-09-23
See Project
9

OpenSquilla

Token-Efficient AI Agent with same budget, higher intelligence density

OpenSquilla is a token-efficient microkernel AI agent runtime designed for CLI, web UI, and chat-based workflows. It routes each turn through a shared loop that can select lower-cost models when appropriate while preserving tool dispatch, retries, memory, and decision logging. The project supports multiple LLM providers through a pluggable provider layer, making it adaptable to different model ecosystems. It includes persistent memory, built-in web search, on-device embeddings, and sandboxing for safer execution. OpenSquilla is designed for users who want stronger agent capabilities without wasting tokens on every interaction. ...

Downloads: 4 This Week

Last Update: 2026-06-03
See Project
10

Hermes Web UI

The best way to use Hermes Agent from the web or from your phone

...It offers a clean, multi-panel layout that includes chat interaction, session management, and workspace file browsing. The interface allows users to manage agent sessions, configure models, and interact with persistent memory systems directly from a web environment. It is built using simple technologies like Python and vanilla JavaScript, avoiding complex frontend frameworks. The UI supports real-time interaction, context tracking, and visualization of token usage. It connects to a self-hosted agent that continuously learns and evolves over time. The project emphasizes usability, accessibility, and seamless integration with existing workflows.

Downloads: 24 This Week

Last Update: 9 hours ago
See Project
11

Claude Cognitive

Persistent context and multi-instance coordination

Claude Cognitive is an advanced memory and context-management extension designed to address the stateless limitations of Claude Code by giving the model a form of persistent “working memory” and multi-instance coordination. It introduces an attention-based context router that prioritizes files and content relevant to the current development discussion — tagging them as HOT, WARM, or COLD based on recency and keyword activation — so Claude Code doesn’t waste token budget rereading irrelevant code. ...

Downloads: 3 This Week

Last Update: 2026-01-28
See Project
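
The HOT/WARM/COLD tagging described for Claude Cognitive can be sketched as a small classifier over recency and keyword activation. This is an illustrative assumption, not the project's code: the function name `classify_file`, the turn-based notion of recency, and the thresholds `hot_within` and `warm_within` are all hypothetical.

```python
def classify_file(keywords, last_touched_turn, current_turn, message,
                  hot_within=2, warm_within=10):
    """Tag a file as HOT, WARM, or COLD (hypothetical rules and thresholds).

    keywords          -- words associated with the file
    last_touched_turn -- conversation turn at which the file was last relevant
    current_turn      -- the current conversation turn
    message           -- the latest user message
    """
    # Simple whitespace tokenization; punctuation handling is omitted.
    words = set(message.lower().split())
    activated = any(k.lower() in words for k in keywords)
    age = current_turn - last_touched_turn

    if activated or age <= hot_within:
        return "HOT"   # mentioned just now, or touched very recently
    if age <= warm_within:
        return "WARM"  # touched fairly recently
    return "COLD"      # stale and not mentioned


print(classify_file(["auth"], 0, 20, "fix the auth bug"))
print(classify_file(["auth"], 15, 20, "update readme"))
print(classify_file(["auth"], 0, 20, "update readme"))
```

Under these assumed rules, a router would include HOT files in full, summarize WARM ones, and leave COLD ones out of the context.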
12

Claw Code

AI agent harness for AI coding agents

...It emphasizes harness engineering—how agents are structured, how they interact with tools, and how they maintain context during execution. The system is being actively expanded, with a Rust-based runtime in development to improve performance and memory safety. Overall, Claw Code serves as a research-driven platform for advancing agent-based software development systems.

Downloads: 20 This Week

Last Update: 2026-06-08
See Project
13

bitnet.cpp

Official inference framework for 1-bit LLMs

bitnet.cpp is the official open-source inference framework for 1-bit large language models (LLMs), which quantize most model parameters to ternary values (-1, 0, +1) while maintaining competitive performance with full-precision counterparts. Its core is a highly optimized C++ backend that supports fast, low-memory inference on both CPUs and GPUs, enabling models such as BitNet b1.58 to run without requiring enormous compute infrastructure. The project’s focus on extreme quantization dramatically reduces memory footprint and energy consumption compared with traditional 16-bit or 32-bit LLMs, making it practical to deploy advanced language understanding and generation models on everyday machines. ...

Downloads: 4 This Week

Last Update: 2026-03-10
See Project
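
To make the ternary values (-1, 0, +1) mentioned in the bitnet.cpp description concrete, here is a minimal sketch of one way to map floating-point weights onto them. It assumes an absmean-style scheme (divide by the mean absolute weight, round, clip to [-1, 1]); the function name `ternary_quantize` is hypothetical, and bitnet.cpp's actual kernels and storage format are not shown in this listing.

```python
def ternary_quantize(weights):
    """Map weights to {-1, 0, +1} plus one scale factor (illustrative sketch)."""
    if not weights:
        return [], 0.0
    # Scale by the mean absolute value of the weights (assumed scheme).
    scale = sum(abs(w) for w in weights) / len(weights)
    if scale == 0:
        return [0] * len(weights), 0.0
    # Round to the nearest integer, then clip into the ternary range.
    quantized = [max(-1, min(1, round(w / scale))) for w in weights]
    return quantized, scale


q, scale = ternary_quantize([0.9, -0.05, -1.2, 0.3])
print(q, scale)
```

Each ternary weight carries about 1.58 bits of information (log2 of 3), which is where the name "b1.58" comes from; multiplying the ternary values by `scale` gives an approximation of the original weights.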
14

ex-skill

Distill your ex into an AI Skill

...The system works by ingesting various forms of personal data such as chat logs, social media content, photos, and user-provided descriptions, then structuring this information into a layered representation that combines memory and persona modeling. It is designed to run within Claude Code environments, where users can generate, manage, and interact with these personalized AI entities through command-based interfaces. The project emphasizes emotional realism by reconstructing conversational tone, habits, and contextual memories, enabling interactions that feel consistent with the original person.

Downloads: 18 This Week

Last Update: 2026-04-10
See Project
15

MiroFish

A Simple and Universal Swarm Intelligence Engine

...The system extracts “seed” information from sources such as breaking news, policy documents, and market signals to construct a high-fidelity digital parallel world populated by thousands of virtual agents with independent memory and behavior rules. Users can inject variables or conditions into this simulated environment from a “god’s eye view,” enabling iterative prediction of future trends under different assumptions, which can be useful for decision support, scenario planning, or creative exploration. The engine includes both backend and frontend components, with configuration and deployment instructions for local and containerized setups, and is designed to produce detailed predictive reports based on interactions and emergent patterns within the simulated world.

Downloads: 304 This Week

Last Update: 2026-03-05
See Project
16

R-KV

Redundancy-aware KV Cache Compression for Reasoning Models

...Modern transformer models rely heavily on KV caches during autoregressive decoding, which store intermediate attention states to accelerate generation. However, these caches can consume large amounts of memory, especially in reasoning-oriented models with long context windows. R-KV introduces a method for compressing the KV cache during decoding, allowing models to maintain reasoning performance while reducing memory consumption and computational overhead. The approach focuses on identifying which attention heads and cache components are most important for maintaining reasoning quality, allowing less critical information to be compressed or discarded. ...

Downloads: 0 This Week

Last Update: 2026-03-09
See Project
17

AgentScope

Build and run agents you can see, understand and trust

...It provides essential abstractions that evolve with advancing LLM capabilities, emphasizing reasoning, tool use, and flexible orchestration rather than rigid prompt constraints. With built-in support for ReAct agents, memory, planning, human-in-the-loop control, and real-time voice interaction, developers can create powerful agents in minutes. AgentScope integrates seamlessly with tools, long-term memory systems, MCP, A2A (Agent-to-Agent) protocols, and observability frameworks. It also supports reinforcement learning workflows for tuning agents and improving performance across complex tasks. ...

Downloads: 5 This Week

Last Update: 2026-06-05
See Project
18

Vibe-Trading

Vibe-Trading: Your Personal Trading Agent

...It features a swarm-based architecture with prebuilt expert agent teams for research, trading, and risk management. Advanced backtesting engines provide statistical validation, optimization, and performance metrics. The system also includes persistent memory, enabling it to learn from past interactions and refine strategies over time. Overall, it delivers an end-to-end AI-driven trading environment for both research and execution.

Downloads: 16 This Week

Last Update: 2026-06-01
See Project
19

Harmonist

Portable AI agent orchestration with mechanical protocol enforcement

...It is designed to make agent workflows more reliable by enforcing protocol rules mechanically instead of trusting prompts alone. The framework includes a catalog of specialized agents, validated memory behavior, supply-chain checks, and hooks that gate code-changing turns. If required reviewers do not run, memory is not updated, or shipped files fail integrity checks, Harmonist can block the workflow from completing. The project uses Python, has no runtime dependencies beyond the standard library, and is positioned as a drop-in agent coordination pack. ...

Downloads: 2 This Week

Last Update: 5 days ago
See Project
20

xFormers

Hackable and optimized Transformers building blocks

...One of its key goals is efficient attention: it supports dense, sparse, low-rank, and approximate attention mechanisms (e.g. FlashAttention, Linformer, Performer) via interchangeable modules. The library includes memory-efficient operator implementations in both Python and optimized C++/CUDA, ensuring that performance isn’t sacrificed for modularity. It also integrates with PyTorch seamlessly so you can drop in its blocks to existing models, replace default attention layers, or build new architectures from scratch. xformers includes training, deployment, and memory profiling tools.

Downloads: 3 This Week

Last Update: 2026-02-20
See Project
21

Datasets

Hub of ready-to-use datasets for ML models

...Load a dataset in a single line of code, and use our powerful data processing methods to quickly get your dataset ready for training in a deep learning model. Backed by the Apache Arrow format, process large datasets with zero-copy reads for optimal speed and efficiency. We also feature a deep integration with the Hugging Face Hub, allowing you to easily load and share a dataset with the wider NLP community. There are currently more than 2,658 datasets and more than 34 metrics available. Datasets frees the user from RAM limitations: all datasets are memory-mapped using an efficient zero-serialization-cost backend (Apache Arrow). ...

Downloads: 5 This Week

Last Update: 2026-06-05
See Project
22

Mistral Finetune

Memory-efficient and performant finetuning of Mistral's models

mistral-finetune is an official lightweight codebase designed for memory-efficient and performant finetuning of Mistral’s open models (e.g. 7B, instruct variants). It builds on techniques like LoRA (Low-Rank Adaptation) to allow customizing models without full parameter updates, which reduces GPU memory footprint and training cost. The repo includes utilities for data preprocessing (e.g. reformat_data.py), validation scripts, and example YAML configs for training variants like 7B base or instruct models. ...

Downloads: 0 This Week

Last Update: 2025-10-04
See Project
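
The memory saving that the Mistral Finetune description attributes to LoRA follows from simple arithmetic: instead of updating a full d_out x d_in weight matrix, LoRA trains two low-rank factors of shapes d_out x r and r x d_in. The sketch below counts trainable parameters for one layer; the dimensions and rank are illustrative assumptions, not values taken from mistral-finetune's configs.

```python
def full_trainable_params(d_out, d_in):
    # Full finetuning updates every entry of the weight matrix.
    return d_out * d_in


def lora_trainable_params(d_out, d_in, rank):
    # LoRA trains B (d_out x rank) and A (rank x d_in); the base matrix is frozen.
    return rank * (d_out + d_in)


d_out, d_in, rank = 4096, 4096, 8  # illustrative sizes only
full = full_trainable_params(d_out, d_in)
lora = lora_trainable_params(d_out, d_in, rank)
print(full, lora, full // lora)
```

Fewer trainable parameters means fewer gradients and optimizer states to hold in GPU memory, although the frozen base weights still need to be loaded.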
23

Memvid

Video-based AI memory library. Store millions of text chunks in MP4

Memvid encodes text chunks as QR codes within MP4 frames to build a portable “video memory” for AI systems. This innovative approach uses standard video containers and offers millisecond-level semantic search across large corpora with dramatically less storage than vector DBs. It's self-contained—no DB needed—and supports features like PDF indexing, chat integration, and cloud dashboards.

Downloads: 5 This Week

Last Update: 2026-05-27
See Project
24

Agentex

Open source codebase for Scale Agentex

AgentEX is an open framework from Scale for building, running, and evaluating agentic workflows, with an emphasis on reproducibility and measurable outcomes rather than ad-hoc demos. It treats an “agent” as a composition of a policy (the LLM), tools, memory, and an execution runtime so you can test the whole loop, not just prompting. The repo focuses on structured experiments: standardized tasks, canonical tool interfaces, and logs that make it possible to compare models, prompts, and tool sets fairly. It also includes evaluation harnesses that capture success criteria and partial credit, plus traces you can inspect to understand where reasoning or tool use failed. ...

Downloads: 20 This Week

Last Update: 6 days ago
See Project
25

Transformer Engine

A library for accelerating Transformer models on NVIDIA GPUs

...As the number of parameters in Transformer models continues to grow, training and inference for architectures such as BERT, GPT, and T5 become very memory and compute-intensive. Most deep learning frameworks train with FP32 by default. This is not essential, however, to achieve full accuracy for many deep learning models.

Downloads: 14 This Week

Last Update: 6 days ago
See Project