Showing 248 open source projects for "memory"

View related business solutions
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • 1
    Operit AI

    Operit AI

    Powerful Android AI agent with tools, automation, and Linux shell

    ...Operit supports both local and remote AI models, including offline execution through frameworks like llama.cpp and MNN, helping preserve user privacy while maintaining flexibility. Operit also includes an intelligent memory system that stores, organizes, and retrieves user interactions to provide more personalized and context-aware responses. In addition, it offers workflow automation, plugin extensibility, & a rich tool ecosystem, making it suitable for advanced productivity.
    Downloads: 19 This Week
    Last Update:
    See Project
  • 2
    claude-obsidian

    claude-obsidian

    Claude + Obsidian knowledge companion

    ...The system follows the LLM Wiki pattern, where information is stored as persistent markdown files that grow richer over time through cross-referencing and synthesis. It includes features such as contradiction detection, orphaned note identification, and automatic indexing. A persistent memory layer ensures continuity across sessions, eliminating the need for repeated context. It also performs autonomous research to fill knowledge gaps and expand the knowledge base. Overall, it turns note-taking into an active, compounding intelligence system.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 3
    MCP Server Qdrant

    MCP Server Qdrant

    An official Qdrant Model Context Protocol (MCP) server implementation

    The Qdrant MCP Server is an official Model Context Protocol server that integrates with the Qdrant vector search engine. It acts as a semantic memory layer, allowing for the storage and retrieval of vector-based data, enhancing the capabilities of AI applications requiring semantic search functionalities. ​
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    Clawbolt

    Clawbolt

    The AI Assistant that actually does things for the trades

    ...The platform allows users to interact with an AI assistant through iMessage, SMS, RCS, Telegram, and related messaging channels to handle tasks such as estimates, invoices, scheduling, reminders, and client communication. Clawbolt combines large language model orchestration with memory systems, file storage integrations, and tool-calling workflows to create an assistant capable of managing real operational tasks instead of only answering prompts. The project supports integrations with QuickBooks Online, Google Calendar, Dropbox, and Google Drive, enabling automated business workflows tied directly to conversations. ...
    Downloads: 7 This Week
    Last Update:
    See Project
  • Stop Storing Third-Party Tokens in Your Database Icon
    Stop Storing Third-Party Tokens in Your Database

    Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

    Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.
    Try Auth0 for Free
  • 5
    KVCache-Factory

    KVCache-Factory

    Unified KV Cache Compression Methods for Auto-Regressive Models

    ...In large language models, the key-value cache stores intermediate attention states that enable efficient token generation during inference, but these caches can consume large amounts of GPU memory when handling long contexts. KVCache-Factory provides a platform for implementing and evaluating multiple compression strategies that reduce memory usage while preserving model performance. The framework integrates several state-of-the-art methods such as PyramidKV, SnapKV, H2O, and StreamingLLM, allowing researchers to compare and experiment with different approaches within the same environment. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    Build Your Own OpenClaw

    Build Your Own OpenClaw

    A step-by-step guide to build your own AI agent

    Build Your Own OpenClaw is a step-by-step educational framework that teaches developers how to construct a fully functional AI agent system from scratch, gradually evolving from a simple chat loop into a multi-agent, production-ready architecture. The project is structured into 18 progressive stages, each introducing a new concept such as tool usage, memory persistence, event-driven design, and multi-agent coordination, with each step including both explanatory documentation and runnable code. It begins with foundational concepts like conversational loops and tool integration, then expands into more advanced capabilities such as dynamic skill loading, web interaction, and context management. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    Stable Diffusion WebUI Forge

    Stable Diffusion WebUI Forge

    Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion

    ...It also focuses on stability during long sessions, aiming to reduce out-of-memory failures and provide clearer diagnostics when they occur. The UI surfaces advanced options in a way that remains recognizable to WebUI users, so migration costs are low while gaining experimental features. In practice, Forge serves as a proving ground for ideas that may later influence upstream tools, giving power users early access to cutting-edge techniques.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Reflexion

    Reflexion

    Reflexion: Language Agents with Verbal Reinforcement Learning

    ...Instead of relying solely on a single-pass response, Reflexion enables agents to evaluate their own outputs, identify errors, and refine their reasoning over multiple iterations, leading to more accurate and reliable results. The framework introduces a mechanism where agents maintain a memory of past attempts and use that memory to guide future decisions, effectively simulating a learning process without requiring traditional model retraining. This approach is particularly useful for complex reasoning tasks, coding challenges, and decision-making scenarios where initial outputs may be incomplete or incorrect. Reflexion also emphasizes transparency by making intermediate reasoning steps explicit, allowing developers to inspect how conclusions are reached and where improvements occur.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Colossal-AI

    Colossal-AI

    Making large AI models cheaper, faster and more accessible

    The Transformer architecture has improved the performance of deep learning models in domains such as Computer Vision and Natural Language Processing. Together with better performance come larger model sizes. This imposes challenges to the memory wall of the current accelerator hardware such as GPU. It is never ideal to train large models such as Vision Transformer, BERT, and GPT on a single GPU or a single machine. There is an urgent demand to train models in a distributed environment. However, distributed training, especially model parallelism, often requires domain expertise in computer systems and architecture. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • $300 Free Credits to Build on Google Cloud Icon
    $300 Free Credits to Build on Google Cloud

    New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.
    Claim $300 Free
  • 10
    Taipy

    Taipy

    Turns Data and AI algorithms into production-ready web applications

    ...Taipy enhances performance with caching control of graphical events, optimizing rendering by selectively updating graphical components only upon interaction. Effortlessly manage massive datasets with Taipy's built-in decimator for charts, intelligently reducing the number of data points to save time and memory without losing the essence of your data's shape. Struggle with sluggish performance and excessive memory usage, as every data point demands processing. Large datasets become cumbersome, complicating the user experience and data analysis. Scenarios are made easy with Taipy Studio. A powerful VS Code extension that unlocks a convenient graphical editor. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 11
    CodeGeeX2

    CodeGeeX2

    CodeGeeX2: A More Powerful Multilingual Code Generation Model

    ...With improved inference efficiency, quantization options, and multi-query/flash attention, CodeGeeX2 achieves faster generation speeds and lightweight deployment, requiring as little as 6GB GPU memory at INT4 precision. Its backend powers the CodeGeeX IDE plugins for VS Code, JetBrains, and other editors, offering developers interactive AI assistance with features like infilling and cross-file completion.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 12
    Agent Framework

    Agent Framework

    Framework for building, orchestrating, and deploying AI agents

    ...Microsoft Agent Framework supports graph-based orchestration that enables developers to connect agents, functions, and tools into structured workflows capable of handling multi-step processes. It also includes components such as agent sessions for managing state, context providers for maintaining memory, and middleware for intercepting and extending agent behavior. Developers can integrate external tools and services so that agents can execute actions beyond text generation.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 13
    Phidata

    Phidata

    Build multi-modal Agents with memory, knowledge, tools and reasoning

    Phidata is an open source platform for building, deploying, and monitoring AI agents. It enables users to create domain-specific agents with memory, knowledge, and external tools, enhancing AI capabilities for various tasks. The platform supports a range of large language models and integrates seamlessly with different databases, vector stores, and APIs. Phidata offers pre-configured templates to accelerate development and deployment, allowing users to quickly go from building agents to shipping them into production. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 14
    Pedalboard

    Pedalboard

    A Python library for audio

    pedalboard is a Python library for working with audio: reading, writing, rendering, adding effects, and more. It supports the most popular audio file formats and a number of common audio effects out of the box and also allows the use of VST3® and Audio Unit formats for loading third-party software instruments and effects. pedalboard was built by Spotify’s Audio Intelligence Lab to enable using studio-quality audio effects from within Python and TensorFlow. Internally at Spotify, pedalboard...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 15
    LLM Telegram Bot

    LLM Telegram Bot

    A Telegram bot for Large Language Models

    ...The project is designed to provide a customizable AI assistant that can operate within Telegram conversations, supporting dynamic responses based on user input and configurable parameters. It includes features such as conversation memory, allowing the bot to maintain context across multiple messages and provide more coherent responses. The system supports multiple modes or personas, enabling users to switch between different conversational styles or use cases. It also allows fine-tuning of generation parameters such as temperature and token limits, giving users control over response behavior. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    OpenSage

    OpenSage

    An agent framework that enables AI to create their own agent

    ...Unlike traditional agent frameworks that require developers to manually define workflows, tools, and structures, OpenSage introduces a system where large language models can dynamically generate their own agent architectures, including sub-agents, toolchains, and execution strategies. The framework is built around the concept of an Agent Development Kit (ADK), providing structured components for memory, reasoning, and task decomposition while allowing agents to iteratively improve their own design. A key innovation is its hierarchical and graph-based memory system, which enables agents to store, retrieve, and organize information across complex workflows with improved efficiency and contextual awareness.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    FlexLLMGen

    FlexLLMGen

    Running large language models on a single GPU

    ...The system focuses on high-throughput generation workloads where large batches of text must be processed quickly, such as large-scale data extraction or document analysis tasks. Instead of requiring expensive multi-GPU systems, the framework uses techniques such as memory offloading, compression, and optimized batching to run large models on commodity hardware. The architecture distributes computation and memory usage across the GPU, CPU, and disk in order to maximize the number of tokens processed during inference. This design allows organizations to deploy powerful language models for high-volume tasks without the infrastructure costs typically associated with large-scale AI systems. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    FramePack

    FramePack

    Lets make video diffusion practical

    ...The idea is to “pack” frames by detecting shared structure and storing differences efficiently, which can accelerate training or inference on video-like data. By reducing I/O and memory bandwidth, datasets become lighter to load while models still see the essential temporal variation. The repository demonstrates both packing and unpacking steps, making it straightforward to integrate into preprocessing pipelines. It’s useful for diffusion and generative models that learn from sequential image datasets, as well as classical pipelines that batch many related frames. ...
    Downloads: 53 This Week
    Last Update:
    See Project
  • 19
    Claw Compactor

    Claw Compactor

    14-stage Fusion Pipeline for LLM token compression

    ...It is especially useful in autonomous workflows where agents accumulate large volumes of interaction history over time. The project aligns with broader strategies in AI systems that balance memory retention with computational constraints. Overall, claw-compactor functions as an infrastructure component that enhances scalability and stability in persistent AI agent environments.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 20
    BitNet

    BitNet

    BitNet: Scaling 1-bit Transformers for Large Language Models

    ...The project implements the BitNet architecture described in research on scaling transformer models using extremely low-bit quantization techniques. In this approach, neural network weights are quantized to approximately one bit per parameter, allowing models to operate with far lower memory usage than traditional 16-bit or 32-bit neural networks. The architecture introduces specialized layers such as BitLinear, which replace standard linear projections in transformer networks with quantized operations. By limiting weight precision while maintaining efficient scaling and normalization strategies, the architecture aims to retain competitive performance while significantly reducing hardware requirements.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 21
    CowAgent

    CowAgent

    AI assistant based on large models that can actively think and plan

    ...It enables automated message handling by connecting WeChat accounts with AI models that can generate contextual replies, process voice messages, and produce images directly inside chats. The platform has evolved beyond a simple chatbot into a more autonomous agent capable of planning complex tasks, maintaining long-term memory, and invoking external tools to complete workflows. It supports multi-turn conversations with per-user context tracking, allowing more natural and persistent interactions across private and group chats. Developers can extend functionality through a plugin architecture and customizable rules, making it suitable for both personal assistants and enterprise automation scenarios.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 22
    agentic-stack

    agentic-stack

    One brain, many harnesses. Portable .agent/ folder

    ...It likely provides components for managing agent workflows, communication, and task execution across different systems. The project emphasizes modularity, enabling developers to assemble custom pipelines using various AI models, tools, and APIs. It may include abstractions for memory, planning, and tool usage, reflecting modern agentic AI design patterns. The stack is intended to accelerate development by providing reusable building blocks for complex AI systems. Overall, it represents an infrastructure layer for creating autonomous or semi-autonomous AI applications.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    WanGP

    WanGP

    AI video generator optimized for low VRAM and older GPUs use

    Wan2GP is an open source AI video generation toolkit designed to make modern generative models accessible on consumer-grade hardware with limited GPU memory. It acts as a unified interface for running multiple video, image, and audio generation models, including Wan-based models as well as other systems like Hunyuan Video, Flux, and Qwen. A key focus of the project is reducing VRAM requirements, enabling some workflows to run on as little as 6 GB while still supporting older Nvidia and certain AMD GPUs. ...
    Downloads: 45 This Week
    Last Update:
    See Project
  • 24
    Pyreft

    Pyreft

    ReFT: Representation Finetuning for Language Models

    PyreFT is a tool by Stanford NLP for fine-tuning transformer models with an emphasis on efficient, resource-conserving training and customizability for NLP tasks.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 25
    ACE-Step 1.5

    ACE-Step 1.5

    The most powerful local music generation model

    ...It integrates cutting-edge generative techniques—such as diffusion-based synthesis combined with compressed autoencoders and lightweight transformer elements—to produce high-quality full-length music tracks with rapid inference times, capable of generating a complete song in seconds on modern GPUs while remaining efficient enough to run on consumer-grade hardware with minimal memory requirements. Beyond straightforward text-to-music synthesis, ACE-Step 1.5 enables flexible creative workflows, including tasks like cover generation, editing existing tracks, transforming vocals to background accompaniment, and stylistic personalization using low-rank adaptation from just a few example songs.
    Downloads: 54 This Week
    Last Update:
    See Project
Auth0 Logo