Showing 19 open source projects for "shared memory allocator"

  • 1
    Cipher

    Cipher is an open-source memory layer designed for coding agents

    Cipher is an open‑source memory layer designed for AI coding agents. It records both high‑level conceptual memory (design decisions, business logic) and agent reasoning steps so AI assistants can maintain context seamlessly across IDEs or sessions. Built to integrate with MCP-compatible environments (e.g. Cursor, Windsurf, Gemini CLI, Claude Desktop, AWS Kiro), it enables team-aware workspace memory sharing and real-time collaboration.
    Downloads: 4 This Week
  • 2
    Vellum

    A personal AI assistant that evolves with you

    Vellum is an open-source personal AI assistant platform designed to function as a persistent, autonomous digital companion across desktop and messaging environments. Unlike traditional chatbot interfaces, the project focuses on long-term memory, identity, proactive behavior, and real-world tool usage, enabling assistants to evolve alongside the user over time. The system integrates with macOS, Telegram, Slack, SMS, and additional communication channels while maintaining shared memory and context across platforms. Its architecture combines local-first storage, tool orchestration, sandboxed execution, and extensible workflow automation to allow assistants to read files, manage schedules, send messages, browse the web, and control applications. ...
    Downloads: 1 This Week
  • 3
    METATRON

    AI-powered penetration testing assistant using local LLM on linux

    METATRON is a multi-agent AI orchestration framework designed to coordinate complex workflows across multiple intelligent agents. It provides a structured system for task delegation, communication, and collaboration between agents. The framework emphasizes scalability, allowing multiple agents to work together on large or complex problems. It includes mechanisms for managing context, memory, and execution flow across tasks. METATRON is particularly useful for building advanced AI systems...
    Downloads: 0 This Week
  • 4
    ChatGPT Clone

    ChatGPT interface with better UI

    ChatGPT Clone demonstrates a ChatGPT-style conversational interface wired to large-language-model backends, packaged so developers can self-host and extend. The goal is to replicate the core chat UX—message history, streaming tokens, code blocks, and system prompts—while letting you plug in different provider APIs or local models. It showcases a clean separation between the web client and the message orchestration layer so you can experiment with prompts, roles, and memory strategies. The...
    Downloads: 12 This Week
  • 5
    FramePack

    Let's make video diffusion practical

    FramePack explores compact representations for sequences of image frames, targeting tasks where many near-duplicate frames carry redundant information. The idea is to “pack” frames by detecting shared structure and storing differences efficiently, which can accelerate training or inference on video-like data. By reducing I/O and memory bandwidth, datasets become lighter to load while models still see the essential temporal variation. The repository demonstrates both packing and unpacking steps, making it straightforward to integrate into preprocessing pipelines. ...
    Downloads: 13 This Week
  • 6
    CogVideo

    Text and image to video generation: CogVideoX and CogVideo

    CogVideo is an open-source family of advanced video generation models that can create videos from text, images, or existing video inputs. Built on large-scale Transformer and diffusion architectures, it enables multimodal generation across text-to-video, image-to-video, and video continuation tasks. The latest CogVideoX models offer higher resolution outputs, longer video durations, and improved controllability through prompt engineering. The project includes tools for inference,...
    Downloads: 17 This Week
  • 7
    Oh My codeX (OMX)

    Your codex is not alone. Add hooks, agent teams, HUDs

    Oh My codeX (OMX) is a multi-agent orchestration layer designed to extend the capabilities of OpenAI Codex CLI by introducing structured teamwork, automation, and advanced workflow management. It addresses limitations in the base Codex environment, such as the lack of hooks, agent coordination, and persistent execution, by layering a shell-based system that enables richer interaction patterns. The project transforms a single AI coding assistant into a coordinated system of specialized agents...
    Downloads: 4 This Week
  • 8
    Qwen

    The official repo of Qwen chat & pretrained large language model

    Qwen is a series of large language models developed by Alibaba Cloud, consisting of various pretrained versions like Qwen-1.8B, Qwen-7B, Qwen-14B, and Qwen-72B. These models, which range from smaller to larger configurations, are designed for a wide range of natural language processing tasks. They are openly available for research and commercial use, with Qwen's code and model weights shared on GitHub. Qwen's capabilities include text generation, comprehension, and conversation, making it a...
    Downloads: 14 This Week
  • 9
    IronClaw

    IronClaw is OpenClaw inspired but focused on privacy & security

    ...With dynamic tool building, persistent memory, and background automation, IronClaw is built to securely expand and adapt to your personal and professional workflows.
    Downloads: 3 This Week
  • 10
    Context Hub

    Makes coding agents get smarter with every task

    Context Hub is a curated documentation system built to help coding agents write more accurate code. It gives agents versioned, language-specific reference material instead of forcing them to rely on noisy web searches or stale model memory. The project includes a CLI called chub that agents can use to search for available docs, fetch specific API guidance, and request only the files they need. It also supports local annotations, allowing an agent to remember project-specific notes, pitfalls, or workarounds across future sessions. Feedback can be sent back to maintainers so shared documentation improves over time. ...
    Downloads: 2 This Week
  • 11
    LMCache

    Supercharge Your LLM with the Fastest KV Cache Layer

    LMCache is an extension layer for LLM serving engines that accelerates inference, especially with long contexts, by storing and reusing key-value (KV) attention caches across requests. Instead of rebuilding KV states for repeated or shared text segments, LMCache persists and retrieves them from multiple tiers—GPU memory, CPU DRAM, and local disk—then injects them into subsequent requests to reduce TTFT and increase throughput. Its design supports reuse beyond strict prefix matching and enables sharing across serving instances, improving efficiency under real multi-tenant traffic. ...
    Downloads: 3 This Week
  • 12
    Extractous

    Fast and efficient unstructured data extraction

    ...Its purpose is to extract text and metadata efficiently from formats such as PDF, Word, HTML, email archives, images, and more, without depending on external APIs or separate parsing servers. The project emphasizes performance and low memory usage, and its maintainers describe it as a local-first alternative to heavier extraction stacks. For broader format support, the system combines its Rust core with ahead-of-time compiled Apache Tika shared libraries, which allows it to extend parsing coverage while still avoiding traditional server-based overhead. It also supports OCR for images and scanned documents through Tesseract, making it useful for document ingestion pipelines that include image-based or scanned inputs.
    Downloads: 0 This Week
  • 13
    Refly

    The first open-source agent skills builder

    ...With a focus on making automation accessible, it provides a visual canvas and low-code components that feel similar to drag-and-drop builders but backed by powerful AI orchestration, memory handling, and integrations with external services. Refly’s approach bridges the gap between workflow ideas and stable, deterministic infrastructure: skills become governed capabilities that can be versioned, shared, and monetized, not just temporary scripts.
    Downloads: 0 This Week
  • 14
    Tiny CUDA Neural Networks

    Lightning fast C++/CUDA neural network framework

    ...We provide a sample application where an image function (x,y) -> (R,G,B) is learned. The fully fused MLP component of this framework requires a very large amount of shared memory in its default configuration. It will likely only work on an RTX 3090, an RTX 2080 Ti, or high-end enterprise GPUs. Lower-end cards must reduce the n_neurons parameter or use the CutlassMLP (better compatibility but slower) instead. tiny-cuda-nn comes with a PyTorch extension that allows using the fast MLPs and input encodings from within a Python context. ...
    Downloads: 0 This Week
  • 15
    Punica

    Serving multiple LoRA fine-tuned LLMs as one

    Punica is a system designed to efficiently serve multiple LoRA-fine-tuned large language models within a shared GPU environment. LoRA is a parameter-efficient fine-tuning method that allows developers to adapt large pretrained models to specific tasks by adding lightweight adapter layers rather than retraining the entire model. Punica introduces a serving architecture that allows multiple LoRA adapters to share the same base model during inference, significantly reducing memory consumption and computational overhead. ...
    Downloads: 0 This Week
  • 16

    rt-agents

    Agent system for HPC real time applications

    Downloads: 0 This Week
  • 17

    popt4jlib

    Parallel Optimization Library for Java

    ...A fast parallel implementation of the network simplex method and some full-fledged parallel/distributed MIP solvers will be added in the next version. In general, emphasis is placed on improving the efficiency of the algorithms in shared-memory models via Java threads, since multi-core machines are so widespread today.
    Downloads: 0 This Week
  • 18
    Blackboard implements a lightweight, portable tuple space suitable for multi-agent system and distributed component design. Supports implicit invocation via content-filtered asynchronous events, blocking call semantics, and shared memory messaging.
    Downloads: 0 This Week
  • 19
    The PSU Mars Rover Software System is a collection of modules, connected via a shared memory space, that allow the various sub-systems to control the rover in all of its tasks, especially navigation.
    Downloads: 0 This Week