Showing 75 open source projects for "operating"

  • 1
    Self-Operating Computer

    A framework to enable multimodal models to operate a computer

    The Self-Operating Computer Framework is an innovative system that enables multimodal models to autonomously operate a computer by interpreting the screen and executing mouse and keyboard actions to achieve specified objectives. This framework is compatible with various multimodal models and currently integrates with GPT-4o, o1, Gemini Pro Vision, Claude 3, and LLaVa.
    Downloads: 11 This Week
  • 2
    Chatbox

    The Ultimate AI Copilot on Your Desktop

    Chatbox is a cross-platform desktop AI client designed to give you a fast, polished, and private way to work with modern language models. It runs locally on Windows, macOS, and Linux, keeping your conversations and data stored on your own device. Chatbox acts as a unified interface for popular LLMs like ChatGPT, Claude, Gemini, and local models via Ollama, making it easy to switch providers without changing tools. Built with an ergonomic UI, it’s optimized for long sessions, prompt...
    Downloads: 20 This Week
  • 3
    Open-AutoGLM

    An open phone agent model & framework

    Open-AutoGLM is an open-source framework and model designed to empower autonomous mobile intelligent assistants by enabling AI agents to understand and interact with phone screens in a multimodal manner, blending vision and language capability to control real devices. It aims to create an “AI phone agent” that can perceive on-screen content, reason about user goals, and execute sequences of taps, swipes, and text input via automated device control interfaces like ADB, enabling hands-off...
    Downloads: 9 This Week
  • 4
    OpenFang

    Open-source Agent Operating System

    OpenFang is an open-source agent operating system designed to orchestrate autonomous AI agents and workflows in a structured, production-oriented environment. Written primarily in Rust, the project focuses on building a high-performance runtime where multiple specialized agents can collaborate to complete complex computational or development tasks. It aims to move beyond simple chat-based agents by providing infrastructure for persistent agent memory, task coordination, and scalable execution. ...
    Downloads: 3 This Week
  • 5
    AIChat

    All-in-one LLM CLI tool featuring Shell Assistant

    AIChat is a lightweight terminal-based chatbot powered by GPT models, enabling AI-driven conversations directly from the command line.
    Downloads: 2 This Week
  • 6
    Umbrel

    A beautiful personal server OS for Raspberry Pi or any Linux distro

    Run your personal server with a Bitcoin and Lightning node in your home, self-host open source apps like Nextcloud and Matrix to break away from big tech, and take full control of your data. For free. All our interactions on the internet today are mediated by a few companies who offer “free” services in exchange for storing our data on their servers to spy on us. Running a personal server fundamentally changes that. You and your family’s photos, videos, files, notes, passwords, everything,...
    Downloads: 26 This Week
  • 7
    AskUI Vision Agent

    Enable AI to control your desktop, mobile and HMI devices

    AskUI’s Vision Agent is an automation framework that allows you—and AI agents—to control real desktops, mobile devices, and HMI systems by perceiving the UI and performing actions like clicking, typing, scrolling, and drag-and-drop. It is designed for multi-platform compatibility and supports multiple AI models so you can tailor perception and decision-making to your workload. The repository presents a feature overview, sample media, and frequent release notes, which show ongoing...
    Downloads: 2 This Week
  • 8
    ChatMCP

    ChatMCP is an AI chat client implementing the Model Context Protocol

    ChatMCP is a cross‑platform AI chat client that implements the Model Context Protocol (MCP) to provide unified chat experiences across environments, including desktop, mobile, and web, with synchronization and protocol support tailored for MCP. You can install MCP servers from MCP Server Market, a collection of MCP servers that you can use to chat with different data. Tested on major Linux distributions: Ubuntu, Fedora, Arch Linux, openSUSE. Improved Experience: Latest versions...
    Downloads: 1 This Week
  • 9
    CogAgent

    An open sourced end-to-end VLM-based GUI Agent

    CogAgent is a 9B-parameter bilingual vision-language GUI agent model based on GLM-4V-9B, trained with staged data curation, optimization, and strategy upgrades to improve perception, action prediction, and generalization across tasks. It focuses on operating real user interfaces from screenshots plus text, and follows a strict input–output format that returns structured actions, grounded operations, and optional sensitivity annotations. The model is designed for agent-style execution rather than freeform chat, maintaining a continuous execution history across steps while requiring a fresh session for each new task. ...
    Downloads: 1 This Week
  • 10
    Android Use

    Automate native Android apps with AI using accessibility APIs

    android-action-kernel is an open source Python library designed to let AI agents control and automate native Android applications running on real devices or emulators. It fills a gap in automation tooling by focusing on mobile-first workflows where traditional browser or desktop-based automation doesn’t work, such as logistics, gig work, field operations, and other industries reliant on phones or tablets. The project works by using Android’s accessibility API to extract structured UI state...
    Downloads: 2 This Week
  • 11
    agency-agents-zh

    193 plug-and-play AI expert roles

    agency-agents-zh is a framework focused on building and coordinating multiple AI agents, likely with a particular emphasis on Chinese-language environments or documentation. The project appears to explore the concept of agent collaboration, where different agents handle specialized tasks and communicate to achieve broader objectives. It is designed to simulate organizational workflows, enabling complex problem-solving through distributed intelligence rather than a single monolithic model....
    Downloads: 1 This Week
  • 12
    The Agency

    A complete AI agency at your fingertips

    Agency Agents is an open-source collection of specialized AI agent personas designed to function like a complete virtual agency spanning engineering, design, marketing, product, project management, testing, support, spatial computing, and other specialized roles. Rather than providing generic prompts, the project organizes each agent as a structured expert profile with personality traits, mission, workflow, deliverables, examples, and success metrics so that each one feels more like a...
    Downloads: 0 This Week
  • 13
    Claude Code CLI

    A comprehensive guide for setting up Claude Code CLI

    Claude Code CLI is a structured resource designed to help developers use AI-assisted tools effectively in iOS application development. It provides guidance on how to integrate AI workflows into the process of building mobile applications using Swift and Apple’s development ecosystem. The project covers best practices for structuring prompts, managing code generation, and refining outputs to match platform-specific requirements. It emphasizes real-world development scenarios, helping users...
    Downloads: 0 This Week
  • 14
    holaOS

    An Open Agent Computer for ANY digital work

    holaOS is an AI-native operating system concept designed to integrate intelligent agents directly into the core of the computing environment. It provides a framework where AI agents manage tasks, workflows, and interactions across applications. The system emphasizes seamless automation, allowing users to interact with their computer through natural language and high-level instructions.
    Downloads: 1 This Week
  • 15
    MemOS

    AI memory OS for LLM and Agent systems

    ...It aims to support advanced workflows like persistent in-memory data structures, crash-resilient state handling, and seamless sharing of data across tasks without copying. By abandoning some of the historical assumptions of Unix-style operating systems, MemOS attempts to unlock new performance and scalability tradeoffs for applications that need high throughput and low latency on memory-intensive workloads.
    Downloads: 1 This Week
  • 16
    Klavis AI

    MCP integration platforms for AI agents to use tools at any scale

    Klavis AI is a Y Combinator X25-backed open-source infrastructure platform that enables AI agents to reliably connect with external tools and services at scale through Model Context Protocol (MCP). Founded by ex-Google DeepMind and ex-Lyft engineers, Klavis provides 50+ production-ready MCP servers with enterprise OAuth support for GitHub, Slack, Gmail, Salesforce, Linear, Notion, and more. The flagship product Strata solves tool overload through progressive discovery, achieving +13% higher...
    Downloads: 0 This Week
  • 17
    Alan AI for Android

    Assistant SDK to build a multimodal conversational UX for Android

    Quickly add voice to your app with the Alan Platform. Create an in-app voice assistant to enable human-like conversations and provide a personalized voice experience for every user. Alan is a conversational voice AI platform that lets you create an intelligent voice assistant for your app. It offers all the necessary tools to design, embed, and host your voice solutions, including a powerful web-based IDE where you can write, test, and debug dialog scenarios for your voice assistant or chatbot. Alan's...
    Downloads: 0 This Week
  • 18
    Open Interface

    Control Any Computer Using LLMs

    Open Interface is a cross-platform application that allows users to control their computers using large language models (LLMs). By sending user requests to an LLM backend, it determines the necessary steps and executes them by simulating keyboard and mouse inputs. The system can adjust its actions based on real-time feedback, providing a self-driving computer experience.
    Downloads: 4 This Week
  • 19
    Agent Payments Protocol (AP2)

    Building a Secure and Interoperable Future for AI-Driven Payments

    AP2 is a project released by Google’s “Agentic Commerce” initiative, focusing on a protocol and reference implementation for agent-driven or AI-mediated payments. In effect, AP2 aims to define a secure, interoperable protocol that allows software agents to act on behalf of users—making payments or shopping decisions autonomously—while preserving necessary security, auditability, and trust. The repository contains sample scenarios (in Python, Android, etc.) that illustrate how agents,...
    Downloads: 0 This Week
  • 20
    Alan AI for iOS

    In-App assistant SDK to build a multimodal conversational UX for iOS

    Quickly add voice to your app with the Alan Platform. Create an in-app voice assistant to enable human-like conversations and provide a personalized voice experience for every user. Alan is a conversational voice AI platform that lets you create an intelligent voice assistant for your app. It offers all the necessary tools to design, embed, and host your voice solutions, including a powerful web-based IDE where you can write, test, and debug dialog scenarios for your voice assistant or chatbot. Alan's...
    Downloads: 0 This Week
  • 21
    CLI-Anything

    Making ALL Software Agent-Native

    CLI-Anything is a framework designed to transform traditional software applications into agent-native command-line interfaces that can be directly controlled by AI systems. It is built on the idea that the command-line interface is the most universal, structured, and composable interface for both humans and AI agents, enabling deterministic and predictable execution of workflows. The system provides a methodology and tooling for generating CLI wrappers around existing applications, allowing...
    Downloads: 2 This Week
  • 22
    Open Autonomy

    A framework for the creation of autonomous agent services

    Open Autonomy is a framework that enables the development of autonomous economic agents (AEAs) capable of operating independently in various economic contexts.
    Downloads: 1 This Week
  • 23
    NemoClaw

    NVIDIA plugin for secure installation of OpenClaw

    ...Through its command-line interface, developers can deploy, monitor, and manage AI assistants running inside isolated sandboxes. By combining sandbox orchestration, agent management, and AI model integration, NemoClaw provides a secure foundation for building and operating autonomous AI assistants.
    Downloads: 3 This Week
  • 24
    HybridClaw

    The enterprise operating layer for open agents

    HybridClaw is an emerging open-source framework focused on enabling hybrid AI agent systems that combine local execution, tool integration, and multi-agent orchestration into a cohesive development environment. It is designed to work alongside modern agent ecosystems such as OpenClaw, Claude Code, and similar agentic coding tools, providing a flexible infrastructure for managing agent behaviors, workflows, and capabilities. The project emphasizes modularity, allowing developers to define and...
    Downloads: 2 This Week
  • 25
    008

    Open-source event-driven AI powered Softphone

    ...The name '008' or 'agent 008' reflects our ambition: beyond crafting the premier Open Source Softphone, we aim to introduce a programmable, event-driven AI agent. This agent utilizes embedded artificial intelligence models operating directly on the softphone, ensuring efficiency and reduced operational costs. This project is a WebRTC softphone, and communication is achieved via SIP over a socket. Leading PBX systems like Asterisk or FreeSWITCH support socket connections. If your provider does not offer this feature, consider using a SIP proxy such as Kamailio, OpenSIPS, or Routr. ...
    Downloads: 2 This Week