Showing 21 open source projects for "operating"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • 1
    Self-Operating Computer

    Self-Operating Computer

    A framework to enable multimodal models to operate a computer

    The Self-Operating Computer Framework is an innovative system that enables multimodal models to autonomously operate a computer by interpreting the screen and executing mouse and keyboard actions to achieve specified objectives. This framework is compatible with various multimodal models and currently integrates with GPT-4o, o1, Gemini Pro Vision, Claude 3, and LLaVa.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 2
    Chatbox

    Chatbox

    The Ultimate AI Copilot on Your Desktop

    Chatbox is a cross-platform desktop AI client designed to give you a fast, polished, and private way to work with modern language models. It runs locally on Windows, macOS, and Linux, keeping your conversations and data stored on your own device. Chatbox acts as a unified interface for popular LLMs like ChatGPT, Claude, Gemini, and local models via Ollama, making it easy to switch providers without changing tools. Built with an ergonomic UI, it’s optimized for long sessions, prompt...
    Downloads: 20 This Week
    Last Update:
    See Project
  • 3
    AIChat

    AIChat

    All-in-one LLM CLI tool featuring Shell Assistant

    AIChat is a lightweight terminal-based chatbot powered by GPT models, enabling AI-driven conversations directly from the command line.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    AskUI Vision Agent

    AskUI Vision Agent

    Enable AI to control your desktop, mobile and HMI devices

    AskUI’s Vision Agent is an automation framework that allows you—and AI agents—to control real desktops, mobile devices, and HMI systems by perceiving the UI and performing actions like clicking, typing, scrolling, and drag-and-drop. It is designed for multi-platform compatibility and supports multiple AI models so you can tailor perception and decision-making to your workload. The repository presents a feature overview, sample media, and frequent release notes, which show ongoing...
    Downloads: 2 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 5
    CogAgent

    CogAgent

    An open sourced end-to-end VLM-based GUI Agent

    CogAgent is a 9B-parameter bilingual vision-language GUI agent model based on GLM-4V-9B, trained with staged data curation, optimization, and strategy upgrades to improve perception, action prediction, and generalization across tasks. It focuses on operating real user interfaces from screenshots plus text, and follows a strict input–output format that returns structured actions, grounded operations, and optional sensitivity annotations. The model is designed for agent-style execution rather than freeform chat, maintaining a continuous execution history across steps while requiring a fresh session for each new task. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    Android Use

    Android Use

    Automate native Android apps with AI using accessibility APIs

    android-action-kernel is an open source Python library designed to let AI agents control and automate native Android applications running on real devices or emulators. It fills a gap in automation tooling by focusing on mobile-first workflows where traditional browser or desktop-based automation doesn’t work; such as logistics, gig work, field operations, and other industries reliant on phones or tablets. The project works by using Android’s accessibility API to extract structured UI state...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    agency-agents-zh

    agency-agents-zh

    193 plug-and-play AI expert roles

    agency-agents-zh is a framework focused on building and coordinating multiple AI agents, likely with a particular emphasis on Chinese-language environments or documentation. The project appears to explore the concept of agent collaboration, where different agents handle specialized tasks and communicate to achieve broader objectives. It is designed to simulate organizational workflows, enabling complex problem-solving through distributed intelligence rather than a single monolithic model....
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    holaOS

    holaOS

    An Open Agent Computer for ANY digital work

    holaOSc is an AI-native operating system concept designed to integrate intelligent agents directly into the core of the computing environment. It provides a framework where AI agents manage tasks, workflows, and interactions across applications. The system emphasizes seamless automation, allowing users to interact with their computer through natural language and high-level instructions.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    Alan AI for Android

    Alan AI for Android

    Assistant SDK to build a multimodal conversational UX for Android

    Quickly add voice to your app with the Alan Platform. Create an in-app voice assistant to enable human-like conversations and provide a personalized voice experience for every user. Alan is a conversational voice AI platform that lets you create an intelligent voice assistant for your app. It offers all the necessary tools to design, embed, and host your voice solutions. A powerful web-based IDE where you can write, test and debug dialog scenarios for your voice assistant or chatbot. Alan's...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 10
    Open Interface

    Open Interface

    Control Any Computer Using LLMs

    Open Interface is a cross-platform application that allows users to control their computers using large language models (LLMs). By sending user requests to an LLM backend, it determines the necessary steps and executes them by simulating keyboard and mouse inputs. The system can adjust its actions based on real-time feedback, providing a self-driving computer experience.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 11
    Alan AI for iOS

    Alan AI for iOS

    In-App assistant SDK to build a multimodal conversational UX for iOS

    Quickly add voice to your app with the Alan Platform. Create an in-app voice assistant to enable human-like conversations and provide a personalized voice experience for every user. Alan is a conversational voice AI platform that lets you create an intelligent voice assistant for your app. It offers all the necessary tools to design, embed, and host your voice solutions. A powerful web-based IDE where you can write, test and debug dialog scenarios for your voice assistant or chatbot. Alan's...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Open Autonomy

    Open Autonomy

    A framework for the creation of autonomous agent services

    Open Autonomy is a framework that enables the development of autonomous economic agents (AEAs) capable of operating independently in various economic contexts.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    HybridClaw

    HybridClaw

    The enterprise operating layer for open agents

    HybridClaw is an emerging open-source framework focused on enabling hybrid AI agent systems that combine local execution, tool integration, and multi-agent orchestration into a cohesive development environment. It is designed to work alongside modern agent ecosystems such as OpenClaw, Claude Code, and similar agentic coding tools, providing a flexible infrastructure for managing agent behaviors, workflows, and capabilities. The project emphasizes modularity, allowing developers to define and...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14
    NemoClaw

    NemoClaw

    NVIDIA plugin for secure installation of OpenClaw

    ...Through its command-line interface, developers can deploy, monitor, and manage AI assistants running inside isolated sandboxes. By combining sandbox orchestration, agent management, and AI model integration, NemoClaw provides a secure foundation for building and operating autonomous AI assistants.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 15
    008

    008

    Open-source event-driven AI powered Softphone

    ...The name '008' or 'agent 008' reflects our ambition: beyond crafting the premier Open Source Softphone, we aim to introduce a programmable, event-driven AI agent. This agent utilizes embedded artificial intelligence models operating directly on the softphone, ensuring efficiency and reduced operational costs. This project is a WebRTC softphone, and communication is achieved via SIP over a socket. Leading PBX systems like Asterisk or Freeswitch support socket connections. If your provider does not offer this feature, consider using a SIP proxy such as Kamailio, Opensip or Routr. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    Eko

    Eko

    Build Production-ready Agentic Workflow with Natural Language

    Eko (Eko Keeps Operating) is a JavaScript framework designed for building production-ready agent-based workflows using natural language commands. It allows developers to create automated agents that can handle complex workflows in both computer and browser environments. With a focus on high development efficiency, Eko simplifies the creation of multi-step workflows, enabling users to integrate and automate tasks across platforms.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    zhangxuefeng-skill

    zhangxuefeng-skill

    Zhang Xuefeng's cognitive operating system

    zhangxuefeng-skill is an AI agent skill package that distills the decision-making frameworks and practical reasoning style of Chinese educator Zhang Xuefeng into a reusable, executable cognitive system. Rather than functioning as a simple quote collection, it encodes structured heuristics, mental models, and decision logic derived from books, interviews, and real-life case analysis. The skill is designed to be used within AI coding agents such as Claude Code, where it can be invoked to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    MimiClaw

    MimiClaw

    Run OpenClaw on a $5 chip

    MimiClaw (from the mimiclaw project) is an edge-AI personal assistant that runs directly on extremely low-cost hardware like an ESP32-S3 microcontroller without a full operating system, Node.js, or cloud backend. By running pure C on a bare-metal chip, MimiClaw brings AI interactions and persistent memory to a tiny USB-powered device you can carry in your pocket. You connect the device to Wi-Fi and chat with it using Telegram, making it a convenient always-on assistant for tasks like reminders, quick lookups, or custom AI interactions. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    agents.md

    agents.md

    A simple, open format for guiding coding agents

    ...The idea is that AGENTS.md acts as a “README for agents”: a predictable, structured place where humans can put instructions, conventions, build/test commands, environment setup, and other guidance that generative agents (e.g. code-writing, code-assisting tools) should consult when operating in the repo. Instead of putting everything in README or doc files (which are more human-oriented and might mix high-level narrative), AGENTS.md is intended to surface agent-relevant details that help them “do the right thing” (tests, style, project structure, tooling).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    OAGI Python SDK

    OAGI Python SDK

    Python SDK for the Computer Use model Lux, developed by OpenAGI

    OAGI Python SDK is a Python client library for the Lux computer-use model that turns Lux into a programmable automation layer for operating human-facing software via vision and actions. It exposes the OAGI API in an ergonomic way, letting you trigger Lux in three main modes: Tasker for precise scripted sequences, Actor for fast one-shot tasks, and Thinker for open-ended, multi-step objectives. The SDK is designed around “computer use” as a paradigm, where the AI actually navigates interfaces, clicks, types, scrolls, and reads the screen through screenshots instead of only calling APIs. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Chippy is a free and open source adventure game in development where the player controls an AI agent named "Chippy", a small robotic chip that takes control of enemy robots and uses their abilities to fight other robots and solve puzzles.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB