Showing 64 open source projects for "control"

View related business solutions
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 1
    UI-TARS Desktop

    UI-TARS Desktop

    A GUI Agent app based on UI-TARS to control your computer using AI

    UI-TARS Desktop is a graphical user interface (GUI) agent application that leverages the UI-TARS vision-language model to enable natural language control of computers. This cross-platform tool supports both Windows and macOS, allowing users to perform tasks through intuitive commands. Key features include screenshot-based visual recognition, precise mouse and keyboard control, and real-time feedback on actions. Provides immediate responses and visual feedback on actions performed. ...
    Downloads: 93 This Week
    Last Update:
    See Project
  • 2
    n8n

    n8n

    Free and source-available fair-code licensed workflow automation tool

    n8n is an extendable workflow automation tool. With a fair-code distribution model, n8n will always have visible source code, be available to self-host, and allow you to add your own custom functions, logic and apps. n8n's node-based approach makes it highly versatile, enabling you to connect anything to everything. n8n has 200+ different nodes to automate workflows.
    Downloads: 784 This Week
    Last Update:
    See Project
  • 3
    OpenClaw

    OpenClaw

    Your own personal AI assistant. Any OS. Any Platform.

    OpenClaw (formerly Clawdbot/Moltbot) is an open-source, self-hosted autonomous AI assistant designed to run on user-controlled hardware and bridge conversational natural language with real-world task execution, effectively acting as a proactive digital assistant rather than a reactive chatbot. It lets you send instructions through familiar messaging platforms like WhatsApp, Telegram, Discord, Slack, Signal, iMessage, and more, and then interprets those instructions to carry out actions such...
    Downloads: 192 This Week
    Last Update:
    See Project
  • 4
    OpenAI Codex CLI

    OpenAI Codex CLI

    Lightweight coding agent that runs in your terminal

    OpenAI Codex CLI is a lightweight, open-source coding assistant that runs directly in your terminal, designed to bring ChatGPT-level reasoning to your code workflows. It allows developers to interactively query, edit, and generate code within their repositories, all while maintaining version control. The CLI can scaffold new files, run code in sandboxed environments, install dependencies, and commit changes automatically, streamlining chat-driven development. It supports various approval modes—from suggestion-only to full automation—ensuring safe and controlled code execution. Codex CLI can also handle multimodal inputs like screenshots and diagrams to implement features intelligently. ...
    Downloads: 176 This Week
    Last Update:
    See Project
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 5
    LobeHub

    LobeHub

    Workspace to find, build, and collaborate with AI agents

    ...Users can build personalized agent teams that understand their workflows, preferences, and goals over time. LobeHub brings multiple models, tools, and modalities into a single unified environment under the user’s control. With built-in collaboration features, agents can work in parallel, share context, and support complex projects seamlessly. The platform is built around the idea of co-evolution, where both humans and agents continuously learn and improve together.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 6
    chrome-cdp

    chrome-cdp

    Give your AI agent access to your live Chrome session

    chrome-cdp-skill is a specialized integration that enables AI agents to control and interact with web browsers through the Chrome DevTools Protocol (CDP). It allows agents to perform tasks such as navigating pages, extracting data, interacting with elements, and executing scripts in a browser environment. The project is designed to extend the capabilities of AI systems beyond static knowledge by giving them real-time access to web content and interactive interfaces.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Project AIRI

    Project AIRI

    Self hosted, you-owned Grok Companion

    ...AIRI integrates real-time voice chat capabilities and can interact with external applications such as games, enabling more immersive and dynamic experiences. The system emphasizes user ownership and local hosting so developers maintain full control over their AI companion instances. Overall, AIRI serves as an extensible framework for building lifelike AI-driven virtual characters and interactive assistants.
    Downloads: 41 This Week
    Last Update:
    See Project
  • 8
    LobsterAI

    LobsterAI

    Your 24/7 all-scenario AI agent that gets work done for you

    ...Its central Cowork mode allows it to run tools, manipulate files, and execute commands in a local or sandboxed environment under user supervision. The project includes built-in skills for office documents, browser automation, web search, and video generation. It also supports remote control through messaging platforms, making it possible to trigger tasks from a phone. LobsterAI is designed for users who want an agent that can actually perform work, not just answer questions.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 9
    CoPaw

    CoPaw

    Your Personal AI Assistant; easy to install, deploy on local or coud

    CoPaw is a personal AI assistant designed to run on your own machine or in the cloud, giving you full control over memory, models, and data. Built by the AgentScope team, it connects to multiple chat platforms—including DingTalk, Feishu, QQ, Discord, iMessage, and more—through a single unified assistant. CoPaw supports both cloud-based LLM providers and fully local models such as llama.cpp, MLX, and Ollama, allowing you to operate without API keys if preferred.
    Downloads: 10 This Week
    Last Update:
    See Project
  • Secure File Transfer for Windows with Cerberus by Redwood Icon
    Secure File Transfer for Windows with Cerberus by Redwood

    Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

    Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
    Try for Free
  • 10
    AskUI Vision Agent

    AskUI Vision Agent

    Enable AI to control your desktop, mobile and HMI devices

    AskUI’s Vision Agent is an automation framework that allows you—and AI agents—to control real desktops, mobile devices, and HMI systems by perceiving the UI and performing actions like clicking, typing, scrolling, and drag-and-drop. It is designed for multi-platform compatibility and supports multiple AI models so you can tailor perception and decision-making to your workload. The repository presents a feature overview, sample media, and frequent release notes, which show ongoing improvements such as CORS checks and other operational tweaks. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    ex-skill

    ex-skill

    Distill your ex into an AI Skill

    ex-skill is an experimental AI tooling project that allows users to transform personal memories, particularly past relationships, into interactive AI “skills” that replicate the communication style, personality, and behavioral patterns of a specific individual. The system works by ingesting various forms of personal data such as chat logs, social media content, photos, and user-provided descriptions, then structuring this information into a layered representation that combines memory and...
    Downloads: 29 This Week
    Last Update:
    See Project
  • 12
    serve-sim

    serve-sim

    The `npx serve` of Apple Simulators

    serve-sim is a developer tool for hosting Apple Simulators through a local or network-accessible web interface. It is described as the npx serve equivalent for Apple Simulators, allowing users to preview and control a simulator from a browser. The tool is especially useful for AI coding agents such as Codex, Cursor, and Claude Desktop because they can interact with a simulator through browser-driven workflows. It can run locally, over a LAN, or through a remote Mac with tunneling. The web UI streams the simulator and forwards clicks, enabling browser-based end-to-end testing and debugging. serve-sim is best suited for iOS development, agent testing, remote simulator access, and mobile UI automation workflows.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    Accomplish

    Accomplish

    Accomplish is the open source Al coworker that lives on your desktop

    ...It can handle file management, document creation, and browser-based workflows through natural language instructions. The system runs locally, ensuring that user data remains private and under full control. It supports integration with multiple AI providers or local models, giving users flexibility in how intelligence is powered. Accomplish emphasizes autonomy, allowing it to execute multi-step tasks without constant supervision. Its design focuses on replacing repetitive manual workflows with intelligent automation. Overall, it acts as a personal AI coworker embedded in the desktop environment.
    Downloads: 21 This Week
    Last Update:
    See Project
  • 14
    Vibium

    Vibium

    Browser automation for AI agents and humans

    Vibium is an open-source browser automation infrastructure built to serve both AI agents and human developers by simplifying control and interaction with real browsers. It integrates a single lightweight binary that manages browser lifecycle, implements a WebDriver BiDi proxy, and exposes a Model Context Protocol (MCP) server so language models or automation clients can control browser behavior without complex setup. This design makes it ideal for AI agents that need to interact with the web, perform tasks, or simulate human interactions in a browser environment, and it also works well for traditional testing and automation workflows. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Browser Use

    Browser Use

    Make websites accessible for AI agents

    Browser Use is an AI-powered browser automation framework designed to let agents interact with websites just like humans do. It enables developers and AI systems to perform complex online tasks such as form filling, data extraction, and navigation through natural language instructions. Built with Python and compatible with modern LLMs, it integrates seamlessly with tools like ChatBrowserUse, Google Gemini, and Anthropic models. The platform supports both open-source deployment and a fully...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 16
    SafeClaw

    SafeClaw

    Chat with it via text and voice

    ...It emphasizes privacy and predictability by using traditional programming, rule-based intent parsing, and established machine learning tools rather than large language models, meaning there are no per-token API costs and deterministic behavior. The assistant offers features such as voice control using fully local speech-to-text (Whisper) and text-to-speech (Piper) capabilities, news aggregation with extractive summarization, and smart home or Bluetooth device control. SafeClaw supports multiple channels, including CLI and Telegram, and avoids prompt injection risk because it doesn’t rely on LLMs for core operations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Browser Agent

    Browser Agent

    AI Browser Agent is an advanced Browser AI tool

    Browser Agent Python is an AI-powered browser automation tool developed by Oxylabs that enables users to control web interactions through natural language instead of traditional scripting. The tool allows developers to describe tasks in plain English, such as navigating pages, clicking elements, filling forms, and extracting data, and the system executes those actions as if a human were interacting with the browser. It is designed to simplify complex automation workflows by removing the need for manually written selectors or step-by-step scripts. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    Android Use

    Android Use

    Automate native Android apps with AI using accessibility APIs

    android-action-kernel is an open source Python library designed to let AI agents control and automate native Android applications running on real devices or emulators. It fills a gap in automation tooling by focusing on mobile-first workflows where traditional browser or desktop-based automation doesn’t work; such as logistics, gig work, field operations, and other industries reliant on phones or tablets. The project works by using Android’s accessibility API to extract structured UI state (as XML) from the device, which is then fed to a large language model (LLM) like OpenAI’s models for decision-making, and actions are executed via the Android Debug Bridge (ADB). ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19
    MolmoWeb

    MolmoWeb

    Open multimodal web agent built by Ai2

    MolmoWeb is an open-source multimodal web agent designed to autonomously navigate and interact with web browsers using vision-language models, representing a significant step toward fully agentic AI systems that can operate in real-world digital environments. The system takes natural language instructions and translates them into sequences of browser actions such as clicking, typing, scrolling, and navigating, effectively performing tasks on behalf of the user. Unlike traditional automation...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    mcp-use

    mcp-use

    A solution to build and deploy MCP agents and applications

    ...It simplifies authentication, access control, audit logging, observability, sandboxed runtime environments, and deployment workflows, whether self-hosted or managed, making MCP development production-ready. With integrations for popular frameworks like LangChain (Python) and LangChain.js (TypeScript), mcp-use accelerates the creation of tool-enabled AI agents.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    Open Interface

    Open Interface

    Control Any Computer Using LLMs

    Open Interface is a cross-platform application that allows users to control their computers using large language models (LLMs). By sending user requests to an LLM backend, it determines the necessary steps and executes them by simulating keyboard and mouse inputs. The system can adjust its actions based on real-time feedback, providing a self-driving computer experience.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    GELab-Zero

    GELab-Zero

    GUI Exploration Lab. One of the best GUI agent solutions

    ...The idea is to let developers or users harness an AI agent that can simulate clicking, typing, reading UI elements, and interacting with apps in a human-like way via the GUI, which can enable tasks like automated testing, scriptable workflows, or even autonomous usage of GUI-based applications. Because GELab-Zero is fully open-source and doesn’t require external services, it offers privacy and control: everything runs locally under your control. The project provides a lightweight base model (4B parameters in its public release) that can run on modest hardware (depending on quantization), making it more accessible than many large-scale AI solutions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Steel Browser

    Steel Browser

    Open Source Browser API for AI Agents & Apps

    Steel Browser is a privacy-focused web browser built with security and performance optimizations, designed to minimize tracking and enhance user control.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    OpenMonoAgent

    OpenMonoAgent

    Terminal-native coding agent powered by local LLMs

    ...It pairs a .NET CLI with a local llama.cpp inference server so developers can use agentic coding workflows without cloud subscriptions or per-token billing. The project emphasizes privacy, local control, and ownership of the model, compute, and project data. It includes a terminal-native workflow, built-in tools, Docker sandboxing, and code intelligence features. The system can run on CPU or GPU and is designed to auto-configure itself when possible. OpenMonoAgent.ai is best suited for developers who want a local AI development stack with no API keys, no cloud dependency, and no telemetry.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 25
    Claude Code Router

    Claude Code Router

    Use Claude Code as the foundation for coding infrastructure

    ...It also includes an extensible agent-oriented architecture for custom tools and workflows, including support for image-related tasks. Overall, it gives technical users more control over Claude Code infrastructure without abandoning the familiar coding assistant workflow.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
Auth0 Logo