Showing 26 open source projects for "interface"

View related business solutions
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 1
    Open Interface

    Open Interface

    Control Any Computer Using LLMs

    Open Interface is a cross-platform application that allows users to control their computers using large language models (LLMs). By sending user requests to an LLM backend, it determines the necessary steps and executes them by simulating keyboard and mouse inputs. The system can adjust its actions based on real-time feedback, providing a self-driving computer experience.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    Hermes Agent

    Hermes Agent

    The agent that grows with you

    ...The agent interfaces with messaging platforms like Telegram, Discord, Slack, and WhatsApp through a single gateway process, and also offers an interactive terminal user interface with history, autocomplete, and streamable tool output. It supports scheduled automation in natural language, allowing users to set up recurring tasks such as daily briefings or system audits that it runs unattended.
    Downloads: 134 This Week
    Last Update:
    See Project
  • 3
    Hermes Web UI

    Hermes Web UI

    The best way to use Hermes Agent from the web or from your phone

    Hermes WebUI is a browser-based interface for interacting with the Hermes autonomous agent, providing full feature parity with its command-line experience. It offers a clean, multi-panel layout that includes chat interaction, session management, and workspace file browsing. The interface allows users to manage agent sessions, configure models, and interact with persistent memory systems directly from a web environment.
    Downloads: 24 This Week
    Last Update:
    See Project
  • 4
    OmniParser

    OmniParser

    A simple screen parsing tool towards pure vision based GUI agent

    OmniParser is a comprehensive method for parsing user interface screenshots into structured elements, significantly enhancing the ability of multimodal models like GPT-4 to generate actions accurately grounded in corresponding regions of the interface. It reliably identifies interactable icons within user interfaces and understands the semantics of various elements in a screenshot, associating intended actions with the correct screen regions.
    Downloads: 1 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    Agent Reach

    Agent Reach

    Give your AI agent eyes to see the entire internet

    ...It is designed for agent workflows that need to read and search sources like Twitter, Reddit, YouTube, GitHub, Bilibili, and XiaoHongShu without relying on paid platform APIs. The project focuses on giving AI tools broader visibility into social, video, code, and community data through one interface. It can be useful for research agents, automation workflows, competitive monitoring, content analysis, and social listening. Agent-Reach also supports modern agent environments by exposing capabilities that can be connected into larger AI workflows. Its main value is making scattered web platforms easier for agents to query, inspect, and summarize from the command line.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 6
    SWE-agent

    SWE-agent

    SWE-agent takes a GitHub issue and tries to automatically fix it

    ...We accomplish our results by designing simple LM-centric commands and feedback formats to make it easier for the LM to browse the repository, and view, edit, and execute code files. We call this an Agent-Computer Interface (ACI).
    Downloads: 4 This Week
    Last Update:
    See Project
  • 7
    Agent Zero

    Agent Zero

    Agent Zero AI framework

    ...Agents can communicate with their superiors and subordinates, asking questions, giving instructions, and providing guidance. Instruct your agents in the system prompt on how to communicate effectively. The terminal interface is real-time streamed and interactive. You can stop and intervene at any point. If you see your agent heading in the wrong direction, just stop and tell it right away. There is a lot of freedom in this framework. You can instruct your agents to regularly report back to superiors asking for permission to continue. You can instruct them to use point-scoring systems when deciding when to delegate subtasks. ...
    Downloads: 25 This Week
    Last Update:
    See Project
  • 8
    Magentic UI

    Magentic UI

    A research prototype of a human-centered web agent

    Magentic-UI is a research prototype developed by Microsoft that serves as a human-centered interface powered by a multi-agent system. It enables users to automate complex web tasks, such as browsing, form filling, and data analysis, while maintaining control over the process. The system emphasizes transparency and user involvement, making it suitable for tasks requiring both automation and human oversight.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    Letta

    Letta

    Letta (formerly MemGPT) is a framework for creating LLM services

    Letta is an AI-powered task automation framework designed to handle workflow automation, natural language commands, and AI-driven decision-making.
    Downloads: 6 This Week
    Last Update:
    See Project
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 10
    BambooAI

    BambooAI

    A Python library powered by Language Models (LLMs)

    BambooAI is a Python library powered by large language models (LLMs) for conversational data discovery and analysis, allowing users to interact with data through natural language.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 11
    Cua

    Cua

    Open-source infrastructure for Computer-Use Agents. Sandboxes

    Cua is an open-source command-line utility and workflow orchestrator designed to help developers define, compose, and run common tasks with a unified interface, promoting consistency and reuse across projects. It introduces a declarative syntax for specifying build scripts, automation pipelines, environment setups, and project-specific commands so contributors don’t need to memorize disparate scripts or tooling across languages and ecosystems. Cua can also manage task dependencies, handle cross-platform invocations, and simplify complex workflows into simple aliases or compound commands that are easy to share in teams. ...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 12
    Browser Use

    Browser Use

    Make websites accessible for AI agents

    Browser Use is an AI-powered browser automation framework designed to let agents interact with websites just like humans do. It enables developers and AI systems to perform complex online tasks such as form filling, data extraction, and navigation through natural language instructions. Built with Python and compatible with modern LLMs, it integrates seamlessly with tools like ChatBrowserUse, Google Gemini, and Anthropic models. The platform supports both open-source deployment and a fully...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 13
    AnyTool

    AnyTool

    AnyTool: Universal Tool-Use Layer for AI Agents

    AnyTool is an open-source universal tool-use layer for AI agents that addresses the critical problem of how autonomous agents reliably interact with external tools and environments. Rather than having each agent handle tool invocation logic on its own, AnyTool provides a standardized interface and orchestrator that intelligently selects and manages tools, reduces context overhead, and improves execution reliability across diverse capabilities like web APIs, local commands, and GUI automation. It uses progressive filtering and adaptive orchestration to ensure the right tools are retrieved efficiently and work cohesively with agents of varying complexity, scaling to thousands of tools with self-optimizing behavior. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    AgentForge

    AgentForge

    Extensible AGI Framework

    AgentForge is a framework for creating and deploying AI agents that can perform autonomous decision-making and task execution. It enables developers to define agent behaviors, train models, and integrate AI-powered automation into various applications.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 15
    Agent S

    Agent S

    Agent S: an open agentic framework that uses computers like a human

    Agent S is an open-source agentic framework designed to enable autonomous computer use through an Agent-Computer Interface (ACI). Built to operate graphical user interfaces like a human, it allows AI agents to perceive screens, reason about tasks, and execute actions across macOS, Windows, and Linux systems. The latest version, Agent S3, surpasses human-level performance on the OSWorld benchmark, demonstrating state-of-the-art results in complex multi-step computer tasks.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 16
    Sandstorm

    Sandstorm

    One API call, pull Claude agent, completely sandboxed

    ...The core idea is to provide “one API call” access to a robust Claude agent loop that runs inside a secure sandbox, so you can upload files, connect tools, and run long-running tasks — all managed behind a simple REST-style interface that disappears when the work is done. This approach lowers the friction of building autonomous agents by removing the need to provision servers, orchestrate distributed agents, or manage persistent tooling; agents can be spun up in parallel without manual setup and shut down when complete. The sandbox environment isolates agent execution for security and predictability, and project updates continue to harden observability, fault handling, and configuration validation.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 17
    Agent Stack

    Agent Stack

    Deploy and share agents with open infrastructure

    ...Under the hood it’s built on the “Agent2Agent” (A2A) protocol, enabling interoperability between different agent ecosystems, runtime services, and frameworks. The platform supports agents built in frameworks like LangChain, CrewAI, etc., enabling them to be hosted, managed and shared through a unified interface. It also offers multi-model, multi-provider support (OpenAI, Anthropic, Gemini, IBM WatsonX, Ollama etc.), letting users compare performance and cost across models. For developers and organizations building AI-agent products or automations, Agent Stack gives a scaffold that handles the “plumbing”, so they can focus on logic and domain.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 18
    Gemini Fullstack LangGraph Quickstart

    Gemini Fullstack LangGraph Quickstart

    Get started w/ building Fullstack Agents using Gemini 2.5 & LangGraph

    ...It then iteratively refines its search until it produces a comprehensive, well-cited answer synthesized by the Gemini model. The repository provides both a browser-based chat interface and a command-line script (cli_research.py) for executing research queries directly. For production deployment, the backend integrates with Redis and PostgreSQL to manage persistent memory, streaming outputs, & background task coordination.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19
    gpt-engineer

    gpt-engineer

    Full stack AI software engineer

    ...The platform allows users to specify software requirements in plain language, and the AI generates and executes the corresponding code. It can also handle improvements and iterative development, giving users more control over the software they’re building. Built with a terminal-based interface, gpt-engineer is customizable, enabling developers to experiment with AI-assisted programming and refine their development process. It is especially useful for automating the coding and iterative feedback loop in software development.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 20
    MAI-UI

    MAI-UI

    Real-World Centric Foundation GUI Agents

    MAI-UI is a cutting-edge open-source project that implements a family of foundation GUI (Graphical User Interface) agent models capable of interpreting natural language and performing real-world GUI navigation and control tasks across mobile and desktop environments. Developed by Tongyi-MAI (Alibaba’s research initiative), the MAI-UI models are multimodal agents trained to understand user instructions and corresponding screenshots, grounding those instructions to on-screen elements and generating sequences of GUI actions such as taps, swipes, text input, and system commands. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Mini Agent

    Mini Agent

    A minimal yet professional single agent demo project

    ...It is designed both as a reference implementation and as a usable agent, demonstrating a full execution loop that includes planning, tool calls, and iterative refinement. The project exposes an Anthropic-compatible API interface and fully supports interleaved thinking, letting the agent alternate between reasoning steps and tool invocations during long, complex tasks. It includes a basic toolset for file-system operations and shell commands, plus integrations with MCP tools such as web search and knowledge graph access. Mini-Agent also comes with “Claude Skills”-style predefined skills for tasks like document processing, design work, and testing, packaged as reusable behaviors that can be invoked by the agent as needed.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    AgentPilot

    AgentPilot

    A versatile workflow automation platform to create AI workflows

    AgentPilot is a versatile workflow automation platform designed to help users create, organize, and execute AI-driven workflows. It supports everything from simple tasks using a single large language model (LLM) to complex multi-step processes. The platform features a user-friendly interface that allows for real-time interaction with workflows, and it supports flexible configurations, including branching workflows and customizable user interfaces. Users can also schedule tasks based on natural language time expressions and integrate various tools to enhance their workflows.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    OWL

    OWL

    Optimized Workforce Learning for General Multi-Agent Assistance

    Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation. OWL (Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation) is an advanced framework designed to enhance multi-agent collaboration, improving task automation across various domains. By utilizing dynamic agent interactions, OWL aims to streamline and optimize complex workflows, making AI collaboration more natural, efficient, and adaptable. It is built on...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    SuperAGI

    SuperAGI

    A dev-first open source autonomous AI agent framework

    An open-source autonomous AI framework to enable you to develop and deploy useful autonomous agents quickly & reliably. Join a community of developers constantly contributing to make SuperAGI better. Access your agents through a graphical user interface. Interact with agents by giving them input, permissions, etc. Agents typically learn and improve their performance over time with feedback loops. Run multiple agents simultaneously to improve efficiency and productivity. Connect to multiple Vector DBs to enhance your agent’s performance. Each agent is unique, use different models of your choice. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    AI-Agent-Host

    AI-Agent-Host

    The AI Agent Host is a module-based development environment.

    ...The AI Agent Host is a module-based environment designed to facilitate rapid experimentation and testing. It includes a docker-compose configuration with QuestDB, Grafana, Code-Server and Nginx. The AI Agent Host provides a seamless interface for managing and querying data, visualizing results, and coding in real-time. The AI Agent Host is built specifically for LangChain, a framework dedicated to developing applications powered by language models. LangChain recognizes that the most powerful and distinctive applications go beyond simply utilizing a language model and strive to be data-aware and agentic. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
Auth0 Logo