Showing 160 open source projects for "web browser windows"

  • 1
    Browser Agent

    AI Browser Agent is an advanced Browser AI tool

    Browser Agent Python is an AI-powered browser automation tool developed by Oxylabs that enables users to control web interactions through natural language instead of traditional scripting. The tool allows developers to describe tasks in plain English, such as navigating pages, clicking elements, filling forms, and extracting data, and the system executes those actions as if a human were interacting with the browser.
    Downloads: 1 This Week
  • 2
    agent-browser

    Browser automation CLI for AI agents

    agent-browser is a toolkit that embeds AI agent capabilities directly into the web browser, enabling agents to interact with web content, scripts, and user actions while maintaining security boundaries that respect user privacy and browser constraints. It effectively provides a sandbox where AI agents can read, scroll, click, and interpret pages in context, allowing them to automate workflows, answer questions about page content, or generate structured summaries directly from the user’s current tab. ...
    Downloads: 3 This Week
  • 3
    Steel Browser

    Open Source Browser API for AI Agents & Apps

    Steel Browser is a privacy-focused web browser built with security and performance optimizations, designed to minimize tracking and enhance user control.
    Downloads: 2 This Week
  • 4
    Hermes Web UI

    The best way to use Hermes Agent from the web or from your phone

    Hermes Web UI is a browser-based interface for interacting with the Hermes autonomous agent, providing full feature parity with its command-line experience. It offers a clean, multi-panel layout that includes chat interaction, session management, and workspace file browsing. The interface allows users to manage agent sessions, configure models, and interact with persistent memory systems directly from a web environment.
    Downloads: 10 This Week
  • 5
    Browser Use

    Make websites accessible for AI agents

    ...The platform supports both open-source deployment and a fully hosted cloud version for enhanced scalability and performance. Its cloud offering includes advanced capabilities like stealth browsing, CAPTCHA solving, and proxy rotation for reliable automation. Overall, Browser Use transforms web interaction into an intelligent, programmable workflow driven by AI agents.
    Downloads: 0 This Week
  • 6
    Gemma 4 Browser Assistant

    On-device AI agent Chrome extension powered by Transformers.js

    ...It can access and analyze page content, browsing history, and tab state to provide contextual assistance. The architecture follows modern browser extension standards, with separate components for background processing, content scripts, and UI rendering. It also supports tool-calling capabilities, allowing the AI to perform actions such as navigating tabs or highlighting elements. Overall, it demonstrates how to build fully local, agent-based assistants inside web browsers.
    Downloads: 57 This Week
  • 7
    web-access

    Skill for installing full networking capabilities for Claude Code

    web-access is a tool designed to give AI agents structured and controlled access to web content, enabling them to retrieve, navigate, and process information from online sources in real time. It abstracts common web interactions such as page loading, data extraction, and navigation into reusable functions that can be invoked by agents. The system emphasizes safety and control, likely including mechanisms to manage permissions, rate limits, and content filtering. This allows agents to operate...
    Downloads: 3 This Week
  • 8
    Browserbase Skills

    Claude Agent SDK with a web browsing tool

    Browserbase Skills is a collection of reusable automation “skills” designed to enable AI agents to interact with web environments programmatically. It provides structured workflows that abstract browser actions such as navigation, form filling, and data extraction into composable building blocks. The system is intended to simplify the development of browser-based agents by offering prebuilt capabilities that can be orchestrated together. It integrates with headless browser infrastructure, allowing scalable automation across multiple sessions. ...
    Downloads: 3 This Week
  • 9
    Notte

    Open-source browser-using agents

    Notte is an open-source browser framework that enables the development and deployment of web-based AI agents. It introduces a perception layer that transforms web pages into structured, navigable maps described in natural language, allowing agents to interact with the internet more effectively. Notte is designed for building scalable and efficient browser-based AI applications.
    Downloads: 2 This Week
  • 10
    Nanobrowser

    Open-Source Chrome extension for AI-powered web automation

    Nanobrowser is an open-source AI web automation tool that runs in your browser. It is a free alternative to OpenAI Operator with flexible LLM options and a multi-agent system. As a Chrome extension, Nanobrowser delivers premium web automation capabilities while keeping you in complete control. There are no subscription fees or hidden costs: install it, supply your own API keys, and pay only for what you use.
    Downloads: 4 This Week
  • 11
    Vibium

    Browser automation for AI agents and humans

    ...This design makes it ideal for AI agents that need to interact with the web, perform tasks, or simulate human interactions in a browser environment, and it also works well for traditional testing and automation workflows. Vibium strikes a balance between AI-native capabilities and conventional developer usability by offering language bindings and client APIs for JavaScript and Python.
    Downloads: 4 This Week
  • 12
    chrome-cdp

    Give your AI agent access to your live Chrome session

    chrome-cdp-skill is a specialized integration that enables AI agents to control and interact with web browsers through the Chrome DevTools Protocol (CDP). It allows agents to perform tasks such as navigating pages, extracting data, interacting with elements, and executing scripts in a browser environment. The project is designed to extend the capabilities of AI systems beyond static knowledge by giving them real-time access to web content and interactive interfaces. ...
    Downloads: 1 This Week
  • 13
    Web Quality Skills

    Agent Skills for optimizing web quality based on Lighthouse

    This repository is a curated set of AI agent skills that encapsulate best practices for improving web quality, performance, accessibility, search engine optimization, and general best practices for web projects. It encodes knowledge drawn from Google Lighthouse audits, Core Web Vitals heuristics, WCAG accessibility guidelines, and real-world engineering experience, allowing coding agents to automatically assess and suggest improvements. These skills are framework-agnostic, meaning they apply...
    Downloads: 0 This Week
  • 14
    Pinchtab

    High-performance browser automation bridge and orchestrator

    Pinchtab is a lightweight browser automation backend built specifically for AI agents that need efficient, programmatic web control. Implemented as a small standalone HTTP server, it allows any agent or script to interact with web pages using simple API calls instead of heavyweight browser frameworks. The tool emphasizes accessibility-first snapshots that dramatically reduce token usage compared to screenshot-based approaches, making it cost-effective for large-scale automation. ...
    Downloads: 2 This Week
  • 15
    Playwriter

    Chrome extension to let agents control your browser

    ...Playwriter’s architecture supports both extension-based control for real browser windows and CLI integration, giving developers flexibility in how they build and run browser automation workflows.
    Downloads: 0 This Week
  • 16
    WebMCP

    Enabling web apps to get accessed by AI agents

    WebMCP is a proposed web standard that enables web applications to expose their functionality as JavaScript-based “tools” accessible to AI agents, browser assistants, and assistive technologies. It allows developers to define structured, natural-language-described functions directly in client-side code, effectively turning web pages into Model Context Protocol (MCP)-like servers running in the browser.
    Downloads: 2 This Week
  • 17
    OpenClaw

    Your own personal AI assistant. Any OS. Any Platform.

    OpenClaw (formerly Clawdbot/Moltbot) is an open-source, self-hosted autonomous AI assistant designed to run on user-controlled hardware and bridge conversational natural language with real-world task execution, effectively acting as a proactive digital assistant rather than a reactive chatbot. It lets you send instructions through familiar messaging platforms like WhatsApp, Telegram, Discord, Slack, Signal, iMessage, and more, and then interprets those instructions to carry out actions such...
    Downloads: 1,103 This Week
  • 18
    MolmoWeb

    Open multimodal web agent built by Ai2

    MolmoWeb is an open-source multimodal web agent designed to autonomously navigate and interact with web browsers using vision-language models, representing a significant step toward fully agentic AI systems that can operate in real-world digital environments. The system takes natural language instructions and translates them into sequences of browser actions such as clicking, typing, scrolling, and navigating, effectively performing tasks on behalf of the user. ...
    Downloads: 0 This Week
  • 19
    Anything Analyzer

    AI Agent/IDE | All-in-one protocol analysis toolkit

    Anything Analyzer is an all-in-one protocol analysis toolkit designed to inspect, intercept, and understand network traffic across modern web environments. It combines browser-based packet capture, MITM proxy capabilities, and JavaScript hooking into a unified interface for deep inspection of requests and responses. The tool supports fingerprint spoofing and behavioral simulation, allowing users to analyze how systems react under different conditions. It integrates AI-powered analysis to interpret captured data and provide insights into protocols and behaviors. ...
    Downloads: 5 This Week
  • 20
    Obscura

    The headless browser for AI agents and web scraping

    Obscura is a security-focused project aimed at providing tools and techniques for enhancing privacy, anonymity, and operational security in digital environments. It is designed for users who need to obscure their digital footprint and reduce traceability across systems. The project typically includes utilities for masking identity, managing secure communication, and mitigating surveillance risks. It emphasizes practical implementations of privacy-preserving workflows rather than purely...
    Downloads: 56 This Week
  • 21
    Suna

    Suna - Open Source Generalist AI Agent

    ...Designed to assist users in accomplishing real-world tasks through natural conversation, Suna combines powerful capabilities with an intuitive interface. It serves as a digital companion for research, data analysis, and everyday challenges, integrating tools like browser automation, file management, web crawling, command-line execution, website deployment, and API integration. Suna's architecture comprises a FastAPI-based backend, a Next.js/React frontend, an agent Docker environment, and a Supabase database for state management. This modular design allows for seamless interaction and task execution through simple conversations.
    Downloads: 5 This Week
  • 22
    html-ppt

    AgentSkill with 24 themes, 31 layouts, 20+ animations

    html-ppt-skill is an AI-oriented skill designed to generate presentation slides using HTML as the underlying structure instead of traditional PowerPoint formats. It enables users to create visually structured, web-based slide decks that can be rendered in browsers or converted into presentation formats. The system focuses on translating structured ideas into clean layouts, combining content organization with lightweight styling. It integrates well with AI workflows, allowing automated...
    Downloads: 6 This Week
  • 23
    OpenClaw Studio

    A clean web dashboard for OpenClaw

    OpenClaw Studio is a web-based dashboard designed to manage and interact with OpenClaw agents through a centralized interface. It allows users to connect to an OpenClaw Gateway, monitor agents, and control workflows from a single location. The platform provides real-time chat capabilities, approval management, and job configuration tools for agent operations.
    Downloads: 6 This Week
  • 24
    DenchClaw

    Fully Managed OpenClaw Framework for all knowledge work ever

    DenchClaw is a local-first AI-powered CRM and productivity platform built on top of the OpenClaw framework, designed to transform a user’s entire computer into a programmable, agent-driven workspace. Unlike traditional cloud-based CRMs or AI tools, it runs entirely on the user’s machine and exposes a web interface locally, allowing full control over data, workflows, and automation without relying on external servers. The system combines database management, browser automation, and AI reasoning into a unified interface where users can interact with their data and tools using natural language commands. It can ingest data from sources such as Google Drive, Notion, Gmail, and CRM platforms, consolidating everything into a centralized workspace for analysis and action. ...
    Downloads: 1 This Week
  • 25
    DeerFlow

    Deep Research framework, combining language models with tools

    DeerFlow is an open-source, community-driven “deep research” framework / multi-agent orchestration platform developed by ByteDance. It aims to combine the reasoning power of large language models (LLMs) with automated tool-use — such as web search, web crawling, Python execution, and data processing — to enable complex, end-to-end research workflows. Instead of a monolithic AI assistant, DeerFlow defines multiple specialized agents (e.g. “planner,” “searcher,” “coder,” “report generator”)...
    Downloads: 201 This Week