Showing 199 open source projects for "html web browser"

View related business solutions
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    More flexibility. More control.

    Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 1
    LaVague

    LaVague

    Framework for building AI agents that automate complex web tasks

    LaVague is an open source framework designed to help developers build AI-powered web agents capable of automating tasks across websites and web applications. It implements the concept of a Large Action Model framework, allowing agents to interpret a user-provided objective and translate it into a sequence of actions performed in a browser. These agents can navigate web pages, retrieve information, fill out forms, and execute multi-step workflows automatically. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    Notte

    Notte

    Opensource browser using agents

    Notte is an open-source browser framework that enables the development and deployment of web-based AI agents. It introduces a perception layer that transforms web pages into structured, navigable maps described in natural language, allowing agents to interact with the internet more effectively. Notte is designed for building scalable and efficient browser-based AI applications.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    draw-a-ui

    draw-a-ui

    Draw wireframe sketches and generate HTML with AI vision models

    ...The result is an automated design-to-code workflow where rough interface ideas can quickly become functional web layouts. It is implemented as a modern web application built with Next.js and TypeScript and requires a compatible Node.js runtime to run locally. It is designed primarily as a demonstration of AI-assisted UI generation rather than a production-ready platform.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 4
    Playwright MCP

    Playwright MCP

    Playwright MCP server

    An MCP server developed by Microsoft that offers browser automation capabilities using Playwright, enabling LLMs to interact with web pages through structured accessibility snapshots without relying on visual data. ​
    Downloads: 5 This Week
    Last Update:
    See Project
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 5
    Nanobrowser

    Nanobrowser

    Open-Source Chrome extension for AI-powered web automation

    Nanobrowser is an open-source AI web automation tool that runs in your browser. A free alternative to OpenAI Operator with flexible LLM options and a multi-agent system. Nanobrowser, as a chrome extension, delivers premium web automation capabilities while keeping you in complete control. No subscription fees or hidden costs. Just install and use your own API keys, and you only pay what you use with your own API keys.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 6
    Vibium

    Vibium

    Browser automation for AI agents and humans

    ...This design makes it ideal for AI agents that need to interact with the web, perform tasks, or simulate human interactions in a browser environment, and it also works well for traditional testing and automation workflows. Vibium strikes a balance between AI-native capabilities and conventional developer usability by offering language bindings and client APIs for JavaScript and Python.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 7
    chrome-cdp

    chrome-cdp

    Give your AI agent access to your live Chrome session

    chrome-cdp-skill is a specialized integration that enables AI agents to control and interact with web browsers through the Chrome DevTools Protocol (CDP). It allows agents to perform tasks such as navigating pages, extracting data, interacting with elements, and executing scripts in a browser environment. The project is designed to extend the capabilities of AI systems beyond static knowledge by giving them real-time access to web content and interactive interfaces. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    HyperAgent

    HyperAgent

    AI Browser Automation

    HyperAgent is an open-source browser automation framework that combines large language models with modern browser scripting tools to create intelligent web automation agents. Built on top of Playwright, the framework allows developers to automate complex browser interactions using natural language commands rather than fragile selectors or hard-coded scripts. Instead of manually writing logic for clicking elements, extracting data, or navigating web pages, developers can instruct the agent in plain language and allow the AI layer to interpret and execute the task. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 9
    Magnitude

    Magnitude

    Vision AI browser agent for automation, testing, and extraction

    Browser Agent by Magnitude is an open source, vision-first browser automation framework that enables users to control web interfaces using natural language instructions. It leverages visually grounded AI models to interpret and interact with web pages based on what is seen on the screen rather than relying solely on the DOM structure. This approach allows the agent to generalize better across complex and modern websites, making it more robust than traditional selector-based automation tools. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Build Securely on Azure with Proven Frameworks Icon
    Build Securely on Azure with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 10
    BrowserNode

    BrowserNode

    Make websites accessible for AI agents. Automate tasks online

    Browsernode is an open-source TypeScript framework that allows AI agents to interact directly with web browsers in order to automate tasks and gather information from websites. The project acts as a bridge between AI models and browser automation tools, enabling language models to control web pages programmatically. Built as an implementation compatible with the Browser-use ecosystem, Browsernode allows agents to perform actions such as navigating pages, extracting information, filling forms, or interacting with dynamic web interfaces. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Index

    Index

    The SOTA Open-Source Browser Agent

    Index is an open-source browser automation agent designed to autonomously perform complex tasks across websites by transforming web interfaces into programmable APIs. The system enables developers to instruct an AI agent to interact with web pages using natural language rather than traditional automation scripts. Instead of writing detailed browser automation code, users can describe the desired task and allow the agent to interpret the page structure, interact with elements, and complete multi-step workflows automatically. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Pinchtab

    Pinchtab

    High-performance browser automation bridge and orchestrator

    Pinchtab is a lightweight browser automation backend built specifically for AI agents that need efficient, programmatic web control. Implemented as a small standalone HTTP server, it allows any agent or script to interact with web pages using simple API calls instead of heavyweight browser frameworks. The tool emphasizes accessibility-first snapshots that dramatically reduce token usage compared to screenshot-based approaches, making it cost-effective for large-scale automation. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    Playwriter

    Playwriter

    Chrome extension to let agents control your browser

    Playwriter is an open-source project that combines a Chrome extension with a CLI to allow autonomous agents to control a web browser directly using Playwright code in a stateful sandbox environment. The system enables browser automation by running Playwright commands through a persistent session managed by a background extension, allowing agents or scripts to navigate, interact with, and query browser contexts without losing state between commands. This makes it valuable for scenarios where AI agents need to perform complex web automation tasks—like multi-step navigation, form interaction, or content extraction—without reinitializing context or state every time. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    browserable

    browserable

    Open source and self-hostable browser automation library for AI agents

    ...It is designed to be self-hostable, which means developers can deploy and run it on their own infrastructure without relying on third-party services. The platform enables the creation of browser-based agents capable of performing complex online workflows such as data collection, research tasks, and automated interactions with web platforms.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Secret Llama

    Secret Llama

    Fully private LLM chatbot that runs entirely with a browser

    Secret Llama is a privacy-first large-language-model chatbot that runs entirely inside your web browser, meaning no server is required and your conversation data never leaves your device. It focuses on open-source model support, letting you load families like Llama and Mistral directly in the client for fully local inference. Because everything happens in-browser, it can work offline once models are cached, which is helpful for air-gapped environments or travel. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 16
    BrowserAI

    BrowserAI

    Run local LLMs like llama, deepseek, kokoro etc. inside your browser

    BrowserAI is a cutting-edge platform that allows users to run large language models (LLMs) directly in their web browser without the need for a server. It leverages WebGPU for accelerated performance and supports offline functionality, making it a highly efficient and privacy-conscious solution. The platform provides a developer-friendly SDK with pre-configured popular models, and it allows for seamless switching between MLC and Transformer engines.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    Easy Diffusion

    Easy Diffusion

    An easy 1-click way to create beautiful artwork on your PC using AI

    Easy Diffusion is a widely used community-driven repository offering a simple, one-click way to install and use Stable Diffusion-based generative AI on a personal computer without advanced technical skills or prior setup. It provides a browser-based user interface that runs locally, allowing users to type text prompts and immediately generate images directly within their web browser, democratizing access to powerful text-to-image models for artists and hobbyists alike. The project abstracts away environment setup, dependencies, and model installation — tasks that can be daunting to beginners — and instead lets users focus on creative experimentation with prompt phrasing, model parameters, and image output settings. ...
    Downloads: 47 This Week
    Last Update:
    See Project
  • 18
    Browserbase MCP Server

    Browserbase MCP Server

    Allow LLMs to control a browser with Browserbase and Stagehand

    Browserbase MCP Server is a server implementation of the Model Context Protocol (MCP) that enables large language models to interact with web browsers programmatically through cloud-based automation. The project provides a standardized interface for connecting AI systems to real-world web environments, allowing them to navigate pages, extract structured data, and perform user-like actions such as clicking, typing, and form submission. It leverages Browserbase infrastructure along with Stagehand to deliver high-performance browser automation with improved speed and efficiency through caching and optimized execution pipelines. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 19
    Happy Coder

    Happy Coder

    Mobile and Web client for Codex and Claude Code, with realtime voice

    Happy is an open-source, cross-platform mobile and web client designed to bring powerful AI coding agents such as Claude Code and Codex to your fingertips no matter where you are. At its core, Happy wraps existing AI coding tools with a unified interface, providing real-time voice interactions, encrypted communication, and seamless device switching between desktop and mobile. You can start a coding session locally through the Happy CLI or connect from a phone or browser, allowing developers to inspect, interact with, and guide the AI as it generates, tests, or explains code. ...
    Downloads: 37 This Week
    Last Update:
    See Project
  • 20
    BrowserGym

    BrowserGym

    A Gym environment for web task automation

    BrowserGym is an open framework for web task automation research that exposes browser interaction as a Gym-style environment for training and evaluating agents. It is intended for researchers building web agents rather than for end users looking for a consumer automation product. The project provides a common environment where agents can interact with websites, execute tasks, and be evaluated against standardized benchmarks.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 21
    WebMCP

    WebMCP

    Enabling web apps to get accessed by AI agents

    WebMCP is a proposed web standard that enables web applications to expose their functionality as JavaScript-based “tools” accessible to AI agents, browser assistants, and assistive technologies. It allows developers to define structured, natural-language-described functions directly in client-side code, effectively turning web pages into Model Context Protocol (MCP)-like servers running in the browser.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 22
    Groq AppGen

    Groq AppGen

    Project showcasing Llama 3.3 70B HTML codegen abilities

    Groq AppGen is an interactive web application (built with Next.js and TypeScript) that uses Groq’s LLM API to generate or modify web application code based on natural-language prompts. Essentially, you tell the app what kind of web app or page you want (in plain English), and groq-appgen will produce HTML/JSX code scaffolding, layout, and optionally application logic accordingly.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    Bolt.new

    Bolt.new

    Prompt, run, edit, and deploy full-stack web applications

    Bolt.new is an AI-powered full-stack development platform created by StackBlitz that enables users to build, run, edit, and deploy complete web applications directly from the browser without requiring any local setup or traditional development environment. It operates as an intelligent coding agent where users describe what they want to build in natural language, and the system generates functional applications, including frontend, backend, and infrastructure components. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    LLM Scraper

    LLM Scraper

    Extract structured data from webpages using LLM-powered scraping

    LLM Scraper is a TypeScript library designed to extract structured data from webpages using large language models. Instead of relying on fragile HTML selectors or manual parsing rules, the tool interprets webpage content with language models and converts it into structured data according to a defined schema. Developers can specify the data structure using tools such as Zod or JSON Schema, enabling the model to extract relevant information directly into typed objects. LLM Scraper integrates browser automation through Playwright, allowing it to load webpages and process their content before sending it to a language model for interpretation. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    PowerPoint-ist

    PowerPoint-ist

    Web presentation editor replicating many PowerPoint features online

    PPTist is a web-based presentation editing application designed to replicate many of the commonly used features found in traditional slide presentation software. It allows users to create, edit, and present slide decks directly within a web browser while maintaining a desktop-like editing experience. PPTist is built with Vue 3 and TypeScript and focuses on providing a highly interactive slide editing environment with extensive customization and extension potential. ...
    Downloads: 10 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB