Showing 65 open source projects for "html web browser"

View related business solutions
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • Stop Storing Third-Party Tokens in Your Database Icon
    Stop Storing Third-Party Tokens in Your Database

    Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

    Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.
    Try Auth0 for Free
  • 1
    html-ppt

    html-ppt

    AgentSkill with 24 themes, 31 layouts, 20+ animations

    html-ppt-skill is an AI-oriented skill designed to generate presentation slides using HTML as the underlying structure instead of traditional PowerPoint formats. It enables users to create visually structured, web-based slide decks that can be rendered in browsers or converted into presentation formats. The system focuses on translating structured ideas into clean layouts, combining content organization with lightweight styling.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 2
    camofox-browser

    camofox-browser

    Headless browser automation server for AI agents to visit sites

    camofox-browser is a headless browser automation server built specifically for AI agents that need to interact with websites that often block standard automation stacks. It wraps Camoufox, a Firefox fork that performs fingerprint spoofing at the C++ level, which means many browser characteristics are altered before page scripts can inspect them, rather than relying on JavaScript-layer stealth patches.
    Downloads: 14 This Week
    Last Update:
    See Project
  • 3
    Steel Browser

    Steel Browser

    Open Source Browser API for AI Agents & Apps

    Steel Browser is a privacy-focused web browser built with security and performance optimizations, designed to minimize tracking and enhance user control.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 4
    MCP Server RAG Web Browser

    MCP Server RAG Web Browser

    A MCP Server for the RAG Web Browser Actor

    The MCP Server for the RAG Web Browser Actor allows AI assistants and LLMs to perform web searches and extract information from web pages. It facilitates interaction with the web, enabling up-to-date context retrieval for AI applications. ​
    Downloads: 0 This Week
    Last Update:
    See Project
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 5
    Stable Diffusion web UI for AMDGPUs

    Stable Diffusion web UI for AMDGPUs

    Stable Diffusion WebUI optimized for AMD GPUs with editing tools

    Stable Diffusion WebUI AMDGPU is a browser-based interface for generating images using Stable Diffusion, built with Gradio and adapted for AMD graphics hardware. It provides both text-to-image and image-to-image workflows, allowing users to create, refine, and upscale visuals within a single interface. It includes tools such as inpainting and outpainting for editing specific areas of an image, along with features like prompt matrix generation and attention controls to fine-tune outputs....
    Downloads: 9 This Week
    Last Update:
    See Project
  • 6
    Browserbase Skills

    Browserbase Skills

    Claude Agent SDK with a web browsing tool

    Browserbase Skills is a collection of reusable automation “skills” designed to enable AI agents to interact with web environments programmatically. It provides structured workflows that abstract browser actions such as navigation, form filling, and data extraction into composable building blocks. The system is intended to simplify the development of browser-based agents by offering prebuilt capabilities that can be orchestrated together. It integrates with headless browser infrastructure, allowing scalable automation across multiple sessions. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    MCP Server Playwright

    MCP Server Playwright

    MCP server for browser automation using Playwright

    An MCP (Model Context Protocol) server that leverages Playwright to provide browser automation capabilities, enabling large language models (LLMs) to interact with web pages, take screenshots, and execute JavaScript within a real browser environment. ​
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    LaVague

    LaVague

    Framework for building AI agents that automate complex web tasks

    LaVague is an open source framework designed to help developers build AI-powered web agents capable of automating tasks across websites and web applications. It implements the concept of a Large Action Model framework, allowing agents to interpret a user-provided objective and translate it into a sequence of actions performed in a browser. These agents can navigate web pages, retrieve information, fill out forms, and execute multi-step workflows automatically. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 9
    chrome-cdp

    chrome-cdp

    Give your AI agent access to your live Chrome session

    chrome-cdp-skill is a specialized integration that enables AI agents to control and interact with web browsers through the Chrome DevTools Protocol (CDP). It allows agents to perform tasks such as navigating pages, extracting data, interacting with elements, and executing scripts in a browser environment. The project is designed to extend the capabilities of AI systems beyond static knowledge by giving them real-time access to web content and interactive interfaces. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 10
    Magnitude

    Magnitude

    Vision AI browser agent for automation, testing, and extraction

    Browser Agent by Magnitude is an open source, vision-first browser automation framework that enables users to control web interfaces using natural language instructions. It leverages visually grounded AI models to interpret and interact with web pages based on what is seen on the screen rather than relying solely on the DOM structure. This approach allows the agent to generalize better across complex and modern websites, making it more robust than traditional selector-based automation tools. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    browserable

    browserable

    Open source and self-hostable browser automation library for AI agents

    ...It is designed to be self-hostable, which means developers can deploy and run it on their own infrastructure without relying on third-party services. The platform enables the creation of browser-based agents capable of performing complex online workflows such as data collection, research tasks, and automated interactions with web platforms.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Easy Diffusion

    Easy Diffusion

    An easy 1-click way to create beautiful artwork on your PC using AI

    Easy Diffusion is a widely used community-driven repository offering a simple, one-click way to install and use Stable Diffusion-based generative AI on a personal computer without advanced technical skills or prior setup. It provides a browser-based user interface that runs locally, allowing users to type text prompts and immediately generate images directly within their web browser, democratizing access to powerful text-to-image models for artists and hobbyists alike. The project abstracts away environment setup, dependencies, and model installation — tasks that can be daunting to beginners — and instead lets users focus on creative experimentation with prompt phrasing, model parameters, and image output settings. ...
    Downloads: 44 This Week
    Last Update:
    See Project
  • 13
    BrowserTools MCP

    BrowserTools MCP

    Monitor browser logs directly from Cursor

    Browser Tools MCP is an MCP server and Chrome extension that gives AI agents safe, structured access to your live browser for debugging and automation. It can capture console/network logs, DOM snapshots, and screenshots, and expose them as typed resources the agent can query or act on. The design aims to make IDE agents (e.g., Cursor, Claude Desktop) more “web-aware,” enabling workflows like reproducing a bug, collecting evidence, and proposing fixes without copy-pasting. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14
    NSFWJS

    NSFWJS

    Client-side indecent content checking powered by TensorFlow.js

    ...NSFWJS offers a 'browserified' version, an NSFW filter web extension that filters out NSFW images from your browser, and also has a separate React Native app.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 15
    xgplayer

    xgplayer

    A HTML5 video player with a parser that saves traffic

    xgplayer is a web-friendly, open-source media player library maintained by ByteDance, designed for playing audio/video streams in browsers or web applications with robust control, flexibility, and extensibility. It abstracts many of the lower-level complexities of HTML5 media, providing a consistent API for playback control, custom UI overlays, adaptive streaming, plugin hooks, and cross-browser compatibility.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Color Thief

    Color Thief

    Grab the color palette from an image using just Javascript

    The Color Thief package includes multiple distribution files to support different environments and build processes. Gets the dominant color from the image. Color is returned as an array of three integers representing red, green, and blue values. When called in the browser, the image argument expects an HTML image element, not a URL. When run in Node, this argument expects a path to the image. quality is an optional argument that must be an Integer of value 1 or greater, and defaults to 10. The number determines how many pixels are skipped before the next one is sampled. We rarely need to sample every single pixel in the image to get good results. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    Scribe.js

    Scribe.js

    JavaScript OCR and text extraction for images and PDFs

    ...The library can take image files (such as PNG or JPEG) and recognize the text they contain, and it can also extract text from PDF files that either already contain text or are image-based scans, using modern web standards and WebAssembly under the hood. In addition to simple text extraction, Scribe.js supports writing or injecting a high-quality invisible text layer back into PDFs, effectively making them searchable and improving usability for indexing or accessibility. It is written in modern ECMAScript Modules (ESM), so it can be imported in both browser and Node.js environments without a build step, though browser usage requires same-origin hosting of the files.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    Every Code

    Every Code

    Local AI coding agent CLI with multi-agent orchestration tools

    Every Code (often referred to simply as Code) is a fast, local AI-powered coding agent designed to run directly in the terminal environment. It is a community-driven fork of the Codex CLI, with a strong emphasis on improving real-world developer ergonomics and workflows. Every Code enhances the traditional coding assistant model by introducing multi-agent orchestration, allowing multiple AI agents to collaborate, compare solutions, and refine outputs in parallel. It supports integration with...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 19
    AstronRPA

    AstronRPA

    Agent-ready RPA suite with visual workflow automation tools engine

    Astron RPA is an enterprise-grade robotic process automation platform designed to help organizations and developers build automated workflows for desktop and web applications. It provides a visual workflow designer that supports low-code and no-code development, allowing users to create automation processes through a drag-and-drop interface instead of writing extensive code. It enables automation of common desktop software and browser-based tasks, making it suitable for repetitive business operations and system integrations. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 20
    KeepChatGPT

    KeepChatGPT

    Browser userscript that enhances ChatGPT reliability and usability

    KeepChatGPT is an open source browser userscript designed to enhance the reliability, usability, and efficiency of the ChatGPT web interface. It runs through userscript managers and injects additional functionality directly into the page, allowing users to improve their workflow without requiring a backend service or separate application. It focuses on solving common problems experienced during AI conversations, such as session timeouts, network errors, message failures, and interruptions during long chats. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    SwarmUI

    SwarmUI

    Modular AI image and video generation web UI with extensible tools

    SwarmUI is a modular web-based user interface designed for AI-driven image generation, with a strong focus on usability, performance, and extensibility. It serves as a unified environment for working with multiple AI models, including Stable Diffusion and newer image and video generation systems, allowing users to create and manage outputs through a browser interface.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 22
    StableSwarmUI

    StableSwarmUI

    Multi-user UI for managing and running Stable Diffusion workflows tool

    StableSwarmUI is a web-based interface designed to manage and coordinate Stable Diffusion image generation workflows in a multi-user environment. It focuses on enabling multiple users to interact with shared resources, making it suitable for collaborative or server-based deployments. It provides a centralized system where users can submit, monitor, and manage generation tasks through a browser interface.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    DocsGPT

    DocsGPT

    Private AI platform for agents, enterprise search and RAG pipelines

    DocsGPT is an open-source AI platform for deploying private RAG pipelines, AI agents, and enterprise search on your own infrastructure. Connect any data source (PDFs, DOCX, CSV, Excel, HTML, audio, GitHub, databases, URLs) and get accurate, hallucination-free answers with source citations. Choose your LLM: OpenAI, Anthropic, Google Gemini, or local models. Works with Qdrant, MongoDB, and Elasticsearch and more. Deploy via Docker or Kubernetes with full data sovereignty. Build...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 24
    Midscene

    Midscene

    Vision-based AI framework for cross-platform UI automation tasks

    Midscene.js is an open source AI-driven UI automation framework designed to control user interfaces across multiple platforms using natural language instructions. Instead of relying on traditional selectors, DOM structures, or accessibility attributes, it uses a vision-first approach where screenshots are analyzed by visual-language models to identify interface elements and perform actions. It allows developers to automate interactions on web applications, desktop software, and mobile...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 25
    Search-Index

    Search-Index

    A persistent, network resilient, full text search library

    Search-Index is a lightweight and fast JavaScript-based search engine that enables full-text search indexing and retrieval for web applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
MongoDB Logo MongoDB