Showing 48 open source projects for "web browsers"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 1
    WebLLM

    WebLLM

    Bringing large-language models and chat to web browsers

    WebLLM is a modular, customizable javascript package that directly brings language model chats directly onto web browsers with hardware acceleration. Everything runs inside the browser with no server support and is accelerated with WebGPU. We can bring a lot of fun opportunities to build AI assistants for everyone and enable privacy while enjoying GPU acceleration. WebLLM offers a minimalist and modular interface to access the chatbot in the browser.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 2
    Lightpanda Browser

    Lightpanda Browser

    Lightpanda: the headless browser designed for AI and automation

    Lightpanda is an open-source headless browser designed specifically for automation, artificial intelligence workflows, and large-scale web interaction tasks. Unlike traditional browsers that include full graphical rendering engines meant for human users, Lightpanda is built from scratch to operate entirely in headless mode, focusing only on the components required for programmatic web interaction. This design allows it to execute JavaScript and interact with web pages while avoiding the overhead associated with rendering images, fonts, and layout elements intended for visual display. ...
    Downloads: 27 This Week
    Last Update:
    See Project
  • 3
    Gemma 4 Browser Assistant

    Gemma 4 Browser Assistant

    On-device AI agent Chrome extension powered by Transformers.js

    ...It also supports tool-calling capabilities, allowing the AI to perform actions such as navigating tabs or highlighting elements. Overall, it demonstrates how to build fully local, agent-based assistants inside web browsers.
    Downloads: 63 This Week
    Last Update:
    See Project
  • 4
    MCP Browser Kit

    MCP Browser Kit

    MCP Server for interacting with manifest v2 compatible browsers

    An MCP server that integrates with browser extensions to enable AI assistants to interact with the user's browser, allowing actions like starring repositories on GitHub through natural language commands. ​
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8 Monitoring Tools in One APM. Install in 5 Minutes. Icon
    8 Monitoring Tools in One APM. Install in 5 Minutes.

    Errors, performance, logs, uptime, hosts, anomalies, dashboards, and check-ins. One interface.

    AppSignal works out of the box for Ruby, Elixir, Node.js, Python, and more. 30-day free trial, no credit card required.
    Start Free
  • 5
    BrowserNode

    BrowserNode

    Make websites accessible for AI agents. Automate tasks online

    Browsernode is an open-source TypeScript framework that allows AI agents to interact directly with web browsers in order to automate tasks and gather information from websites. The project acts as a bridge between AI models and browser automation tools, enabling language models to control web pages programmatically. Built as an implementation compatible with the Browser-use ecosystem, Browsernode allows agents to perform actions such as navigating pages, extracting information, filling forms, or interacting with dynamic web interfaces. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    MLC LLM

    MLC LLM

    Universal LLM Deployment Engine with ML Compilation

    MLC LLM is a machine learning compiler and deployment framework designed to enable efficient execution of large language models across a wide range of hardware platforms. The project focuses on compiling models into optimized runtimes that can run natively on devices such as GPUs, mobile processors, browsers, and edge hardware. By leveraging machine learning compilation techniques, mlc-llm produces high-performance inference engines that maintain consistent APIs across platforms. The system supports deployment on environments including Linux, macOS, Windows, iOS, Android, and web browsers while utilizing different acceleration technologies such as CUDA, Vulkan, Metal, and WebGPU. ...
    Downloads: 31 This Week
    Last Update:
    See Project
  • 7
    chrome-cdp

    chrome-cdp

    Give your AI agent access to your live Chrome session

    chrome-cdp-skill is a specialized integration that enables AI agents to control and interact with web browsers through the Chrome DevTools Protocol (CDP). It allows agents to perform tasks such as navigating pages, extracting data, interacting with elements, and executing scripts in a browser environment. The project is designed to extend the capabilities of AI systems beyond static knowledge by giving them real-time access to web content and interactive interfaces. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 8
    wllama

    wllama

    WebAssembly binding for llama.cpp - Enabling on-browser LLM inference

    wllama is a WebAssembly-based library that enables large language model inference directly inside a web browser. Built as a binding for the llama.cpp inference engine, the project allows developers to run LLM models locally without requiring a server backend or dedicated GPU hardware. The library leverages WebAssembly SIMD capabilities to achieve efficient execution within modern browsers while maintaining compatibility across platforms.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 9
    Pinchtab

    Pinchtab

    High-performance browser automation bridge and orchestrator

    Pinchtab is a lightweight browser automation backend built specifically for AI agents that need efficient, programmatic web control. Implemented as a small standalone HTTP server, it allows any agent or script to interact with web pages using simple API calls instead of heavyweight browser frameworks. The tool emphasizes accessibility-first snapshots that dramatically reduce token usage compared to screenshot-based approaches, making it cost-effective for large-scale automation. It launches...
    Downloads: 3 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 10
    Open ChatGPT Atlas

    Open ChatGPT Atlas

    Open Source and Free Alternative to ChatGPT Atlas

    Open ChatGPT Atlas is an open-source toolkit and interface for working with OpenAI’s ChatGPT models in a more extensible, adaptable, and composable way than standard web UIs allow. It provides an architecture where developers and power users can manage state, tool integrations, and multi-turn workflows with more control, enabling custom UIs, automation layers, and advanced routing logic. Unlike a fixed chat app, Atlas is designed as a foundation that can be extended with plugins, external...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 11
    Just the Browser

    Just the Browser

    Remove AI features, telemetry data reporting, sponsored content

    Just the Browser is a configuration and automation tool that helps users strip away unwanted features from mainstream web browsers like Chrome, Edge, and Firefox, focusing on removing AI integrations, telemetry reporting, sponsored content, and other built-in annoyances so that the browser behaves more like a pure web client. Instead of modifying browser binaries, it applies supported group policies and configuration files that disable intrusive UI elements, data collection features, default pop-ups, and integrated services, giving users more control over privacy and interface simplicity. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    BrowserOS

    BrowserOS

    Agentic browser; privacy-first alternative to ChatGPT Atlas

    BrowserOS is an open-source, agentic web browser built on a Chromium base that integrates AI agents directly into the browsing experience. Rather than just doing standard browsing, it places AI intelligence at the core: you can connect your own API keys (for e.g., OpenAI, Anthropic, Google Gemini) or run local models (via e.g., Ollama) so that your browsing data and automation stay on your machine — privacy and control are emphasized throughout. The interface remains familiar to users of...
    Downloads: 20 This Week
    Last Update:
    See Project
  • 13
    Emscripten

    Emscripten

    Emscripten: An LLVM-to-WebAssembly Compiler

    Emscripten is a complete open-source compiler toolchain that transforms C, C++, and other LLVM-based source code into WebAssembly (and JavaScript), enabling native‑like applications to run in web browsers, Node.js, and other Wasm environments. While Emscripten mostly focuses on compiling C and C++ using Clang, it can be integrated with other LLVM-using compilers (for example, Rust has Emscripten integration, with the wasm32-unknown-emscripten and asmjs-unknown-emscripten targets). Emscripten provides Web support for popular portable APIs such as OpenGL and SDL2, allowing complex graphical native applications to be ported, such as the Unity game engine and Google Earth. ...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 14
    html-ppt

    html-ppt

    AgentSkill with 24 themes, 31 layouts, 20+ animations

    html-ppt-skill is an AI-oriented skill designed to generate presentation slides using HTML as the underlying structure instead of traditional PowerPoint formats. It enables users to create visually structured, web-based slide decks that can be rendered in browsers or converted into presentation formats. The system focuses on translating structured ideas into clean layouts, combining content organization with lightweight styling. It integrates well with AI workflows, allowing automated generation of presentations from prompts or documents. The approach emphasizes flexibility, enabling customization through standard web technologies like HTML and CSS. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 15
    Browserbase MCP Server

    Browserbase MCP Server

    Allow LLMs to control a browser with Browserbase and Stagehand

    Browserbase MCP Server is a server implementation of the Model Context Protocol (MCP) that enables large language models to interact with web browsers programmatically through cloud-based automation. The project provides a standardized interface for connecting AI systems to real-world web environments, allowing them to navigate pages, extract structured data, and perform user-like actions such as clicking, typing, and form submission. It leverages Browserbase infrastructure along with Stagehand to deliver high-performance browser automation with improved speed and efficiency through caching and optimized execution pipelines. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    xgplayer

    xgplayer

    A HTML5 video player with a parser that saves traffic

    xgplayer is a web-friendly, open-source media player library maintained by ByteDance, designed for playing audio/video streams in browsers or web applications with robust control, flexibility, and extensibility. It abstracts many of the lower-level complexities of HTML5 media, providing a consistent API for playback control, custom UI overlays, adaptive streaming, plugin hooks, and cross-browser compatibility.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    MolmoWeb

    MolmoWeb

    Open multimodal web agent built by Ai2

    MolmoWeb is an open-source multimodal web agent designed to autonomously navigate and interact with web browsers using vision-language models, representing a significant step toward fully agentic AI systems that can operate in real-world digital environments. The system takes natural language instructions and translates them into sequences of browser actions such as clicking, typing, scrolling, and navigating, effectively performing tasks on behalf of the user. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Midscene

    Midscene

    Vision-based AI framework for cross-platform UI automation tasks

    Midscene.js is an open source AI-driven UI automation framework designed to control user interfaces across multiple platforms using natural language instructions. Instead of relying on traditional selectors, DOM structures, or accessibility attributes, it uses a vision-first approach where screenshots are analyzed by visual-language models to identify interface elements and perform actions. It allows developers to automate interactions on web applications, desktop software, and mobile...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 19
    Atomic Chat

    Atomic Chat

    Atomic-Chat is an open source alternative to ChatGPT

    Atomic Chat is part of the AtomicBot ecosystem and represents a chat-based interface layer for interacting with autonomous AI agents powered by the OpenClaw framework, enabling users to communicate with and control an AI assistant through natural language conversations. The system is designed to go beyond traditional chatbots by allowing the assistant to take real actions across connected tools such as email, calendars, browsers, and file systems, effectively functioning as a task-executing agent rather than a passive responder. It integrates with multiple messaging platforms and communication channels, allowing users to interact with the assistant from environments like web interfaces, messaging apps, or desktop clients. The architecture supports multi-model AI usage, meaning users can choose between different large language models depending on their needs, including cloud-based or local deployments.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 20
    MediaPipe Solutions

    MediaPipe Solutions

    Cross-platform, customizable ML solutions

    ...The system provides developers with tools and reusable components that allow them to combine multiple machine learning models with preprocessing and postprocessing logic into efficient perception pipelines. These pipelines can run on a wide variety of platforms including mobile devices, desktop systems, web browsers, and embedded edge devices. MediaPipe is widely used in computer vision and multimedia applications such as hand tracking, face detection, pose estimation, object recognition, and gesture analysis. The framework includes prebuilt solutions that developers can quickly integrate into applications as well as lower-level APIs that allow custom pipeline construction.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    Composio

    Composio

    Composio equip's your AI agents & LLMs

    Empower your AI agents with Composio - a platform for managing and integrating tools with LLMs & AI agents using Function Calling. Equip your agent with high-quality tools & integrations without worrying about authentication, accuracy, and reliability in a single line of code.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 22
    Vibium

    Vibium

    Browser automation for AI agents and humans

    Vibium is an open-source browser automation infrastructure built to serve both AI agents and human developers by simplifying control and interaction with real browsers. It integrates a single lightweight binary that manages browser lifecycle, implements a WebDriver BiDi proxy, and exposes a Model Context Protocol (MCP) server so language models or automation clients can control browser behavior without complex setup. This design makes it ideal for AI agents that need to interact with the web, perform tasks, or simulate human interactions in a browser environment, and it also works well for traditional testing and automation workflows. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 23
    Groq TypeScript / Node.s

    Groq TypeScript / Node.s

    The official Node.js / Typescript library for the Groq API

    Groq TypeScript / Node.s (also often referred to as “groq-sdk” on npm) is the official Node.js / TypeScript client library for Groq’s REST API, enabling JavaScript/TypeScript developers to integrate LLM and AI-powered services into web backends, serverless functions, or frontend apps. It exports strongly-typed interfaces for models, chat completions, file uploads (e.g. for audio transcription), and other endpoints, allowing for better type safety and developer experience when using Groq from...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    Gradio

    Gradio

    Create UIs for your machine learning model in Python in 3 minutes

    ...One of the best ways to share your machine learning model, API, or data science workflow with others is to create an interactive demo that allows your users or colleagues to try out the demo in their browsers.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 25
    Portia SDK Python

    Portia SDK Python

    Portia Labs Python SDK for building agentic workflows

    portia‑sdk‑python is an open-source Python SDK by Portia Labs for creating reliable, stateful, authenticated multi-agent AI workflows. It supports tool-backed agents capable of real-world interactions—like web browsing, API access, and human-in-the-loop clarifications—while maintaining transparency and auditability through structured plans and execution hooks. Designed for production environments, the SDK integrates with local or cloud LLMs (e.g. OpenAI, Anthropic, Mistral, Gemini) and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB