Showing 233 open source projects for "web browser windows"

View related business solutions
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Let your crypto work for you

    Put idle assets to work with competitive interest rates, borrow without selling, and trade with precision. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    WebLLM

    WebLLM

    Bringing large-language models and chat to web browsers

    WebLLM is a modular, customizable javascript package that directly brings language model chats directly onto web browsers with hardware acceleration. Everything runs inside the browser with no server support and is accelerated with WebGPU. We can bring a lot of fun opportunities to build AI assistants for everyone and enable privacy while enjoying GPU acceleration. WebLLM offers a minimalist and modular interface to access the chatbot in the browser.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    agent-browser

    agent-browser

    Browser automation CLI for AI agents

    agent-browser is a toolkit that embeds AI agent capabilities directly into the web browser, enabling agents to interact with web content, scripts, and user actions while maintaining security boundaries that respect user privacy and browser constraints. It effectively provides a sandbox where AI agents can read, scroll, click, and interpret pages in context, allowing them to automate workflows, answer questions about page content, or generate structured summaries directly from the user’s current tab. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 3
    Browser MCP

    Browser MCP

    Browser MCP is a Model Context Provider (MCP) server

    Browser MCP is an MCP server plus browser integration that lets AI apps automate the user’s real browser instead of launching a separate automation session. By adapting a Playwright-style approach to control the running browser profile, it reuses logged-in sessions and cookies, which reduces re-authentication friction and helps avoid some bot-detection heuristics.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    MCP Browser Kit

    MCP Browser Kit

    MCP Server for interacting with manifest v2 compatible browsers

    An MCP server that integrates with browser extensions to enable AI assistants to interact with the user's browser, allowing actions like starring repositories on GitHub through natural language commands. ​
    Downloads: 2 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 5
    Gemma 4 Browser Assistant

    Gemma 4 Browser Assistant

    On-device AI agent Chrome extension powered by Transformers.js

    ...It can access and analyze page content, browsing history, and tab state to provide contextual assistance. The architecture follows modern browser extension standards, with separate components for background processing, content scripts, and UI rendering. It also supports tool-calling capabilities, allowing the AI to perform actions such as navigating tabs or highlighting elements. Overall, it demonstrates how to build fully local, agent-based assistants inside web browsers.
    Downloads: 43 This Week
    Last Update:
    See Project
  • 6
    LaVague

    LaVague

    Framework for building AI agents that automate complex web tasks

    LaVague is an open source framework designed to help developers build AI-powered web agents capable of automating tasks across websites and web applications. It implements the concept of a Large Action Model framework, allowing agents to interpret a user-provided objective and translate it into a sequence of actions performed in a browser. These agents can navigate web pages, retrieve information, fill out forms, and execute multi-step workflows automatically. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7
    Playwright MCP

    Playwright MCP

    Playwright MCP server

    An MCP server developed by Microsoft that offers browser automation capabilities using Playwright, enabling LLMs to interact with web pages through structured accessibility snapshots without relying on visual data. ​
    Downloads: 6 This Week
    Last Update:
    See Project
  • 8
    RAG Web UI

    RAG Web UI

    RAG Web UI is an intelligent dialogue system based on RAG

    RAG Web UI is an open-source intelligent dialogue system built on retrieval-augmented generation technology, designed to enable users to create AI-powered question answering systems grounded in their own knowledge bases. It combines document retrieval with large language models to provide accurate, context-aware responses based on indexed data rather than generic model knowledge. The platform supports ingestion of multiple document formats, including PDFs, Word files, Markdown, and plain...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    Nanobrowser

    Nanobrowser

    Open-Source Chrome extension for AI-powered web automation

    Nanobrowser is an open-source AI web automation tool that runs in your browser. A free alternative to OpenAI Operator with flexible LLM options and a multi-agent system. Nanobrowser, as a chrome extension, delivers premium web automation capabilities while keeping you in complete control. No subscription fees or hidden costs. Just install and use your own API keys, and you only pay what you use with your own API keys.
    Downloads: 4 This Week
    Last Update:
    See Project
  • Add Two Lines of Code. Get Full APM. Icon
    Add Two Lines of Code. Get Full APM.

    AppSignal installs in minutes and auto-configures dashboards, alerts, and error tracking.

    Works out of the box for Rails, Django, Express, Phoenix, and more. Monitoring exceptions and performance in no time.
    Start Free
  • 10
    HyperAgent

    HyperAgent

    AI Browser Automation

    HyperAgent is an open-source browser automation framework that combines large language models with modern browser scripting tools to create intelligent web automation agents. Built on top of Playwright, the framework allows developers to automate complex browser interactions using natural language commands rather than fragile selectors or hard-coded scripts. Instead of manually writing logic for clicking elements, extracting data, or navigating web pages, developers can instruct the agent in plain language and allow the AI layer to interpret and execute the task. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 11
    BrowserNode

    BrowserNode

    Make websites accessible for AI agents. Automate tasks online

    Browsernode is an open-source TypeScript framework that allows AI agents to interact directly with web browsers in order to automate tasks and gather information from websites. The project acts as a bridge between AI models and browser automation tools, enabling language models to control web pages programmatically. Built as an implementation compatible with the Browser-use ecosystem, Browsernode allows agents to perform actions such as navigating pages, extracting information, filling forms, or interacting with dynamic web interfaces. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Magnitude

    Magnitude

    Vision AI browser agent for automation, testing, and extraction

    Browser Agent by Magnitude is an open source, vision-first browser automation framework that enables users to control web interfaces using natural language instructions. It leverages visually grounded AI models to interpret and interact with web pages based on what is seen on the screen rather than relying solely on the DOM structure. This approach allows the agent to generalize better across complex and modern websites, making it more robust than traditional selector-based automation tools. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    Playwriter

    Playwriter

    Chrome extension to let agents control your browser

    ...Playwriter’s architecture supports both extension-based control for real browser windows and CLI integration, giving developers flexibility in how they build and run browser automation workflows.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    BrowserAI

    BrowserAI

    Run local LLMs like llama, deepseek, kokoro etc. inside your browser

    BrowserAI is a cutting-edge platform that allows users to run large language models (LLMs) directly in their web browser without the need for a server. It leverages WebGPU for accelerated performance and supports offline functionality, making it a highly efficient and privacy-conscious solution. The platform provides a developer-friendly SDK with pre-configured popular models, and it allows for seamless switching between MLC and Transformer engines.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    Secret Llama

    Secret Llama

    Fully private LLM chatbot that runs entirely with a browser

    Secret Llama is a privacy-first large-language-model chatbot that runs entirely inside your web browser, meaning no server is required and your conversation data never leaves your device. It focuses on open-source model support, letting you load families like Llama and Mistral directly in the client for fully local inference. Because everything happens in-browser, it can work offline once models are cached, which is helpful for air-gapped environments or travel. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 16
    Happy Coder

    Happy Coder

    Mobile and Web client for Codex and Claude Code, with realtime voice

    Happy is an open-source, cross-platform mobile and web client designed to bring powerful AI coding agents such as Claude Code and Codex to your fingertips no matter where you are. At its core, Happy wraps existing AI coding tools with a unified interface, providing real-time voice interactions, encrypted communication, and seamless device switching between desktop and mobile. You can start a coding session locally through the Happy CLI or connect from a phone or browser, allowing developers to inspect, interact with, and guide the AI as it generates, tests, or explains code. ...
    Downloads: 36 This Week
    Last Update:
    See Project
  • 17
    Browserbase MCP Server

    Browserbase MCP Server

    Allow LLMs to control a browser with Browserbase and Stagehand

    Browserbase MCP Server is a server implementation of the Model Context Protocol (MCP) that enables large language models to interact with web browsers programmatically through cloud-based automation. The project provides a standardized interface for connecting AI systems to real-world web environments, allowing them to navigate pages, extract structured data, and perform user-like actions such as clicking, typing, and form submission. It leverages Browserbase infrastructure along with Stagehand to deliver high-performance browser automation with improved speed and efficiency through caching and optimized execution pipelines. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    Bolt.new

    Bolt.new

    Prompt, run, edit, and deploy full-stack web applications

    Bolt.new is an AI-powered full-stack development platform created by StackBlitz that enables users to build, run, edit, and deploy complete web applications directly from the browser without requiring any local setup or traditional development environment. It operates as an intelligent coding agent where users describe what they want to build in natural language, and the system generates functional applications, including frontend, backend, and infrastructure components. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    Hollama

    Hollama

    A minimal LLM chat app that runs entirely in your browser

    Hollama is a lightweight open-source chat application designed to run entirely within the browser while interacting with large language model servers. The project provides a minimal but powerful user interface for communicating with local or remote LLMs, including servers powered by Ollama or OpenAI-compatible APIs. Because the application runs as a static web interface, it does not require complex backend infrastructure and can be easily deployed or self-hosted.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    PowerPoint-ist

    PowerPoint-ist

    Web presentation editor replicating many PowerPoint features online

    PPTist is a web-based presentation editing application designed to replicate many of the commonly used features found in traditional slide presentation software. It allows users to create, edit, and present slide decks directly within a web browser while maintaining a desktop-like editing experience. PPTist is built with Vue 3 and TypeScript and focuses on providing a highly interactive slide editing environment with extensive customization and extension potential. ...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 21
    Deep Research Web UI

    Deep Research Web UI

    AI-powered research assistant that performs iterative, deep research

    Deep Research Web UI is an AI-powered research assistant interface designed to automate complex, multi-step information gathering workflows through a combination of search engines, web scraping, and large language models. It operates as a front-end system for deep research agents that iteratively refine queries, retrieve information from multiple sources, and synthesize structured outputs into coherent reports. The platform emphasizes long-horizon reasoning, allowing users to explore topics...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    NextJS Ollama LLM UI

    NextJS Ollama LLM UI

    Fully-featured web interface for Ollama LLMs

    NextJS Ollama LLM UI is a web-based frontend interface built with Next.js to make interacting with Ollama-hosted large language models easy and fast. Its goal is to remove the complexity of setting up and managing UI components for local or offline LLM usage by providing a straightforward chat experience with support for responsive layouts, light and dark themes, and local chat history storage in the browser.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Live API Web Console

    Live API Web Console

    A react-based starter app for using the Live API over websockets

    Live API Web Console is a React starter that demonstrates how to use Gemini’s Live API over WebSockets to build real-time, multimodal experiences. The app includes modules for streaming audio playback, recording user media from the microphone, webcam, or even screen capture, and it surfaces a unified event log so you can debug the session as it flows. Configuration lives in a simple .env file and the project boots with standard web tooling, letting you experiment quickly with models, system...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    OpenAI.fm

    OpenAI.fm

    Code for openai.fm, a demo for the OpenAI Speech API

    ...Users can experiment with different input text and voice options directly in their browser, gaining a sense of how high-fidelity AI audio can be integrated into applications ranging from podcasts and narration to accessibility tools and interactive agents. Although the web demo is free to explore, production use of the underlying API requires an OpenAI API key and may incur costs based on usage.
    Downloads: 22 This Week
    Last Update:
    See Project
  • 25
    Suno AI API

    Suno AI API

    Use API to call the music generation AI of suno.ai

    ...One of its distinguishing features is its ability to bypass common web limitations by integrating CAPTCHA-solving services and browser automation, which allows continuous access to the platform.
    Downloads: 6 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB