Showing 134 open source projects for "ai browser"

View related business solutions
  • Host LLMs in Production With On-Demand GPUs Icon
    Host LLMs in Production With On-Demand GPUs

    NVIDIA L4 GPUs. 5-second cold starts. Scale to zero when idle.

    Deploy your model, get an endpoint, pay only for compute time. No GPU provisioning or infrastructure management required.
    Try Free
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 1
    Browser Use

    Browser Use

    Make websites accessible for AI agents

    Browser-Use is a framework that makes websites accessible for AI agents, enabling automated interactions and data extraction from web pages.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 2
    Steel Browser

    Steel Browser

    Open Source Browser API for AI Agents & Apps

    Steel Browser is a privacy-focused web browser built with security and performance optimizations, designed to minimize tracking and enhance user control.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    agent-browser

    agent-browser

    Browser automation CLI for AI agents

    agent-browser is a toolkit that embeds AI agent capabilities directly into the web browser, enabling agents to interact with web content, scripts, and user actions while maintaining security boundaries that respect user privacy and browser constraints. It effectively provides a sandbox where AI agents can read, scroll, click, and interpret pages in context, allowing them to automate workflows, answer questions about page content, or generate structured summaries directly from the user’s current tab. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 4
    Just the Browser

    Just the Browser

    Remove AI features, telemetry data reporting, sponsored content

    Just the Browser is a configuration and automation tool that helps users strip away unwanted features from mainstream web browsers like Chrome, Edge, and Firefox, focusing on removing AI integrations, telemetry reporting, sponsored content, and other built-in annoyances so that the browser behaves more like a pure web client. Instead of modifying browser binaries, it applies supported group policies and configuration files that disable intrusive UI elements, data collection features, default pop-ups, and integrated services, giving users more control over privacy and interface simplicity. ...
    Downloads: 10 This Week
    Last Update:
    See Project
  • Go From Idea to Deployed AI App Fast Icon
    Go From Idea to Deployed AI App Fast

    One platform to build, fine-tune, and deploy. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 5
    AI Employe

    AI Employe

    Create browser automation as if you were teaching a human using GPT-4

    Try without Firebase authentication (temporary solution). Our stack consists of Next.js, Rust, Postgres, MeiliSearch, and Firebase Auth for authentication. Please sign up for a Firebase account and create a project. There are several techniques for this, ranging from sending a shortened form of HTML to GPT-3, creating a bounding box with IDs and sending it to GPT-4-vision to take actions, or directly asking GPT-4-vision to obtain the X and Y coordinates of the element. However, none of these...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    MCP Browser Kit

    MCP Browser Kit

    MCP Server for interacting with manifest v2 compatible browsers

    An MCP server that integrates with browser extensions to enable AI assistants to interact with the user's browser, allowing actions like starring repositories on GitHub through natural language commands. ​
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Dev Browser

    Dev Browser

    A Claude Skill to give your agent the ability to use a web browser

    Dev Browser is a browser automation skill/plugin that enables an AI agent to control a real browser for verification and testing during development. Its purpose is to close the gap between “code was written” and “the UI actually works,” by letting the agent navigate, interact with pages, and validate behavior in a live environment. A key idea is persistence: the browser can keep pages open so the agent can navigate once and then perform multiple interactions across scripts without losing state each time. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 8
    Browser MCP

    Browser MCP

    Browser MCP is a Model Context Provider (MCP) server

    Browser MCP is an MCP server plus browser integration that lets AI apps automate the user’s real browser instead of launching a separate automation session. By adapting a Playwright-style approach to control the running browser profile, it reuses logged-in sessions and cookies, which reduces re-authentication friction and helps avoid some bot-detection heuristics.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Vibium

    Vibium

    Browser automation for AI agents and humans

    Vibium is an open-source browser automation infrastructure built to serve both AI agents and human developers by simplifying control and interaction with real browsers. It integrates a single lightweight binary that manages browser lifecycle, implements a WebDriver BiDi proxy, and exposes a Model Context Protocol (MCP) server so language models or automation clients can control browser behavior without complex setup.
    Downloads: 1 This Week
    Last Update:
    See Project
  • $300 in Free Credit Across 150+ Cloud Services Icon
    $300 in Free Credit Across 150+ Cloud Services

    VMs, containers, AI, databases, storage | build anything. No commitment to start.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale with Google Cloud.
    Start Building Free
  • 10
    Browser Use MCP Server

    Browser Use MCP Server

    Browse the web, directly from Cursor etc.

    A browser automation server implementing the Model Context Protocol, designed to allow AI assistants to browse the web directly from applications like Cursor. It supports natural language commands for web navigation and interaction. ​
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    Speech-AI-Forge

    Speech-AI-Forge

    Speech-AI-Forge is a project developed around TTS generation model

    Speech-AI-Forge is a full-stack project built around modern text-to-speech generation models, providing both an API server and a Gradio-based web UI for interactive use. At its core, it acts as a hub that wires together multiple speech-related capabilities, including TTS, speech-to-text and LLM-based control flows, behind a consistent interface. The system is designed to be deployed in several ways: you can try it online via hosted demos, spin it up in a one-click Colab environment, run it...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 12
    Notte

    Notte

    Opensource browser using agents

    Notte is an open-source browser framework that enables the development and deployment of web-based AI agents. It introduces a perception layer that transforms web pages into structured, navigable maps described in natural language, allowing agents to interact with the internet more effectively. Notte is designed for building scalable and efficient browser-based AI applications.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 13
    Nanobrowser

    Nanobrowser

    Open-Source Chrome extension for AI-powered web automation

    Nanobrowser is an open-source AI web automation tool that runs in your browser. A free alternative to OpenAI Operator with flexible LLM options and a multi-agent system. Nanobrowser, as a chrome extension, delivers premium web automation capabilities while keeping you in complete control. No subscription fees or hidden costs. Just install and use your own API keys, and you only pay what you use with your own API keys.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 14
    MCP Server RAG Web Browser

    MCP Server RAG Web Browser

    A MCP Server for the RAG Web Browser Actor

    The MCP Server for the RAG Web Browser Actor allows AI assistants and LLMs to perform web searches and extract information from web pages. It facilitates interaction with the web, enabling up-to-date context retrieval for AI applications. ​
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Generative AI JS

    Generative AI JS

    This SDK is now deprecated, use the new unified Google GenAI SDK

    deprecated-generative-ai-js is a JavaScript/TypeScript client and example suite for interacting with Gemini generative APIs in web and Node.js environments. Though marked deprecated (likely superseded by newer SDKs), the repo shows how to wrap HTTP/WS endpoints, manage streaming responses, and interoperate with browser UI or server logic. The examples include chat widgets, prompt pipelines, and generalized inference utilities.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    OpenClaw

    OpenClaw

    Your own personal AI assistant. Any OS. Any Platform.

    OpenClaw (formerly Clawdbot/Moltbot) is an open-source, self-hosted autonomous AI assistant designed to run on user-controlled hardware and bridge conversational natural language with real-world task execution, effectively acting as a proactive digital assistant rather than a reactive chatbot. It lets you send instructions through familiar messaging platforms like WhatsApp, Telegram, Discord, Slack, Signal, iMessage, and more, and then interprets those instructions to carry out actions such...
    Downloads: 904 This Week
    Last Update:
    See Project
  • 17
    Easy Diffusion

    Easy Diffusion

    An easy 1-click way to create beautiful artwork on your PC using AI

    Easy Diffusion is a widely used community-driven repository offering a simple, one-click way to install and use Stable Diffusion-based generative AI on a personal computer without advanced technical skills or prior setup. It provides a browser-based user interface that runs locally, allowing users to type text prompts and immediately generate images directly within their web browser, democratizing access to powerful text-to-image models for artists and hobbyists alike. The project abstracts away environment setup, dependencies, and model installation — tasks that can be daunting to beginners — and instead lets users focus on creative experimentation with prompt phrasing, model parameters, and image output settings. ...
    Downloads: 18 This Week
    Last Update:
    See Project
  • 18
    Codel

    Codel

    Fully autonomous AI Agent that can perform complicated tasks

    Fully autonomous AI Agent that can perform complicated tasks and projects using terminal, browser, and editor.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Suna

    Suna

    Suna - Open Source Generalist AI Agent

    Suna is an open-source generalist AI agent developed by Kortix AI. Designed to assist users in accomplishing real-world tasks through natural conversation, Suna combines powerful capabilities with an intuitive interface. It serves as a digital companion for research, data analysis, and everyday challenges, integrating tools like browser automation, file management, web crawling, command-line execution, website deployment, and API integration.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 20
    OpenManus

    OpenManus

    No fortress, purely open ground. OpenManus is Coming

    OpenManus is an open‑agent AI framework focused on building versatile general-purpose agents capable of autonomously executing complex workflows — such as planning, browsing, tool invocation — all via a pluggable prompts and tools interface. It's being extended with reinforcement learning‑based tuning modules and designed for researchers and developers building custom AI agents.
    Downloads: 14 This Week
    Last Update:
    See Project
  • 21

    OpsBot Browser

    Ops bot Browser for Secure private inbuilt vpn support browser, Ai Pow

    Ops bot Browser for Secure private inbuilt vpn support browser, Ai Powered Browser
    Downloads: 21 This Week
    Last Update:
    See Project
  • 22
    Happy Coder

    Happy Coder

    Mobile and Web client for Codex and Claude Code, with realtime voice

    Happy is an open-source, cross-platform mobile and web client designed to bring powerful AI coding agents such as Claude Code and Codex to your fingertips no matter where you are. At its core, Happy wraps existing AI coding tools with a unified interface, providing real-time voice interactions, encrypted communication, and seamless device switching between desktop and mobile. You can start a coding session locally through the Happy CLI or connect from a phone or browser, allowing developers to inspect, interact with, and guide the AI as it generates, tests, or explains code. ...
    Downloads: 23 This Week
    Last Update:
    See Project
  • 23
    Skyvern

    Skyvern

    Automate browser-based workflows with LLMs and Computer Vision

    Skyvern uses a combination of computer vision and AI to understand content on a webpage, making it adaptable to any website. Skyvern takes instructions in natural language, allowing it to execute complex objectives with simple commands. Skyvern is an API-first product. Workflows execute in the cloud, allowing it to run hundreds of workflows at the same time. Skyvern's AI decisions come with built-in explanations, providing clear summaries and justifications for every action. Support for...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 24
    Edit Banana

    Edit Banana

    Edit Banana: A framework for converting statistical figures

    Edit Banana is an innovative web application designed to simplify image editing by merging intuitive user interfaces with powerful generative AI capabilities, enabling users to quickly enhance, manipulate, or transform photos without needing advanced design skills. It provides a smooth, browser-based experience where users can upload images, make precise edits such as background removal or inpainting, and apply stylistic transformations or corrections through AI prompts. The tool focuses on accessibility, giving hobbyists, content creators, and small teams a way to produce polished visuals without downloading heavyweight software or managing local compute resources. ...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 25
    BrowserOS

    BrowserOS

    Agentic browser; privacy-first alternative to ChatGPT Atlas

    BrowserOS is an open-source, agentic web browser built on a Chromium base that integrates AI agents directly into the browsing experience. Rather than just doing standard browsing, it places AI intelligence at the core: you can connect your own API keys (for e.g., OpenAI, Anthropic, Google Gemini) or run local models (via e.g., Ollama) so that your browsing data and automation stay on your machine — privacy and control are emphasized throughout.
    Downloads: 25 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB