Page Agent is an open-source, in-page AI agent framework that lets developers control and interact with web interfaces using natural language directly within the browser. Unlike traditional browser automation tools, its core runs as in-page JavaScript, so single-page use needs no browser extension, headless browser, or external automation environment. Users manipulate the DOM through text-based commands, so workflows such as form filling, navigation, and UI interaction can be expressed as short natural language instructions. Page Agent is designed to be embedded in existing web applications, which makes it possible to add AI copilots to SaaS platforms without major backend changes. It follows a bring-your-own-LLM approach: developers connect their preferred language model to power the agent’s reasoning.
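
The bring-your-own-LLM idea can be illustrated with a minimal sketch. This is not Page Agent's actual API: the names `LlmClient`, `AgentAction`, `parseAction`, and `nextAction`, as well as the JSON reply format, are assumptions made up for illustration. The sketch only shows how an agent loop might accept any model behind a plain function and validate the model's reply before acting on it.

```typescript
// Hypothetical types, not taken from Page Agent's source.
type AgentAction =
  | { kind: "click"; index: number }
  | { kind: "type"; index: number; text: string }
  | { kind: "done" };

// Any model can be plugged in as long as it maps a prompt to a text reply.
type LlmClient = (prompt: string) => Promise<string>;

// Validate the model's JSON reply before anything touches the page.
function parseAction(reply: string): AgentAction {
  const data: unknown = JSON.parse(reply);
  if (typeof data !== "object" || data === null) {
    throw new Error("reply is not a JSON object");
  }
  const record = data as Record<string, unknown>;
  const kind = record["kind"];
  const index = record["index"];
  const text = record["text"];

  if (kind === "done") return { kind: "done" };
  if (typeof index !== "number" || !Number.isInteger(index) || index < 0) {
    throw new Error("missing or invalid element index");
  }
  if (kind === "click") return { kind: "click", index };
  if (kind === "type" && typeof text === "string") {
    return { kind: "type", index, text };
  }
  throw new Error(`unsupported action: ${String(kind)}`);
}

// One reasoning step: send the instruction plus a text view of the page
// to the caller-supplied model, then parse what comes back.
async function nextAction(
  llm: LlmClient,
  instruction: string,
  pageText: string
): Promise<AgentAction> {
  const prompt = [
    "You control a web page. Reply with one JSON action.",
    `Instruction: ${instruction}`,
    "Interactive elements:",
    pageText,
  ].join("\n");
  return parseAction(await llm(prompt));
}
```

Keeping the model behind a single function type means a hosted API, a local model, or a test stub can be swapped in without changing the loop, and strict parsing keeps a malformed reply from turning into an unintended page action.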

Features

  • In-page JavaScript agent with no need for headless browsers or extensions
  • Natural language control for interacting with web interfaces and DOM elements
  • Bring-your-own-LLM support for customizable AI reasoning
  • Text-based DOM manipulation without reliance on screenshots or vision models
  • Human-in-the-loop interface for monitoring and guiding actions
  • Optional multi-page automation support via browser extension
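
To make the text-based DOM feature above more concrete, here is a minimal sketch of how a page might be turned into an indexed text listing that a language model can reference, with no screenshots involved. It is an assumption-laden illustration, not Page Agent's implementation: `UiNode`, `IndexedPage`, `INTERACTIVE_TAGS`, and `serializeInteractive` are hypothetical names, and a plain object tree stands in for real DOM elements so the example runs outside a browser.

```typescript
// Hypothetical simplified node; a real page would supply DOM elements.
interface UiNode {
  tag: string;
  text?: string;
  attrs?: Record<string, string>;
  children?: UiNode[];
}

interface IndexedPage {
  lines: string[];   // text shown to the model, one line per element
  targets: UiNode[]; // targets[i] is the element behind index i
}

// Assumed set of tags worth exposing to the model.
const INTERACTIVE_TAGS = new Set(["a", "button", "input", "select", "textarea"]);

// Walk the tree depth-first and number each interactive element so the
// model can answer with an index instead of a selector or coordinates.
function serializeInteractive(root: UiNode): IndexedPage {
  const lines: string[] = [];
  const targets: UiNode[] = [];

  const visit = (node: UiNode): void => {
    if (INTERACTIVE_TAGS.has(node.tag)) {
      const index = targets.length;
      targets.push(node);
      const label =
        node.text ?? node.attrs?.["placeholder"] ?? node.attrs?.["name"] ?? "";
      lines.push(`[${index}] <${node.tag}> ${label}`.trimEnd());
    }
    for (const child of node.children ?? []) visit(child);
  };

  visit(root);
  return { lines, targets };
}
```

For a form containing an email input and a submit button, the listing would hold one numbered line per control, and an action such as "click 1" could then be resolved through `targets[1]`.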

License

MIT License

Additional Project Details

Programming Language

TypeScript

Related Categories

TypeScript User Interface (UI) Software

Registered

2026-03-17