Midscene.js is an open source AI-driven UI automation framework designed to control user interfaces across multiple platforms using natural language instructions. Instead of relying on traditional selectors, DOM structures, or accessibility attributes, it uses a vision-first approach where screenshots are analyzed by visual-language models to identify interface elements and perform actions. It allows developers to automate interactions on web applications, desktop software, and mobile devices without needing platform-specific automation logic. Developers can describe tasks such as clicking buttons, filling forms, or extracting information, and the system interprets these commands to interact with the interface accordingly. Midscene.js includes SDKs, scripting options, and integration capabilities that allow automation workflows to be written in JavaScript, TypeScript, or YAML-based scripts. Midscene also provides debugging and development tools.

Features

  • Vision-based UI element detection using screenshots instead of DOM selectors
  • Natural language automation that interprets user instructions into UI actions
  • Cross-platform support for web browsers, desktop applications, and mobile devices
  • JavaScript and YAML scripting support for building automation workflows
  • Built-in debugging tools including visual replay reports and playground tools
  • Integration with automation tools and frameworks for browser and device control

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow Midscene

Midscene Web Site

Other Useful Business Software
Auth0 B2B Essentials: SSO, MFA, and RBAC Built In Icon
Auth0 B2B Essentials: SSO, MFA, and RBAC Built In

Unlimited organizations, 3 enterprise SSO connections, role-based access control, and pro MFA included. Dev and prod tenants out of the box.

Auth0's B2B Essentials plan gives you everything you need to ship secure multi-tenant apps. Unlimited orgs, enterprise SSO, RBAC, audit log streaming, and higher auth and API limits included. Add on M2M tokens, enterprise MFA, or additional SSO connections as you scale.
Sign Up Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Midscene!

Additional Project Details

Programming Language

JavaScript, TypeScript

Related Categories

JavaScript Artificial Intelligence Software, TypeScript Artificial Intelligence Software

Registered

2026-03-12