MAI-UI is an open-source project that provides a family of foundation GUI (Graphical User Interface) agent models capable of interpreting natural-language instructions and performing real-world GUI navigation and control tasks across mobile and desktop environments. Developed by Tongyi-MAI, Alibaba’s research initiative, the MAI-UI models are multimodal agents trained on user instructions paired with screenshots: they ground each instruction to on-screen elements and generate sequences of GUI actions such as taps, swipes, text input, and system commands. Unlike conventional GUI automation frameworks, MAI-UI emphasizes realistic deployment: it supports agent–user interaction for clarifying ambiguous instructions, integrates with external tool APIs through MCP (Model Context Protocol) calls, and includes a device–cloud collaboration mechanism that dynamically routes computation to on-device or cloud models based on task state and privacy constraints.
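To make the instruction-plus-screenshot loop concrete, here is a minimal sketch of a single agent step. The action schema, field names, and the `model.predict` call are illustrative assumptions, not MAI-UI's actual interface:

```python
# Hypothetical sketch of one perception-to-action step.
# The GUIAction schema and model.predict(...) are placeholders, not MAI-UI's real API.
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class GUIAction:
    kind: str                              # e.g. "tap", "swipe", "type", "system"
    target: Optional[Tuple[int, int]] = None  # coordinates grounded from the screenshot
    text: Optional[str] = None             # text to enter for "type" actions


def agent_step(instruction: str, screenshot_png: bytes, model) -> GUIAction:
    """The multimodal model reads the user instruction together with the current
    screenshot and returns one grounded GUI action (model is any object exposing
    a predict(...) method that yields a dict-like result)."""
    raw = model.predict(instruction=instruction, image=screenshot_png)
    return GUIAction(
        kind=raw["action"],
        target=raw.get("coords"),
        text=raw.get("text"),
    )
```

In an actual run, an executor would apply the returned action to the device, capture a fresh screenshot, and repeat until the task completes or the agent asks the user for clarification.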
Features
- Natural language to GUI action generation for mobile/desktop interfaces
- Multimodal grounding of text and screenshots for UI understanding
- Support for direct user interaction and clarification workflows
- MCP tool integration for extended API-level operations
- Device–cloud hybrid execution to balance privacy and performance (see the routing sketch after this list)
- Models at multiple scales (from lightweight to large-capacity variants)
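The sketch below illustrates the device–cloud routing idea referenced above: keep privacy-sensitive screens on the device and escalate to the cloud model only when the task stalls. The thresholds, field names, and policy details are assumptions for illustration, not MAI-UI's published routing logic:

```python
# Illustrative device-cloud routing policy; all names and thresholds are assumptions.
from dataclasses import dataclass


@dataclass
class TaskState:
    contains_private_data: bool   # e.g. screenshots showing credentials or personal info
    steps_without_progress: int   # rough signal that the lightweight model is stuck


def choose_executor(state: TaskState, escalation_threshold: int = 3) -> str:
    """Default to the on-device model; never send privacy-sensitive screens off-device,
    and escalate to the cloud model only when the task has stalled."""
    if state.contains_private_data:
        return "on_device"
    if state.steps_without_progress >= escalation_threshold:
        return "cloud"
    return "on_device"


# Example: a stalled, non-sensitive task is escalated to the cloud model.
print(choose_executor(TaskState(contains_private_data=False, steps_without_progress=4)))
```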