A framework to enable multimodal models to operate a computer
Enable AI to control your desktop, mobile and HMI devices
Automate native Android apps with AI using accessibility APIs
Real-World Centric Foundation GUI Agents
Control Any Computer Using LLMs