Enable AI to control your desktop, mobile and HMI devices
A framework to enable multimodal models to operate a computer
A simple screen-parsing tool toward a pure vision-based GUI agent
Automate native Android apps with AI using accessibility APIs
An open-source, end-to-end VLM-based GUI agent
Python SDK for the Computer Use model Lux, developed by OpenAGI