Enable AI to control your desktop, mobile and HMI devices
A framework to enable multimodal models to operate a computer
An open phone agent model & framework
Automate browser-based workflows with LLMs and Computer Vision
A simple screen parsing tool towards pure vision based GUI agent
Automate native Android apps with AI using accessibility APIs
Python SDK for the Computer Use model Lux, developed by OpenAGI
An open sourced end-to-end VLM-based GUI Agent