Enable AI to control your desktop, mobile and HMI devices
A framework to enable multimodal models to operate a computer
An open phone agent model & framework
Automate browser-based workflows with LLMs and Computer Vision
AI Multi-Agent Framework in .NET
A GUI Agent app based on UI-TARS to control your computer using AI
A simple screen-parsing tool for pure vision-based GUI agents
Automate native Android apps with AI using accessibility APIs
Analytics for developers: set up analytics in 30 seconds
An open-source, end-to-end, VLM-based GUI agent
Python SDK for the Computer Use model Lux, developed by OpenAGI
People Localization and Tracking for HomE Automation