Control Any Computer Using LLMs
A natural language interface for computers
A GUI Agent app based on UI-TARS to control your computer using AI
Structure-from-Motion and Multi-View Stereo
Java interface to OpenCV, FFmpeg, and more
LLM Frontend for Power Users
3D Computer Vision Framework
Effortless data labeling with AI support from Segment Anything
AI tool for automating desktop tasks via natural language input
Agent S: an open agentic framework that uses computers like a human
Clippy, now with some AI
Create UIs for your machine learning model in Python in 3 minutes
The Cradle framework is a first attempt at General Computer Control
An easy 1-click way to create beautiful artwork on your PC using AI
Agent Zero AI framework
Fast image augmentation library and an easy-to-use wrapper
The repository provides code for running inference with SAM 2
Self-contained, offline survival computer with tools, knowledge, & AI
SWE-agent takes a GitHub issue and tries to automatically fix it
Fully Managed OpenClaw Framework for all knowledge work ever
A neural network that transforms a design mock-up into static websites
Open source demo platform where you can easily showcase your AI models
Open-source infrastructure for Computer-Use Agents. Sandboxes
Open source no-code system for text annotation and building of text
An open source, extensible AI agent that goes beyond code suggestions