A framework to enable multimodal models to operate a computer
3D reconstruction software
Agent S: an open agentic framework that uses computers like a human
The repository provides code for running inference with SAM 2
Python SDK for the Computer Use model Lux, developed by OpenAGI
A natural language interface for computers
Training data (data labeling, annotation, workflow) for all data types
Agent Zero AI framework
Fast image augmentation library and an easy-to-use wrapper
The open-source tool for building high-quality datasets
Hub of ready-to-use datasets for ML models
E2B Desktop Sandbox for LLMs. E2B Sandbox
Open source framework for deep learning satellite and aerial imagery
Deep learning library
An Efficient Agentic Model for Computer Use
The most reliable AI agent framework that supports MCP
Ultralytics YOLO
Open-source infrastructure for Computer-Use Agents. Sandboxes
Dough is a open source tool for steering AI animations with precision
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Create UIs for your machine learning model in Python in 3 minutes
Generating Immersive, Explorable, and Interactive 3D Worlds
SWE-agent takes a GitHub issue and tries to automatically fix it
Enable AI to control your desktop, mobile and HMI devices
Tooling for the Common Objects In 3D dataset