GUI Exploration Lab. One of the best GUI agent solutions
A simple screen-parsing tool for pure vision-based GUI agents
AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework
CRAB: Cross-environment Agent Benchmark for Multimodal Language Model
AI agents autonomously run and improve ML experiments overnight
Superfast AI decision-making and processing of multi-modal data
BISHENG is an open LLM DevOps platform for next-generation apps
Enable AI to control your desktop, mobile and HMI devices
An Open-Source AI Agent Platform for Financial Analysis using LLMs
14-stage Fusion Pipeline for LLM token compression
Pokee Deep Research Model Open Source Repo
One API call to pull a Claude agent, completely sandboxed
An end-to-end Data Scientist
Claude Code skill that researches any topic across Reddit + X
Natural language workflows for AI agents
Automate native Android apps with AI using accessibility APIs
The Memory layer for AI Agents
An open-source, end-to-end VLM-based GUI agent
The common language for platforms, agents and businesses
The official Python SDK for UCP
Get started with building full-stack agents using Gemini 2.5 & LangGraph
A personal context-agent that learns how you work
Project-scoped Lean workflow orchestrator from Math, Inc.
Designed for training LLM/VLM agents via RL
Outcome-driven agent development framework that evolves