A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Run Claude Code, Gemini, Codex in a clean, isolated sandbox
Spring AI Alibaba examples for building and testing AI apps
A security scanner for custom LLM applications
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
NestJS Helper + AI Chatbot Development
A minimal yet professional single agent demo project
CLI tool for multi-agent workflows and automated code generation
All-in-one AI companion! Desktop girlfriend + virtual streamer
Behavior tree AI for Godot Engine
An open-source, code-first Java toolkit
An open-source visual programming environment
The open-source post-building layer for agents
Harmless liberation prompts
Visual AI IDE for building agents with prompt chains and graphs
Evaluate your LLM's response with Prometheus and GPT-4
AWS-native chatbot using Bedrock
MCP server wrapper for OpenAI Codex CLI
Open-source AI trading OS for autonomous multi-model trading systems
GitHub Agentic Workflows
Framework for building AI agents that automate complex web tasks
Ship AI Agents to Google Cloud in minutes, not months
Anthropic's educational courses
A tool to use the Ai2 Open Coding Agents Soft-Verified Agents
High-resolution models for human tasks