A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
An AI Hedge Fund Team
Agentic, Reasoning, and Coding (ARC) foundation models
A free/open source client and automation tool for Ragnarok Online
Reflexion: Language Agents with Verbal Reinforcement Learning
AI agents running research on single-GPU nanochat training
An open source digital image forensic toolset
Making Enterprise Data Intelligent and Responsive for AI
Mental models, decision heuristics, expressing DNA
A code-first agent framework for seamlessly planning analytics tasks
A specialized Claude Code workspace for creating long-form
An orchestration framework for agentic AI and LLM applications
Python framework for building scalable multi-agent systems
Enable AI to control your desktop, mobile and HMI devices
150+ quantitative finance Python programs
A Foundation Model for Generalist Gaming Agents
An agent is just a for-loop
Volcano Engine Reinforcement Learning for LLMs
Open multimodal web agent built by Ai2
Workshop-Level Automated Scientific Discovery via Agentic Tree Search
Latent Collaboration in Multi-Agent Systems
UI-TARS-desktop version that can operate on your local personal device
Agent S: an open agentic framework that uses computers like a human
A Personalized LLM-powered Agent Framework
Machine learning on FPGAs using HLS