decision making free download

LLM Colosseum

Benchmark LLMs by fighting in Street Fighter 3

...The system places language models inside the environment of the classic video game Street Fighter III, where they must interpret the game state and decide which actions to perform during combat. This setup creates a dynamic environment that tests reasoning, situational awareness, and decision-making abilities in real time. Instead of relying purely on reward signals as in reinforcement learning agents, the models analyze contextual information and generate strategic actions based on the game environment. Performance is evaluated using a competitive ranking system that assigns models an Elo rating based on their results across matches against other models.
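As a rough illustration of how a rating of this kind moves after a match, here is a minimal sketch of the standard Elo update. The K-factor of 32, the starting rating of 1500, and the function names are assumptions chosen for the example; the listing does not say which parameters the project actually uses.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Probability that A beats B under the standard Elo logistic model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_elo(rating_a: float, rating_b: float, score_a: float, k: float = 32.0):
    """Return the new (rating_a, rating_b) pair.

    score_a is 1.0 if A won, 0.5 for a draw, 0.0 if A lost.
    k (assumed 32 here) controls how far one match can move a rating.
    """
    delta = k * (score_a - expected_score(rating_a, rating_b))
    # The update is zero-sum: whatever A gains, B loses.
    return rating_a + delta, rating_b - delta


# Two hypothetical models start level; model_a wins one match.
new_a, new_b = update_elo(1500.0, 1500.0, score_a=1.0)
print(new_a, new_b)  # 1516.0 1484.0
```

Because the expected score depends on the rating gap, beating a much stronger opponent moves the ratings more than beating an evenly matched one.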

Downloads: 2 This Week

Last Update: 2026-03-07


AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

...Unlike traditional language model benchmarks that focus on static text tasks, AgentBench measures how models perform in interactive environments that require planning, reasoning, and decision-making. The benchmark includes multiple environments that simulate realistic scenarios such as web interaction, database querying, and problem-solving tasks. These environments require agents to interpret instructions, take actions, and adapt their strategies based on feedback from the environment. AgentBench also includes an evaluation framework that measures success rates, rewards, and task completion performance across different agent implementations. ...

Downloads: 0 This Week

Last Update: 2026-03-05


GLM-4.6

Agentic, Reasoning, and Coding (ARC) foundation models

GLM-4.6 is the latest iteration of Zhipu AI’s foundation model, delivering significant advancements over GLM-4.5. It introduces an extended 200K token context window, enabling more sophisticated long-context reasoning and agentic workflows. The model achieves superior coding performance, excelling in benchmarks and practical coding assistants such as Claude Code, Cline, Roo Code, and Kilo Code. Its reasoning capabilities have been strengthened, including improved tool usage during inference...

Downloads: 86 This Week

Last Update: 2026-02-01


Dynamiq

An orchestration framework for agentic AI and LLM applications

...The framework supports the creation of multi-agent systems where different AI agents collaborate to solve tasks such as information retrieval, document analysis, or automated decision making. Dynamiq also includes built-in support for retrieval-augmented generation pipelines that allow models to access external documents and knowledge bases during inference.

Downloads: 6 This Week

Last Update: 3 days ago


DriveLM

Driving with Graph Visual Question Answering

...Instead of treating autonomous driving as a purely sensor-driven pipeline, DriveLM frames it as a reasoning problem where models answer structured questions about the environment to guide decision making. The system includes DriveLM-Data, a dataset built on driving environments such as nuScenes and CARLA, where human-written reasoning steps connect different layers of driving tasks. This design allows models to learn relationships between objects, behaviors, and navigation decisions through graph-structured logic.

Downloads: 0 This Week

Last Update: 2026-03-09


Cradle framework

The Cradle framework is a first attempt at General Computer Control

Cradle is an open-source framework designed to enable AI agents to perform complex computer tasks by interacting with software environments in a way similar to human users. The system introduces the concept of General Computer Control, where AI agents receive screenshots as input and perform actions through simulated keyboard and mouse operations. This approach allows agents to interact with any software interface without relying on specialized APIs or predefined automation scripts. The...

Downloads: 0 This Week

Last Update: 2026-03-06


LLM Workflow Engine

Power CLI and Workflow manager for LLMs (core package)

...The platform allows users to interact with AI models directly from the terminal, enabling conversational AI access through shell commands and scripts. Instead of focusing solely on chat interactions, the system is built to embed LLM calls into larger automation pipelines where model outputs can drive decision making or trigger additional processes. Developers can construct structured workflows using configuration files and integrate them with tools such as Ansible playbooks or custom scripts to automate complex tasks. The engine supports multiple AI providers through a plugin architecture, allowing connections to services like OpenAI, Hugging Face, Cohere, or other compatible APIs.

Downloads: 0 This Week

Last Update: 2026-03-06


Functionary

Chat language model that can use tools and interpret the results

Functionary is an open-source large language model specifically designed for interpreting and executing structured functions or external tools within conversational AI systems. The model extends traditional chat-based language models by enabling them to determine when external functions should be called and how to extract the necessary parameters from natural language input. Function definitions are typically provided in JSON schema format, allowing the model to generate structured function...
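To make the idea of JSON-schema function definitions concrete, the following is a minimal sketch of how an application might describe a tool and dispatch a structured call produced by a model. The `get_weather` tool, the `TOOLS` registry, the `dispatch` helper, and the shape of the call payload are all hypothetical and chosen for illustration; they are not Functionary's actual interface.

```python
import json


def get_weather(city: str, unit: str = "celsius") -> str:
    """Toy tool; a real one would call a weather service."""
    return f"Weather for {city} in {unit}"


# Hypothetical registry: each tool pairs a callable with a
# JSON-schema-style description that would be shown to the model.
TOOLS = {
    "get_weather": {
        "function": get_weather,
        "schema": {
            "name": "get_weather",
            "description": "Look up the current weather for a city.",
            "parameters": {
                "type": "object",
                "properties": {
                    "city": {"type": "string"},
                    "unit": {"type": "string", "enum": ["celsius", "fahrenheit"]},
                },
                "required": ["city"],
            },
        },
    }
}


def dispatch(call_json: str) -> str:
    """Parse a model-produced call (assumed JSON) and run the matching tool."""
    call = json.loads(call_json)
    tool = TOOLS[call["name"]]
    args = call["arguments"]
    # Check only the required parameters; full schema validation is omitted.
    required = tool["schema"]["parameters"]["required"]
    missing = [name for name in required if name not in args]
    if missing:
        raise ValueError(f"missing required arguments: {missing}")
    return tool["function"](**args)


# A structured call of the kind a tool-using model might emit.
model_output = '{"name": "get_weather", "arguments": {"city": "Paris"}}'
print(dispatch(model_output))  # Weather for Paris in celsius
```

The result string would normally be passed back to the model so it can interpret the tool output in its next reply.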

Downloads: 0 This Week

Last Update: 2026-03-07


ReAct Prompting

Synergizing Reasoning and Acting in Language Models

...Instead of generating answers in a single step, models using the ReAct approach produce intermediate reasoning steps and perform actions such as searching for information or interacting with external tools. This alternating sequence of reasoning, acting, and observing results allows the model to gather additional information and refine its decision-making process during task execution. The framework has been tested on several benchmarks including question answering, fact verification, and interactive decision-making tasks, demonstrating improved performance compared to methods that rely only on reasoning.
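The alternating reason, act, observe cycle described above can be sketched as a short loop. This is an illustrative sketch only: `scripted_policy` is a hard-coded stand-in for a language model, and the `search` tool and `KNOWLEDGE` table are hypothetical, so nothing here reflects the project's actual prompts or code.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical knowledge source standing in for a real search tool.
KNOWLEDGE = {"capital of france": "Paris"}


def search(query: str) -> str:
    """Toy tool: look the query up in a fixed table."""
    return KNOWLEDGE.get(query.lower(), "no result")


Step = Tuple[str, str, str]  # (thought, action taken, observation)


def scripted_policy(question: str, history: List[Step]) -> Tuple[str, str, str]:
    """Stand-in for an LLM: returns (thought, action name, action argument)."""
    if not history:
        return ("I should look this up.", "search", question)
    last_observation = history[-1][2]
    return ("The observation answers the question.", "finish", last_observation)


def react_loop(
    question: str,
    policy: Callable[[str, List[Step]], Tuple[str, str, str]],
    tools: Dict[str, Callable[[str], str]],
    max_steps: int = 5,
) -> str:
    """Alternate reasoning and acting until the policy finishes or steps run out."""
    history: List[Step] = []
    for _ in range(max_steps):
        thought, action, argument = policy(question, history)
        if action == "finish":
            return argument
        observation = tools[action](argument)  # act, then observe
        history.append((thought, f"{action}[{argument}]", observation))
    return "no answer within step budget"


print(react_loop("capital of France", scripted_policy, {"search": search}))  # Paris
```

In a real system the policy would be a model call whose prompt includes the accumulated history, which is how each observation can influence the next reasoning step.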

Downloads: 0 This Week

Last Update: 2026-03-05


Repo of Tree of Thoughts (ToT)

Implementation of "Tree of Thoughts"

...ToT allows LMs to perform deliberate decision-making by considering multiple different reasoning paths and self-evaluating choices to decide the next course of action, as well as looking ahead or backtracking when necessary to make global choices.
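One way to picture exploring several reasoning paths and scoring them is a breadth-first search that keeps only the best few partial solutions at each level. The sketch below uses a toy task (pick steps of 1, 2, or 3 that sum to 7) with hand-written `propose` and `evaluate` functions standing in for model calls; all names and the beam width are assumptions, and explicit backtracking is not shown.

```python
from typing import Callable, List, Optional, Tuple

State = Tuple[int, ...]  # a partial solution: the steps chosen so far
TARGET = 7


def propose(state: State) -> List[State]:
    """Generate candidate next thoughts; a model would do this in practice."""
    return [state + (step,) for step in (1, 2, 3) if sum(state) + step <= TARGET]


def evaluate(state: State) -> float:
    """Score a partial solution; higher is better (closer to the target)."""
    return -abs(TARGET - sum(state))


def is_goal(state: State) -> bool:
    return sum(state) == TARGET


def tree_search_bfs(
    root: State,
    propose: Callable[[State], List[State]],
    evaluate: Callable[[State], float],
    is_goal: Callable[[State], bool],
    beam_width: int = 2,
    max_depth: int = 4,
) -> Optional[State]:
    """Expand every state in the frontier, then keep the top-scoring few."""
    frontier = [root]
    for _ in range(max_depth):
        candidates = [child for state in frontier for child in propose(state)]
        if not candidates:
            break
        for candidate in candidates:
            if is_goal(candidate):
                return candidate
        # Self-evaluation step: prune to the most promising branches.
        candidates.sort(key=evaluate, reverse=True)
        frontier = candidates[:beam_width]
    return None


solution = tree_search_bfs((), propose, evaluate, is_goal)
print(solution, sum(solution) if solution else None)
```

Pruning to a fixed beam trades completeness for cost: a wider beam considers more alternative paths per level at the price of more evaluations.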

Downloads: 0 This Week

Last Update: 2023-08-21


Search Results for "decision making"

Showing 10 open source projects for "decision making"


