version free download

Stable Baselines3

PyTorch version of Stable Baselines

Stable Baselines3 (SB3) is a set of reliable implementations of reinforcement learning algorithms in PyTorch. It is the next major version of Stable Baselines. You can read a detailed presentation of Stable Baselines3 in the v1.0 blog post or our JMLR paper. These algorithms will make it easier for the research community and industry to replicate, refine, and identify new ideas, and will create good baselines to build projects on top of. We expect these tools will be used as a base around which new ideas can be added, and as a tool for comparing a new approach against existing ones. ...

Downloads: 2 This Week

Last Update: 2025-12-05

See Project

Agent S

Agent S: an open agentic framework that uses computers like a human

...Built to operate graphical user interfaces like a human, it allows AI agents to perceive screens, reason about tasks, and execute actions across macOS, Windows, and Linux systems. The latest version, Agent S3, surpasses human-level performance on the OSWorld benchmark, demonstrating state-of-the-art results in complex multi-step computer tasks. Agent S combines powerful foundation models (such as GPT-5) with grounding models like UI-TARS to translate visual inputs into precise executable actions. It supports flexible deployment via CLI, SDK, or cloud, and integrates with multiple model providers including OpenAI, Anthropic, Gemini, Azure, and Hugging Face endpoints. ...

Downloads: 3 This Week

Last Update: 2025-12-16

See Project

VectorizedMultiAgentSimulator (VMAS)

VMAS is a vectorized differentiable simulator

VectorizedMultiAgentSimulator is a high-performance, vectorized simulator for multi-agent systems, focusing on large-scale agent interactions in shared environments. It is designed for research in multi-agent reinforcement learning, robotics, and autonomous systems where thousands of agents need to be simulated efficiently.

Downloads: 0 This Week

Last Update: 2025-11-10

See Project

Jittor

Jittor is a high-performance deep learning framework

Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators. The whole framework and meta-operators are compiled just in time. A powerful op compiler and tuner are integrated into Jittor. It allowed us to generate high-performance code specialized for your model. Jittor also contains a wealth of high-performance model libraries, including image recognition, detection, segmentation, generation, differentiable rendering, geometric learning, reinforcement...

Downloads: 0 This Week

Last Update: 2025-07-28

See Project

dm_control

DeepMind's software stack for physics-based simulation

DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo. DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo physics. The MuJoCo Python bindings support three different OpenGL rendering backends: EGL (headless, hardware-accelerated), GLFW (windowed, hardware-accelerated), and OSMesa (purely software-based). At least one of these three backends must be available in order render...

Downloads: 0 This Week

Last Update: 2026-03-11

See Project

Astrape

Optical-packet node transceiver frequency allocation

In an optical network scenario which consists of multiple nodes (whiteboxes) at its edges and ROADMs in-between, the coherent transceiver average laser configuration time is improved. The process is evaluated according to a testbed setup. This is facilitated in the appropriate lab equipment (or via simulation when required). For that purpose, a software agent (Netconf server) residing at the whiteboxes, is developed receiving input from the Software-Defined Networking (SDN) packet...

Downloads: 0 This Week

Last Update: 2025-03-14

See Project

TensorLayer

Deep learning and reinforcement learning library for scientists

...TensorLayer is awarded the 2017 Best Open Source Software by the ACM Multimedia Society. This project can also be found at OpenI and Gitee. 3.0.0 has been pre-released, the current version supports TensorFlow, MindSpore and PaddlePaddle (partial) as the backends, allowing users to run the code on different hardware like Nvidia-GPU and Huawei-Ascend. In the future, it will support TensorFlow, MindSpore, PaddlePaddle, PyTorch and other backends. TensorLayer has a high-level layer/model abstraction which is effortless to learn. ...

Downloads: 1 This Week

Last Update: 2022-01-17

See Project

CCZero (中国象棋Zero)

Implement AlphaZero/AlphaGo Zero methods on Chinese chess

ChineseChess-AlphaZero is a project that implements the AlphaZero algorithm for the game of Chinese Chess (Xiangqi). It adapts DeepMind’s AlphaZero method—combining neural networks and Monte Carlo Tree Search (MCTS)—to learn and play Chinese Chess without prior human data. The system includes self-play, training, and evaluation pipelines tailored to Xiangqi's unique game mechanics.

Downloads: 0 This Week

Last Update: 2025-03-13

See Project

Project Malmo

A platform for Artificial Intelligence experimentation on Minecraft

...The Malmo platform is a sophisticated AI experimentation platform built on top of Minecraft, and designed to support fundamental research in artificial intelligence. The Project Malmo platform consists of a mod for the Java version, and code that helps artificial intelligence agents sense and act within the Minecraft environment. The two components can run on Windows, Linux, or Mac OS, and researchers can program their agents in any programming language they’re comfortable with.

Downloads: 6 This Week

Last Update: 2023-03-23

See Project

Maja Machine Learning Framework

This project provides a framework for testing and comparing different machine learning algorithms (particularly reinforcement learning methods) in different scenarios. Its intended area of application is in research and education.

Downloads: 0 This Week

Last Update: 2018-05-28

See Project

PLASTK

A Python class library of tools for learning agents, including reinforcement learning algorithms, function approximators, and vector quantizations algorithms. (Pronounced "plastic".)

Downloads: 0 This Week

Last Update: 2013-04-22

See Project

Verve: General Purpose Agents

General purpose agents using reinforcement learning. Combines radial basis functions, temporal difference learning, planning, uncertainty estimations, and curiosity. Intended to be an out-of-the-box solution for roboticists and game developers.

1 Review

Downloads: 0 This Week

Last Update: 2013-04-24

See Project

Pyqle

A Python translation of Piqle with Traits for simplified simulation with reinforcement learning agents.

Downloads: 0 This Week

Last Update: 2013-04-23

See Project

Search Results for "version"

Showing 13 open source projects for "version"

Stable Baselines3

Agent S

VectorizedMultiAgentSimulator (VMAS)

Jittor

dm_control

Astrape

TensorLayer

CCZero (中国象棋Zero)

Project Malmo

Maja Machine Learning Framework

PLASTK

Verve: General Purpose Agents

Pyqle

Search Results for "version"

Showing 13 open source projects for "version"

Stable Baselines3

Agent S

VectorizedMultiAgentSimulator (VMAS)

Jittor

dm_control

Astrape

TensorLayer

CCZero (中国象棋Zero)

Project Malmo

Maja Machine Learning Framework

PLASTK

Verve: General Purpose Agents

Pyqle

Related Searches

Related Categories