Monte Carlo tree search in JAX
PyTorch version of Stable Baselines
DeepMind's software stack for physics-based simulation
Advanced evolutionary computation library built on top of PyTorch
Massively parallel rigid-body physics simulation
OpenDILab Decision AI Engine
Framework and no-code GUI for fine-tuning LLMs
Open-source, high-performance AI model with advanced reasoning
A Modular Simulation Framework and Benchmark for Robot Learning
TextWorld is a sandbox learning environment for the training
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Benchmarking Multimodal Agents for Open-Ended Tasks
Training framework for Stable Baselines3 reinforcement learning agents
SAPIEN Manipulation Skill Framework
Implementation of RLHF (Reinforcement Learning from Human Feedback)
Tool for visualizing and tracking your machine learning experiments
Volcano Engine Reinforcement Learning for LLMs
RL implementations
The simplest, most flexible, and most comprehensive OpenAI Gym trading
Optical-packet node transceiver frequency allocation
Reinforcement learning (RL) tutorial series
A repo for distributed training of language models with Reinforcement
High-quality single-file implementations of SOTA Offline
A high-performance distributed training framework
TradeMaster is an open-source platform for quantitative trading