Monte Carlo tree search in JAX
MuA multi-agent reinforcement learning environment
A TensorFlow library for applied reinforcement learning
Volcano Engine Reinforcement Learning for LLMs
Benchmarking Multimodal Agents for Open-Ended Tasks
[NeurIPS 2023 Spotlight] LightZero
Advanced evolutionary computation library built on top of PyTorch
An API standard for single-agent reinforcement learning environments
Implementation of RLHF (Reinforcement Learning with Human Feedback)
An Easy-to-use, Scalable and High-performance RLHF Framework
SAPIEN Manipulation Skill Framework
A collection of reference Jupyter notebooks and demo AI/ML application
Flexible and powerful framework for managing multiple AI agents
Framework and no-code GUI for fine-tuning LLMs
PyTorch version of Stable Baselines
A modular high-level library to train embodied AI agents
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
RL implementations
The most simple, flexible, and comprehensive OpenAI Gym trading
Optical-packet node transceiver frequency allocation
Reinforcement learning (RL) tutorial series
A repo for distributed training of language models with Reinforcement
High-quality single-file implementations of SOTA Offline
A high-performance distributed training framework
TradeMaster is an open-source platform for quantitative trading