Monte Carlo tree search in JAX
A multi-agent reinforcement learning environment
Framework and no-code GUI for fine-tuning LLMs
PyTorch version of Stable Baselines
Benchmarking Multimodal Agents for Open-Ended Tasks
[NeurIPS 2023 Spotlight] LightZero
Modular Deep Reinforcement Learning framework in PyTorch
Advanced evolutionary computation library built on top of PyTorch
Implementation of RLHF (Reinforcement Learning from Human Feedback)
Massively parallel rigid-body physics simulation
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
VMAS is a vectorized differentiable simulator
Physical Symbolic Optimization
Flexible and powerful framework for managing multiple AI agents
Cosmos-RL is a flexible and scalable Reinforcement Learning framework
Language Model Reinforcement Learning Environments frameworks
Volcano Engine Reinforcement Learning for LLMs
A modular high-level library to train embodied AI agents
A code-first agent framework for seamlessly planning analytics tasks
Optical-packet node transceiver frequency allocation
A collection of reference Jupyter notebooks and demo AI/ML application
The most simple, flexible, and comprehensive OpenAI Gym trading
Reinforcement learning (RL) tutorial series
A repo for distributed training of language models with Reinforcement
High-quality single-file implementations of SOTA Offline