Monte Carlo tree search in JAX
Agent S: an open agentic framework that uses computers like a human
Advanced evolutionary computation library built on top of PyTorch
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
A code-first agent framework for seamlessly planning analytics tasks
DeepMind's software stack for physics-based simulation
TextWorld is a sandbox learning environment for the training
SAPIEN Manipulation Skill Framework
RL implementations
OpenDILab Decision AI Engine
RL research on Android devices
A Modular Simulation Framework and Benchmark for Robot Learning
Training framework for Stable Baselines3 reinforcement learning agents
Volcano Engine Reinforcement Learning for LLMs
Benchmarking Multimodal Agents for Open-Ended Tasks
Modular Deep Reinforcement Learning framework in PyTorch
Implementation of RLHF (Reinforcement Learning with Human Feedback)
Cosmos-RL is a flexible and scalable Reinforcement Learning framework
Language Model Reinforcement Learning Environments frameworks
A collection of reference Jupyter notebooks and demo AI/ML application
Optical-packet node transceiver frequency allocation
The most simple, flexible, and comprehensive OpenAI Gym trading
Reinforcement learning (RL) tutorial series
A repo for distributed training of language models with Reinforcement
High-quality single-file implementations of SOTA Offline