PyTorch version of Stable Baselines
Framework and no-code GUI for fine-tuning LLMs
Physical Symbolic Optimization
A repo for distributed training of language models with Reinforcement
High-quality single-file implementations of SOTA Offline
Library of deep learning models and datasets
ChainerRL is a deep reinforcement learning library
Enables easy experimentation with state-of-the-art algorithms
Deep Reinforcement Learning for Keras
Intel® Nervana™ reference deep learning framework