v0.7.0: NeMo PPO, PEFT Migration, and Fixes

The v0.7.0 release includes several new features, bug fixes, and overall improvements to the codebase. Here are the key changes:

🐠 NeMo PPO and SFT support

This release introduces NeMo-backed PPO and SFT implementations, intended to extend trlx's capabilities and improve system performance in large-scale training.

🦆 PEFT Migration

trlx now supports parameter-efficient tuning methods via the peft library, which we hope will provide greater access to RLHF training in low-resource settings.
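
To give a rough sense of why parameter-efficient methods help in low-resource settings, here is a small back-of-the-envelope sketch. It assumes LoRA, one of the methods available in the peft library (the notes above do not say which methods are used), and the layer size and rank are hypothetical values chosen only for illustration. Nothing here calls trlx or peft; it just counts parameters.

```python
# Illustrative arithmetic only: compares the trainable parameter count of a
# full weight matrix with that of a LoRA-style low-rank update.
# The dimensions and rank below are hypothetical, not taken from trlx.


def full_params(d_out: int, d_in: int) -> int:
    """Parameters in a dense weight matrix W of shape (d_out, d_in)."""
    return d_out * d_in


def lora_trainable_params(d_out: int, d_in: int, rank: int) -> int:
    """Parameters in the low-rank factors B (d_out x rank) and A (rank x d_in).

    LoRA keeps W frozen and trains only B and A, so the update is B @ A.
    """
    return rank * (d_out + d_in)


if __name__ == "__main__":
    d_out = d_in = 4096  # hypothetical projection size
    rank = 8             # hypothetical LoRA rank

    full = full_params(d_out, d_in)
    lora = lora_trainable_params(d_out, d_in, rank)

    print(f"full fine-tuning: {full:,} trainable parameters")
    print(f"LoRA (rank {rank}): {lora:,} trainable parameters")
    print(f"fraction trained: {lora / full:.4%}")
```

Fewer trainable parameters mean smaller gradients and optimizer state to hold in memory, which is the main reason these methods can make RLHF training feasible on more modest hardware.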

Fixes and more!

New Contributors

Full Changelog: https://github.com/CarperAI/trlx/compare/v0.6.0...v0.7.0

Source: README.md, updated 2023-06-23