| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| Parent folder | |||
| README.md | 2025-09-22 | 1.2 kB | |
| Release v0.8.11 source code.tar.gz | 2025-09-22 | 372.1 kB | |
| Release v0.8.11 source code.zip | 2025-09-22 | 458.0 kB | |
| Totals: 3 Items | 831.3 kB | 0 | |
What's Changed
- Fix PPO progress display when resuming from a checkpoint by @zhaoxu98 in https://github.com/OpenRLHF/OpenRLHF/pull/1124
- Add
--tokenizer_chat_templateargument to the DPO trainer by @armsp in https://github.com/OpenRLHF/OpenRLHF/pull/1129 - Add GEM: A Gym for Generalist LLMs demo by @xiaoxigua999 in https://github.com/OpenRLHF/OpenRLHF/commit/4e9a12f9f902db880d4a599e18b36ab37c7742d4
- Bump vLLM to 0.10.2 by @xiaoxigua999 in https://github.com/OpenRLHF/OpenRLHF/commit/b678e303a3d432b271b85527365ab4cd2467f9b7
- Bump DeepSpeed to 0.17.6 by @xiaoxigua999 in https://github.com/OpenRLHF/OpenRLHF/commit/ac3689ece2666cb524836568b88bfc6016dabc6f
- Bump Transformers to 4.56.1 by @xiaoxigua999 in https://github.com/OpenRLHF/OpenRLHF/commit/564e4672dee0f1599c2dfe434f135a8c9570318f
New Contributors
- @zhaoxu98 made their first contribution in https://github.com/OpenRLHF/OpenRLHF/pull/1124
- @armsp made their first contribution in https://github.com/OpenRLHF/OpenRLHF/pull/1129
Full Changelog: https://github.com/OpenRLHF/OpenRLHF/compare/v0.8.10...v0.8.11