| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| Parent folder | |||
| README.md | 2026-03-19 | 553 Bytes | |
| Release v0.9.6 source code.tar.gz | 2026-03-19 | 376.1 kB | |
| Release v0.9.6 source code.zip | 2026-03-19 | 426.7 kB | |
| Totals: 3 Items | 803.3 kB | 0 | |
What's changed ?
bump vllm and deepspeed@xiaoxigua999remove KTO/PRM/KD, batch_inference, and interactive_chat@xiaoxigua999add grad_norm logging and PPO phase timing breakdown@yxs
Full Changelog: https://github.com/OpenRLHF/OpenRLHF/compare/v0.9.5...v0.9.6