OpenRLHF Files

An Easy-to-use, Scalable and High-performance RLHF Framework

This is an exact mirror of the OpenRLHF project, hosted at https://github.com/OpenRLHF/OpenRLHF. SourceForge is not affiliated with OpenRLHF. For more information, see the SourceForge Open Source Mirror Directory.


Name	Modified	Size	Downloads / Week
README.md	2025-07-24	654 Bytes	0
Release v0.8.7 source code.tar.gz	2025-07-24	368.3 kB	0
Release v0.8.7 source code.zip	2025-07-24	452.3 kB	1
Totals: 3 Items		821.2 kB	1

What's Changed

Support adjustable min_lr in the DPO training script by @viswavi in https://github.com/OpenRLHF/OpenRLHF/pull/1092
Support soft overlong punishment from DAPO by @xjli360 in https://github.com/OpenRLHF/OpenRLHF/pull/1091
Support token-in-token-out and agent loop by @xiaoxigua999 and @physicsru in https://github.com/OpenRLHF/OpenRLHF/pull/1094
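
For readers unfamiliar with the soft overlong punishment mentioned above, here is a minimal sketch of the length penalty as described in the DAPO paper. It is not taken from the OpenRLHF code base: the function name and the parameters `max_len` and `buffer_len` are hypothetical, and the actual option names and behavior in OpenRLHF may differ.

```python
def soft_overlong_penalty(response_len: int, max_len: int, buffer_len: int) -> float:
    """Length penalty added to the reward, following the DAPO formulation.

    All names here are illustrative assumptions, not OpenRLHF identifiers.
    max_len    -- hard limit on response length (in tokens)
    buffer_len -- width of the soft zone just below max_len
    """
    if not 0 < buffer_len <= max_len:
        raise ValueError("buffer_len must be in (0, max_len]")

    soft_start = max_len - buffer_len
    if response_len <= soft_start:
        # Short enough: no penalty.
        return 0.0
    if response_len <= max_len:
        # Inside the soft zone: penalty falls linearly from 0 to -1.
        return (soft_start - response_len) / buffer_len
    # Past the hard limit: full penalty.
    return -1.0


if __name__ == "__main__":
    for n in (100, 175, 200, 250):
        print(n, soft_overlong_penalty(n, max_len=200, buffer_len=50))
```

The linear ramp means a response that slightly overruns the soft threshold is only mildly penalized, rather than being treated the same as one that hits the hard limit.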

New Contributors

@viswavi made their first contribution in https://github.com/OpenRLHF/OpenRLHF/pull/1092
@xjli360 made their first contribution in https://github.com/OpenRLHF/OpenRLHF/pull/1091

Full Changelog: https://github.com/OpenRLHF/OpenRLHF/compare/v0.8.6...v0.8.7

Source: README.md, updated 2025-07-24
