The summarize-from-feedback repository implements the methods from the paper “Learning to Summarize from Human Feedback”. Its purpose is to train a summarization model that better aligns with human preferences: human feedback (pairwise comparisons between summaries) is first collected to train a reward model, and a policy (the summarizer) is then fine-tuned to maximize that learned reward. The code covers the distinct stages of this pipeline: a supervised baseline (standard summarization fine-tuning), the reward-modeling component, and the reinforcement learning (preference-based fine-tuning) phase. The repo also includes utilities for dataset handling, model architectures, inference, and evaluation. Because the codebase is experimental, parts of it may not run out of the box depending on dependencies and environment, but it remains a canonical reference for implementing summarization from human feedback.
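
The reward model at the center of this pipeline is trained with a pairwise comparison loss: given a post and two candidate summaries, it should assign a higher score to the summary the human labeler preferred. The sketch below illustrates that loss in PyTorch; the `reward_model` callable and its argument names are assumptions for illustration, not the repository's actual interfaces.

```python
# Minimal sketch of the pairwise reward-model loss used in preference learning.
# Names (reward_model, posts, preferred, rejected) are illustrative, not the repo's API.
import torch.nn.functional as F


def reward_model_loss(reward_model, posts, preferred, rejected):
    """Cross-entropy loss on which of two summaries a human preferred.

    reward_model(posts, summaries) is assumed to return one scalar score per example.
    """
    r_preferred = reward_model(posts, preferred)  # shape: (batch,)
    r_rejected = reward_model(posts, rejected)    # shape: (batch,)
    # Maximize log sigma(r_preferred - r_rejected), i.e. the probability that
    # the model ranks the human-preferred summary higher than the rejected one.
    return -F.logsigmoid(r_preferred - r_rejected).mean()
```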
Features
- Supervised baseline summarization model that serves as the starting point for fine-tuning
- Reward model trained from human comparisons of summary pairs
- Preference-based fine-tuning / RL stage that optimizes the summarizer toward human judgments (see the sketch after this list)
- Dataset handling modules (loading, comparisons, splits)
- Inference and evaluation scripts to generate and score summaries
- Architecture layout files (e.g. model_layout.py) supporting modular model definitions
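
The RL stage noted in the list above follows the paper in optimizing the policy against the learned reward while penalizing divergence from the supervised baseline, which keeps generations from drifting into reward-hacking artifacts. Below is a minimal sketch of that KL-penalized reward signal, assuming per-sample log-probabilities are already computed; the function name and the default `beta` are illustrative, not values taken from the repository.

```python
# Sketch of the per-episode reward for the RL stage: the reward model's score
# minus a KL penalty that keeps the policy close to the supervised baseline.
import torch


def rl_reward(
    reward_score: torch.Tensor,    # r_theta(x, y) from the trained reward model
    policy_logprob: torch.Tensor,  # log pi_RL(y | x) under the current policy
    sft_logprob: torch.Tensor,     # log pi_SFT(y | x) under the supervised baseline
    beta: float = 0.05,            # KL penalty coefficient (illustrative value)
) -> torch.Tensor:
    """Per-sample reward: learned reward minus a KL penalty toward the SFT model."""
    kl_penalty = policy_logprob - sft_logprob  # sample estimate of the log ratio
    return reward_score - beta * kl_penalty
```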