Download Latest Version v0.7.1 source code.zip (2.8 MB)
Email in envelope

Get an email when there's a new version of verl

Home / v0.5.0
Name Modified Size InfoDownloads / Week
Parent folder
README.md 2025-07-23 19.6 kB
v0.5.0 source code.tar.gz 2025-07-23 1.1 MB
v0.5.0 source code.zip 2025-07-23 1.8 MB
Totals: 3 Items   3.0 MB 1

Highlights

Agentic RL rollout interface [beta]

verl v0.5 introduces the AgentLoop abstraction that allows easy extension to custom rollout with tool/agent interactions. Server-based asynchronous rollout is adopted to efficiently utilize GPUs. verl provides a few example agent loop implementations including:

Please check the documentation for the system architecture design.

Disaggregated placement & async training [prototype]

verl v0.5 includes a community-contributed one-step-off async training recipe, with trainer and rollout deployed on disaggregated resources and off-policy model updates with staleness = 1. In a small scale experiment, the reference recipe provides 20-40% throughput gain compared to the on-policy baseline depending on the configuration. Please checkout the code and documentation for example configurations.

Remote generative reward models [prototype]

A recipe is provided as a prototype to demonstrate the recommended way to use generative reward models in verl. Documentation and code.

New features

Important fixes & improvements

Breaking changes and deprecations

Full list: https://github.com/volcengine/verl/discussions/2270

Rollout

  • When generate_sequences with sampling params n>1, change DataProto repeat behavior:
  • chunk-dispatch-repeat: DataProto is chunked and dispatched to rollout workers, then repeated in rollout workers.
  • repeat-chunk-dispatch: DataProto is repeated by n in driver, then chunked and dispatched to rollout workers. Switch from chunk-dispatch-repeat to repeat-chunk-dispatch, this change may break almost all recipes and projects using verl GRPO as submodules. https://github.com/volcengine/verl/pull/2324

  • verl.workers.rollout.sglang_rollout.AsyncSglangServer is now renamed as AsyncSGLangServer

  • vllm <= v0.6 support is dropped

Multi-turn

Megatron

  • Megatron recomputation options are moved to *.megatron.override_transformer_config. https://github.com/volcengine/verl/pull/2651 Default values are:

    override_transformer_config: recompute_granularity: null recompute_modules:

    • core_attn recompute_method: null recompute_num_layers: null
    • Merged config actor_rollout_ref.(actor, ref, rollout).profiler to actor_rollout_ref.profiler

What's Changed

Trainer & FSDP

Rollout & SGLang

Megatron

Hardware

Misc fixes

New Contributors

Welcome new contributors to the verl community! @rhiremat @LeavesLei @diqiuzhuanzhuan @frrad @shuyhere @askender @Tavish9 @Wangmerlyn @SuperCB @tongtong0613 @jwong8314 @ji-huazhong @Keilo001 @conver334 @JoostvDoorn @mathewjhan @PopSoda2002 @rudeigerc @Titanpku @firefighter-eric @meituan-search @xihuai18 @tardis-key @ZihengJiang @Pursuer-Hsf @beep-bebop @aaronyeeio @Hecate0821 @apeforest @zhxieml

Full Changelog: https://github.com/volcengine/verl/compare/v0.4.1...v0.5.0

Source: README.md, updated 2025-07-23