verl v0.6.1

Highlights

Trainer

  • support fp16 training (FSDP/Megatron)

Megatron

  • support 1f1b_overlap/moe_a2a_overlap
  • support Qwen3-VL MoE and dense models
  • support Qwen2.5-VL and Qwen3-VL with context parallelism

Rollout

  • Use the vLLM and SGLang release images as the CI base image; upgrade to vllm==0.11.0 and sglang==0.5.5
  • Prometheus monitoring
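
To confirm that a local environment matches the versions pinned above, a small check along the following lines may help. This is only a sketch: the helper name `check_pins` and the `EXPECTED_PINS` table are illustrative and not part of verl; only `importlib.metadata` from the Python standard library is used.

```python
from importlib.metadata import PackageNotFoundError, version

# Pins taken from the release notes above.
EXPECTED_PINS = {"vllm": "0.11.0", "sglang": "0.5.5"}


def check_pins(expected, lookup=version):
    """Return {package: problem} for every package that is missing or off-pin.

    `lookup` is injectable so the check can be exercised without the
    packages installed (hypothetical helper, not a verl API).
    """
    problems = {}
    for name, wanted in expected.items():
        try:
            found = lookup(name)
        except PackageNotFoundError:
            problems[name] = "not installed"
            continue
        if found != wanted:
            problems[name] = f"found {found}, expected {wanted}"
    return problems


if __name__ == "__main__":
    for name, problem in check_pins(EXPECTED_PINS).items():
        print(f"{name}: {problem}")
```

The comparison is an exact string match, so a build with a local version suffix would be reported as a mismatch.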

Algorithm

  • Rollout Correction: comprehensive overhaul of the rollout correction system with typed configuration, mathematical documentation, and performance optimizations.
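
The release notes do not spell out the correction itself. As a rough illustration of the general idea, one common way to correct for a mismatch between the policy that generated a rollout and the policy being trained is a truncated importance-sampling weight per token. The sketch below assumes that form; the function name `truncated_is_weights` and the `clip_max` parameter are hypothetical and are not taken from verl's configuration or code.

```python
import math


def truncated_is_weights(train_logprobs, rollout_logprobs, clip_max=2.0):
    """Per-token weights min(exp(train - rollout), clip_max).

    train_logprobs:   log-probabilities of the sampled tokens under the
                      policy being trained.
    rollout_logprobs: log-probabilities of the same tokens under the
                      policy that generated the rollout.
    clip_max:         upper bound on each weight (assumed value).
    """
    if len(train_logprobs) != len(rollout_logprobs):
        raise ValueError("log-probability sequences must have equal length")
    return [
        min(math.exp(t - r), clip_max)
        for t, r in zip(train_logprobs, rollout_logprobs)
    ]


if __name__ == "__main__":
    # Identical log-probs give weight 1; a large gap is capped at clip_max.
    print(truncated_is_weights([-1.0, -0.5], [-1.0, -4.0]))
```

Capping the ratio bounds the variance that a few badly mismatched tokens could otherwise introduce, at the cost of some bias.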

Recipe

Introduce two new experimental recipes, which will be gradually merged into main in future releases.

  • Fully Async Policy Trainer: fully asynchronous PPO training system that completely decouples the Trainer and Rollouter, supporting asynchronous sample generation and training.
  • TransferQueue Data System: an asynchronous streaming data management system for efficient post-training.
  • FlowRL
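
The decoupling described for the Fully Async Policy Trainer can be pictured as a producer and a consumer that share a bounded queue. The sketch below shows only that general pattern with the standard-library `asyncio` module; the names `rollouter`, `trainer`, and `run` are illustrative and do not reflect the recipe's actual classes or interfaces.

```python
import asyncio


async def rollouter(queue, num_samples):
    """Produce samples without waiting for the trainer to finish a step."""
    for i in range(num_samples):
        await asyncio.sleep(0)  # stand-in for sample generation
        await queue.put(f"sample-{i}")
    await queue.put(None)  # sentinel: no more samples


async def trainer(queue, batch_size):
    """Consume samples as they arrive and group them into batches."""
    batches, batch = [], []
    while True:
        item = await queue.get()
        if item is None:
            break
        batch.append(item)
        if len(batch) == batch_size:
            batches.append(batch)  # stand-in for one training step
            batch = []
    if batch:
        batches.append(batch)  # final partial batch
    return batches


async def run(num_samples=5, batch_size=2, queue_size=4):
    # The bounded queue limits how far generation can run ahead of training.
    queue = asyncio.Queue(maxsize=queue_size)
    _, batches = await asyncio.gather(
        rollouter(queue, num_samples), trainer(queue, batch_size)
    )
    return batches


if __name__ == "__main__":
    print(asyncio.run(run()))
```

The queue bound is the main design knob in this pattern: a larger bound lets generation run further ahead, which also means training sees samples from an older policy.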

Important bug fixes

What's Changed

New Contributors

Full Changelog: https://github.com/volcengine/verl/compare/v0.6.0...v0.6.1

Source: README.md, updated 2025-11-14