| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| Parent folder | |||
| README.md | 2026-03-12 | 13.6 kB | |
| v0.2.3 source code.tar.gz | 2026-03-12 | 3.0 MB | |
| v0.2.3 source code.zip | 2026-03-12 | 3.3 MB | |
| Totals: 3 Items | 6.4 MB | 2 | |
v0.2.3 is here! Thanks to everyone who contributed to this release.
Major Updates
In addition to a broad set of bug fixes and stability improvements, v0.2.3 brings several major updates:
- YAML-based sglang_config support for engine group configuration This makes rollout setup much more flexible: you can now configure different parallelism strategies for PD disaggregation, enable EPD-style deployments, or even serve multiple heterogeneous models within one rollout setup more cleanly.
- Expanded model support, including GLM5, GLM-4.7-Flash and Qwen3.5.
- Dependency and runtime updates, including SGLang v0.5.9 docker support and multiple fixes for PD, NSA, HiCache, CP+PP, etc.
Other Notable Changes
- Added consistent hashing routing for multi-turn rollout
- Removed FSDP support so we can focus maintenance effort on the training and rollout paths we actively invest in
What's Changed
- fix: fix sglang regression by @nanjiangwill in https://github.com/THUDM/slime/pull/1363
- [docker] upgrade fla to 0.4.1 by @zhuzilin in https://github.com/THUDM/slime/pull/1452
- Allow passing pp_size by @zhuzilin in https://github.com/THUDM/slime/pull/1454
- Revert "fix: fix sglang regression" by @zhuzilin in https://github.com/THUDM/slime/pull/1457
- update [#1457] by @nanjiangwill in https://github.com/THUDM/slime/pull/1458
- [model] Add support for GLM4.7 Flash by @zhuzilin in https://github.com/THUDM/slime/pull/1460
- [script] Add example script for GLM4.7 Flash by @zhuzilin in https://github.com/THUDM/slime/pull/1467
- add lb default by @lilei199908 in https://github.com/THUDM/slime/pull/1465
- support non-symmetric int4 qat by @zhuzilin in https://github.com/THUDM/slime/pull/1472
- add convert hf to int4 without calibration dataset by @xieck13 in https://github.com/THUDM/slime/pull/1489
- [docker] fix nsa + hicache by @zhuzilin in https://github.com/THUDM/slime/pull/1494
- fix compute logprobs menmory leak bug by @lilei199908 in https://github.com/THUDM/slime/pull/1506
- [Fix] support converting torch_dist to hf for qwen3vl dense model by @p1k0pan in https://github.com/THUDM/slime/pull/1491
- renamed qwen3-vl.py to qwen3_vl.py to fix typo by @gxlvera in https://github.com/THUDM/slime/pull/1512
- sync internal features by @zhuzilin in https://github.com/THUDM/slime/pull/1513
- [docker] allow retract to 0 req during PD by @zhuzilin in https://github.com/THUDM/slime/pull/1515
- Pass through correct Megatron model provider PP args by @hari-hm in https://github.com/THUDM/slime/pull/1486
- fix: support mtp for qwen3-next by @huang3eng in https://github.com/THUDM/slime/pull/1503
- [Doc] Add doc for slime router by @Hecate0821 in https://github.com/THUDM/slime/pull/1499
- Add support for vlm checkpoints conversion by @cklxx in https://github.com/THUDM/slime/pull/1475
- fix: use aread() to fully consume HTTP response body by @ann-qin-lu in https://github.com/THUDM/slime/pull/1488
- Add response.aclose() and fix lint by @zhuzilin in https://github.com/THUDM/slime/pull/1520
- [docker] alleviate pd memory leakage by @zhuzilin in https://github.com/THUDM/slime/pull/1525
- Update convert to INT4 script path by @gxlvera in https://github.com/THUDM/slime/pull/1528
- [docker] add alloc_extend_torch_fallback for long context by @zhuzilin in https://github.com/THUDM/slime/pull/1530
- [docker] remove seq_len in _get_k_and_s_triton_kernel by @zhuzilin in https://github.com/THUDM/slime/pull/1531
- [Feature] Add megatron version for on policy distillation by @yitianlian in https://github.com/THUDM/slime/pull/1538
- Make
DataSourceimplement__len__to standardize the data source contract by @TSunny007 in https://github.com/THUDM/slime/pull/1518 - [Fix] Add fsdp assert for OPD by @yitianlian in https://github.com/THUDM/slime/pull/1545
- [bug] fix: gracefully handle datapoints with no multimodal input in a multimodal dataset by @hleehlee-amazon in https://github.com/THUDM/slime/pull/1535
- fix sglang hicache nsa bugs by @lilei199908 in https://github.com/THUDM/slime/pull/1549
- [docker] fix nsa retract by @zhuzilin in https://github.com/THUDM/slime/pull/1566
- [Multimodal] make multimodal processing robust by @coding-famer in https://github.com/THUDM/slime/pull/1516
- [Fix] Fix multimodal_train_inputs handling for mixed text-multimodal datasets by @coding-famer in https://github.com/THUDM/slime/pull/1559
- [Feature] add convert_torch_dist_to_hf_bridge.py by @coding-famer in https://github.com/THUDM/slime/pull/1573
- [Fix] Fix resuming from Megatron checkpoint when using bridge by @coding-famer in https://github.com/THUDM/slime/pull/1577
- [Feaure] Add Profile Config by @yitianlian in https://github.com/THUDM/slime/pull/1561
- [Feature] Add consistent hashing routing policy for rollout by @yitianlian in https://github.com/THUDM/slime/pull/1588
- [Fix] Minor fix for support distribute mode by @yitianlian in https://github.com/THUDM/slime/pull/1589
- [bug] fix: fix type hint in
Sampleclass by @hleehlee-amazon in https://github.com/THUDM/slime/pull/1551 - Add support for GLM5 by @zhuzilin in https://github.com/THUDM/slime/pull/1599
- fix(examples): update strands_sglang example to strands-sglang v0.2.x API by @Lawhy in https://github.com/THUDM/slime/pull/1593
- refactor: separate sglang argparse from megatron with two-phase parsing by @zhuzilin in https://github.com/THUDM/slime/pull/1600
- Add issue templates by @zhuzilin in https://github.com/THUDM/slime/pull/1602
- [cleanup] remove metric checker and long tests by @zhuzilin in https://github.com/THUDM/slime/pull/1603
- fix: use getattr for sglang params when rollout_only is disabled by @zhuzilin in https://github.com/THUDM/slime/pull/1604
- refactor: extract RolloutServerGroup and start_rollout_server by @zhuzilin in https://github.com/THUDM/slime/pull/1605
- refactor: delegate engine state and operations to RolloutServerGroup by @zhuzilin in https://github.com/THUDM/slime/pull/1606
- fix: sync docs with current implementation by @zhuzilin in https://github.com/THUDM/slime/pull/1608
- fix: restrict SymPy parser scope to prevent arbitrary code execution by @Hardik-369 in https://github.com/THUDM/slime/pull/1587
- refactor: split PD disaggregation into separate EngineGroups by @zhuzilin in https://github.com/THUDM/slime/pull/1609
- feat: move OPD to slime/rollout, add CI test and docs by @zhuzilin in https://github.com/THUDM/slime/pull/1610
- docs: add CI section to developer guide (EN + ZH) by @zhuzilin in https://github.com/THUDM/slime/pull/1612
- refactor: make EngineGroup ops non-blocking and batch ray.get at RolloutServer level by @zhuzilin in https://github.com/THUDM/slime/pull/1613
- Fix: disable allow_abbrev in _pre_parse_mode to prevent --load collision by @coding-famer in https://github.com/THUDM/slime/pull/1616
- fix: skip memory check for non-communication query functions in ReloadableProcessGroup by @zhuzilin in https://github.com/THUDM/slime/pull/1625
- Fix: Handle quantization formats during weight synchronization in Megatron bridge. by @GeLee-Q in https://github.com/THUDM/slime/pull/1624
- Add retries, which are not built into aiohttp by @joyliu-q in https://github.com/THUDM/slime/pull/1617
- [docker] upgrade to v0.5.9 by @zhuzilin in https://github.com/THUDM/slime/pull/1626
- Upgrade GitHub Actions for Node 24 compatibility by @salmanmkc in https://github.com/THUDM/slime/pull/1590
- add no colocate update critic only by @lilei199908 in https://github.com/THUDM/slime/pull/1567
- remove redundant log_rollout_data by @zhuzilin in https://github.com/THUDM/slime/pull/1629
- feat: add --sglang-config YAML for engine group configuration by @zhuzilin in https://github.com/THUDM/slime/pull/1614
- Fix [#1595]: pass rollout_id explicitly to offload_train by @yitianlian in https://github.com/THUDM/slime/pull/1631
- Fix [#1615]: tolerate abort_request connection failures by @yitianlian in https://github.com/THUDM/slime/pull/1632
- [docker] support pp + cp for dsa model by @zhuzilin in https://github.com/THUDM/slime/pull/1634
- [docker] fix sglang upgrade bug by @zhuzilin in https://github.com/THUDM/slime/pull/1639
- Add Qwen3.5 model support (27B dense and 35B-A3B MoE) by @zhuzilin in https://github.com/THUDM/slime/pull/1641
- [docker] fix int4 qat for upgraded sglang by @zhuzilin in https://github.com/THUDM/slime/pull/1642
- Add GLM-4.7-Flash example docs and 8xH100 training script by @zhuzilin in https://github.com/THUDM/slime/pull/1645
- [docker] supports bf16 deepep by @zhuzilin in https://github.com/THUDM/slime/pull/1651
- Add slime skills for rollout, reward, filter, eval config, and CI by @yitianlian in https://github.com/THUDM/slime/pull/1646
- [Feature] Add plugin contract test suite by @yitianlian in https://github.com/THUDM/slime/pull/1652
- support gpt-oss by @zhuzilin in https://github.com/THUDM/slime/pull/1658
- [docker] bugfixes on cp + pp by @zhuzilin in https://github.com/THUDM/slime/pull/1659
- [docker] remove true on policy patches by @zhuzilin in https://github.com/THUDM/slime/pull/1661
- [fix]: Qwen3.5-35B-A3B 8-GPU: set TP size to 2 for num_query_groups=2 by @none0663 in https://github.com/THUDM/slime/pull/1662
- Remove FSDP support by @zhuzilin in https://github.com/THUDM/slime/pull/1664
- docs: add OpenClaw-RL to projects built upon slime by @yinjjiew in https://github.com/THUDM/slime/pull/1635
- Support setting update weights in sglang_config by @zhuzilin in https://github.com/THUDM/slime/pull/1665
- [fix] Fix numerical accuracy issue in dynamic sampling filter by @Django-Jiang in https://github.com/THUDM/slime/pull/1674
- sync from internal by @zhuzilin in https://github.com/THUDM/slime/pull/1677
- bugfixes from community by @zhuzilin in https://github.com/THUDM/slime/pull/1678
- Fix: pass return_tensors in text_kwargs for transformers>=5.0.0 compatibility by @coding-famer in https://github.com/THUDM/slime/pull/1648
- Fix missing packed_seq_params in bshd qkv_format by @coding-famer in https://github.com/THUDM/slime/pull/1649
- [Multimodal][Model] Qwen3.5 VL training example/support by @coding-famer in https://github.com/THUDM/slime/pull/1676
- update docs by @zhuzilin in https://github.com/THUDM/slime/pull/1680
- update docs by @zhuzilin in https://github.com/THUDM/slime/pull/1681
- support offloading non-updatable server by @zhuzilin in https://github.com/THUDM/slime/pull/1668
- bugfix by @zhuzilin in https://github.com/THUDM/slime/pull/1685
- fix: handle Qwen3.5 in quantize_params_fp8 by @lawrence-harmonic in https://github.com/THUDM/slime/pull/1683
- bugfix by @zhuzilin in https://github.com/THUDM/slime/pull/1687
- Fix Qwen3.5 & Qwen3-Next linear attention cu_seqlens missing by @huang3eng in https://github.com/THUDM/slime/pull/1686
- fix: use semantic version comparison for PyTorch >= 2.6 detection by @abatilo in https://github.com/THUDM/slime/pull/1667
- [Fix] Minor fix for properly finishing / flushing wandb logging metrics at exit by @silunw in https://github.com/THUDM/slime/pull/1592
- Autofix/issue 1578 hf2megatron arg suffix by @yitianlian in https://github.com/THUDM/slime/pull/1636
- bugfix by @zhuzilin in https://github.com/THUDM/slime/pull/1688
- fix(examples): update strands_sglang example to v0.3.x API by @Lawhy in https://github.com/THUDM/slime/pull/1684
- [docker] cherry pick qwen3.5 bugfix by @zhuzilin in https://github.com/THUDM/slime/pull/1691
- bugfix/fix Qwen3.5 dense model precision bug in TP_SIZE>1 from sglang by @mzusman in https://github.com/THUDM/slime/pull/1705
- Fix/qwen3 5 mtp bridge by @huang3eng in https://github.com/THUDM/slime/pull/1702
- support epd for glm4.6v by @hanwen-sun in https://github.com/THUDM/slime/pull/1704
- [docker] support epd for glm4.6v by @zhuzilin in https://github.com/THUDM/slime/pull/1707
- [docker] store v0.5.9 patch by @zhuzilin in https://github.com/THUDM/slime/pull/1710
- Add GLM-4.7-Flash MTP training support by @zhuzilin in https://github.com/THUDM/slime/pull/1712
- [release] bump to v0.2.3 by @zhuzilin in https://github.com/THUDM/slime/pull/1682
New Contributors
- @p1k0pan made their first contribution in https://github.com/THUDM/slime/pull/1491
- @hari-hm made their first contribution in https://github.com/THUDM/slime/pull/1486
- @huang3eng made their first contribution in https://github.com/THUDM/slime/pull/1503
- @ann-qin-lu made their first contribution in https://github.com/THUDM/slime/pull/1488
- @TSunny007 made their first contribution in https://github.com/THUDM/slime/pull/1518
- @hleehlee-amazon made their first contribution in https://github.com/THUDM/slime/pull/1535
- @Hardik-369 made their first contribution in https://github.com/THUDM/slime/pull/1587
- @joyliu-q made their first contribution in https://github.com/THUDM/slime/pull/1617
- @salmanmkc made their first contribution in https://github.com/THUDM/slime/pull/1590
- @yinjjiew made their first contribution in https://github.com/THUDM/slime/pull/1635
- @Django-Jiang made their first contribution in https://github.com/THUDM/slime/pull/1674
- @lawrence-harmonic made their first contribution in https://github.com/THUDM/slime/pull/1683
- @abatilo made their first contribution in https://github.com/THUDM/slime/pull/1667
- @silunw made their first contribution in https://github.com/THUDM/slime/pull/1592
- @mzusman made their first contribution in https://github.com/THUDM/slime/pull/1705
- @hanwen-sun made their first contribution in https://github.com/THUDM/slime/pull/1704
Full Changelog: https://github.com/THUDM/slime/compare/v0.2.2...v0.2.3