DeepSpeed - Browse /v0.17.6 at SourceForge.net

The interactive file manager requires Javascript. Please enable it or use sftp or scp.
You may still browse the files here.

Name	Modified	Size	InfoDownloads / Week
Parent folder
README.md	2025-09-19	5.3 kB	1
v0.17.6 Patch Release source code.tar.gz	2025-09-19	204.4 MB	1
v0.17.6 Patch Release source code.zip	2025-09-19	205.6 MB	3
Totals: 3 Items		410.0 MB	5

What's Changed

Update version.txt after 0.17.5 release by @loadams in https://github.com/deepspeedai/DeepSpeed/pull/7502
Support DeepSpeed offload and reload states with ZeRO1 and ZeRO2 by @LYMDLUT in https://github.com/deepspeedai/DeepSpeed/pull/7421
CI funding shout out to modal.com by @stas00 in https://github.com/deepspeedai/DeepSpeed/pull/7503
Fix assert when 'pp_int' object has no attribute 'custom_print_str' by @aeeeeeep in https://github.com/deepspeedai/DeepSpeed/pull/7507
Update TSC Committers by @PKUWZP in https://github.com/deepspeedai/DeepSpeed/pull/7517
Enabling Muon Optimizer in DeepSpeed by @PKUWZP in https://github.com/deepspeedai/DeepSpeed/pull/7509
Enable non-ZeRO mode by @sfc-gh-truwase in https://github.com/deepspeedai/DeepSpeed/pull/7515
Update README with ZenFlow release blog featured by PyTorch. by @Antlera in https://github.com/deepspeedai/DeepSpeed/pull/7520
Add riscv64 cpu support in deepspeed_shm_comm op by @heyujiao99 in https://github.com/deepspeedai/DeepSpeed/pull/7519
ZeRO3: Improve mismatch detection by @sfc-gh-truwase in https://github.com/deepspeedai/DeepSpeed/pull/7525
fix typo s/1014 /1024 by @digger-yu in https://github.com/deepspeedai/DeepSpeed/pull/7528
undo the revert by @stas00 in https://github.com/deepspeedai/DeepSpeed/pull/7536
[logging] less startup noise by @stas00 in https://github.com/deepspeedai/DeepSpeed/pull/7526
[doc] fixing moe tutorial by @stas00 in https://github.com/deepspeedai/DeepSpeed/pull/7538
docs typo: lrrt.md, reference to cycle_min_lr should be cycle_max_lr by @jakehemmerle in https://github.com/deepspeedai/DeepSpeed/pull/7530
fixed DeepSpeedCPULion with ZeRO-Offload bug by @qibin0506 in https://github.com/deepspeedai/DeepSpeed/pull/7531
Fix scaling and allgather with torch.autocast by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7534
Fix zenflow_torch_adam.py by @stas00 in https://github.com/deepspeedai/DeepSpeed/pull/7544
Relax restrictions of torch.autocast integration by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7543
Autotune ZenFlow affinity by @delock in https://github.com/deepspeedai/DeepSpeed/pull/7506
fix get_cuda_compile_flag by @mingjielu in https://github.com/deepspeedai/DeepSpeed/pull/7521
avoid setting device_id to init_process_group by @kaixuanliu in https://github.com/deepspeedai/DeepSpeed/pull/7542
Improve error message and reduce validation in autocast test by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7547
Revert "Add index to HPU devices (#7497)" by @deepcharm in https://github.com/deepspeedai/DeepSpeed/pull/7545
[ALST tutorial] support bs>1 by @sfc-gh-sbekman in https://github.com/deepspeedai/DeepSpeed/pull/7550
[MoE] Fix misuse of num_experts as expert parallel group size (ep_size) by @Flakes342 in https://github.com/deepspeedai/DeepSpeed/pull/7551
Limit random seed range in tests by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7553
Fix gradient buffer access for DeepCompile Z1/2 by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7548
Move modal tests to tests/v1 by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7557
Add dependency for deepcompile test by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7558
deepcompile: Create dummy inputs using empty_strided by @eternalNight in https://github.com/deepspeedai/DeepSpeed/pull/7564
deepcompile: Record graph order using OrderedDict by @eternalNight in https://github.com/deepspeedai/DeepSpeed/pull/7563
deepcompile: Create a full list of no-copy ops by @eternalNight in https://github.com/deepspeedai/DeepSpeed/pull/7562
fix npu device_id AttributeError issue by @we1sper in https://github.com/deepspeedai/DeepSpeed/pull/7560
Make Muon optimizer easier to enable by @delock in https://github.com/deepspeedai/DeepSpeed/pull/7555
scripts: Check .is_cuda only in non-C++ files by @eternalNight in https://github.com/deepspeedai/DeepSpeed/pull/7561
[bugfix] fix partition context unpatch by @hjh0119 in https://github.com/deepspeedai/DeepSpeed/pull/7566

New Contributors

@LYMDLUT made their first contribution in https://github.com/deepspeedai/DeepSpeed/pull/7421
@aeeeeeep made their first contribution in https://github.com/deepspeedai/DeepSpeed/pull/7507
@heyujiao99 made their first contribution in https://github.com/deepspeedai/DeepSpeed/pull/7519
@jakehemmerle made their first contribution in https://github.com/deepspeedai/DeepSpeed/pull/7530
@qibin0506 made their first contribution in https://github.com/deepspeedai/DeepSpeed/pull/7531
@mingjielu made their first contribution in https://github.com/deepspeedai/DeepSpeed/pull/7521
@kaixuanliu made their first contribution in https://github.com/deepspeedai/DeepSpeed/pull/7542
@sfc-gh-sbekman made their first contribution in https://github.com/deepspeedai/DeepSpeed/pull/7550
@Flakes342 made their first contribution in https://github.com/deepspeedai/DeepSpeed/pull/7551
@we1sper made their first contribution in https://github.com/deepspeedai/DeepSpeed/pull/7560
@hjh0119 made their first contribution in https://github.com/deepspeedai/DeepSpeed/pull/7566

Full Changelog: https://github.com/deepspeedai/DeepSpeed/compare/v0.17.5...v0.17.6

Source: README.md, updated 2025-09-19

DeepSpeed Files

Deep learning optimization library: makes distributed training easy

What's Changed

New Contributors

DeepSpeed Files

Deep learning optimization library: makes distributed training easy

Get an email when there's a new version of DeepSpeed

What's Changed

New Contributors