Name | Modified | Size | Downloads / Week |
---|---|---|---|
Parent folder | |||
README.md | 2025-10-07 | 4.5 kB | |
v0.18.0 source code.tar.gz | 2025-10-07 | 215.2 MB | |
v0.18.0 source code.zip | 2025-10-07 | 216.3 MB | |
Totals: 3 Items | 431.5 MB | 6 |
What's Changed
- Update version.txt post 0.17.6 release by @loadams in https://github.com/deepspeedai/DeepSpeed/pull/7572
- DeepCompile ZeRO-3: robust allgather for uneven shards; fix profiling… by @juyterman1000 in https://github.com/deepspeedai/DeepSpeed/pull/7489
- logging: Also set log level of logger handlers by @eternalNight in https://github.com/deepspeedai/DeepSpeed/pull/7576
- Deepcompile: Fix bugs when applying deepcompile to VLA-like models by @eternalNight in https://github.com/deepspeedai/DeepSpeed/pull/7569
- Broadcast fp16 overflow in Z1 by @sfc-gh-truwase in https://github.com/deepspeedai/DeepSpeed/pull/7580
- Deepcompile: Make size of activation to free configurable by @eternalNight in https://github.com/deepspeedai/DeepSpeed/pull/7582
- SuperOffload Release by @xylian86 in https://github.com/deepspeedai/DeepSpeed/pull/7559
- Include init file for superoffload folder by @nguyen599 in https://github.com/deepspeedai/DeepSpeed/pull/7591
- disables ZeRO checkpoint loading path when stage=0 by @therealnaveenkamal in https://github.com/deepspeedai/DeepSpeed/pull/7586
- Simplify leaf module hook by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7592
- Fix the universal checkpoint issue for stage3 when there are multiple subgroups. by @zhengchenyu in https://github.com/deepspeedai/DeepSpeed/pull/7585
- Change current_device() to current_device_name() by @delock in https://github.com/deepspeedai/DeepSpeed/pull/7600
- Fixed the problem of loading universal checkpoint error in multi-machine mode. by @zhengchenyu in https://github.com/deepspeedai/DeepSpeed/pull/7601
- DeepCompile: Specify tensor aliasing in C++ op schema by @eternalNight in https://github.com/deepspeedai/DeepSpeed/pull/7597
- DeepCompile: Fuse allgather and downcast by @eternalNight in https://github.com/deepspeedai/DeepSpeed/pull/7588
- Add blog for SuperOffload by @xylian86 in https://github.com/deepspeedai/DeepSpeed/pull/7594
- Add venv to .gitignore by @zhengchenyu in https://github.com/deepspeedai/DeepSpeed/pull/7605
- Handle the case of DeepCompile's enabled but not activated by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7603
- DeepCompile: Fix IPG bucket clearing by @eternalNight in https://github.com/deepspeedai/DeepSpeed/pull/7610
- Minor fix in the SuperOffload blog by @xylian86 in https://github.com/deepspeedai/DeepSpeed/pull/7612
- Fixed the issue that universal checkpoint cannot be loaded for stage3 when world size expansion. by @zhengchenyu in https://github.com/deepspeedai/DeepSpeed/pull/7599
- Fixed save_checkpoint race when consolidating NVMe offloaded tensors by @H1manshu21 in https://github.com/deepspeedai/DeepSpeed/pull/7613
- [wall_clock_breakdown] always log stats when enabled by @stas00 in https://github.com/deepspeedai/DeepSpeed/pull/7617
- DeepCompile: Use min_cut_rematerialization for partitioning joint graphs by @eternalNight in https://github.com/deepspeedai/DeepSpeed/pull/7609
- Show mismatching values when DeepCompile test fails by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7618
- Improve leaf module interface (enable via config, relax matching criteria, add document, etc.) by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7604
- add print_dist util by @stas00 in https://github.com/deepspeedai/DeepSpeed/pull/7621
- Super offload blog Chinese version by @delock in https://github.com/deepspeedai/DeepSpeed/pull/7620
- Enable grad scaler for ZeRO-0 + torch.autocast path by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7619
- Blog of zenflow binding study by @delock in https://github.com/deepspeedai/DeepSpeed/pull/7614
- Clarify document of leaf module config by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7623
- [TiledMLP] moe support by @stas00 in https://github.com/deepspeedai/DeepSpeed/pull/7622
- Update email address by @sfc-gh-truwase in https://github.com/deepspeedai/DeepSpeed/pull/7624
New Contributors
- @juyterman1000 made their first contribution in https://github.com/deepspeedai/DeepSpeed/pull/7489
- @nguyen599 made their first contribution in https://github.com/deepspeedai/DeepSpeed/pull/7591
- @zhengchenyu made their first contribution in https://github.com/deepspeedai/DeepSpeed/pull/7585
- @H1manshu21 made their first contribution in https://github.com/deepspeedai/DeepSpeed/pull/7613
Full Changelog: https://github.com/deepspeedai/DeepSpeed/compare/v0.17.6...v0.18.0