| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| Parent folder | |||
| README.md | 2025-11-04 | 1.4 kB | |
| v0.18.2 Patch Release source code.tar.gz | 2025-11-04 | 215.2 MB | |
| v0.18.2 Patch Release source code.zip | 2025-11-04 | 216.3 MB | |
| Totals: 3 Items | 431.5 MB | 2 | |
What's Changed
- Update version after 0.18.1 release by @loadams in https://github.com/deepspeedai/DeepSpeed/pull/7647
- Deduplicate fp32 weights under torch autocast and ZeRO3 by @eternalNight in https://github.com/deepspeedai/DeepSpeed/pull/7651
- ulysses mpu: additional api by @stas00 in https://github.com/deepspeedai/DeepSpeed/pull/7649
- ALST/UlyssesSP: more intuitive API wrt variable seqlen by @stas00 in https://github.com/deepspeedai/DeepSpeed/pull/7656
- Fix misplaced overflow handling return in fused_optimizer.py by @rraminen in https://github.com/deepspeedai/DeepSpeed/pull/7645
- [bug]: fixed comm_dtype in extra_large_param_to_reduce by @therealnaveenkamal in https://github.com/deepspeedai/DeepSpeed/pull/7660
- UlyssesSP: TiledMLP doc - recomputes forward twice by @stas00 in https://github.com/deepspeedai/DeepSpeed/pull/7664
- resolved a 0-dim tensor slicing bug from _get_state_without_padding by @therealnaveenkamal in https://github.com/deepspeedai/DeepSpeed/pull/7659
- Fix typo in pytorch-profiler.md documentation by @kunheek in https://github.com/deepspeedai/DeepSpeed/pull/7652
- README refresh by @sfc-gh-truwase in https://github.com/deepspeedai/DeepSpeed/pull/7668
New Contributors
- @kunheek made their first contribution in https://github.com/deepspeedai/DeepSpeed/pull/7652
Full Changelog: https://github.com/deepspeedai/DeepSpeed/compare/v0.18.1...v0.18.2