| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| Parent folder | |||
| README.md | 2026-03-05 | 3.0 kB | |
| v0.18.7 Patch Release source code.tar.gz | 2026-03-05 | 216.2 MB | |
| v0.18.7 Patch Release source code.zip | 2026-03-05 | 217.4 MB | |
| Totals: 3 Items | 433.6 MB | 0 | |
What's Changed
- Update version post release by @loadams in https://github.com/deepspeedai/DeepSpeed/pull/7850
- Z1/2 init: flatten params on device by @ksugama in https://github.com/deepspeedai/DeepSpeed/pull/7828
- Enable shm_comm support for arm by @phalani-paladugu in https://github.com/deepspeedai/DeepSpeed/pull/7800
- Add news entry for DeepSpeed updates by @PKUWZP in https://github.com/deepspeedai/DeepSpeed/pull/7854
- Add EXAONE 4.0 model support for Inference V2 by @Bias92 in https://github.com/deepspeedai/DeepSpeed/pull/7853
- Fix ROCm BF16 conversion intrinsics in inference v2 (#7843) by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7846
- Fix compilation of Evoformer by @Flamefire in https://github.com/deepspeedai/DeepSpeed/pull/7862
- Throw error when parameter is modified in GatheredParameters by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7832
- Fix Zero-3 static scale assertion in fp16 test by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7866
- Schedule nightly full test by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7870
- Fix broken links and add AutoTP Training tutorial to sidebar nav by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7874
- fix: replace 35 bare except clauses with except Exception by @haosenwang1018 in https://github.com/deepspeedai/DeepSpeed/pull/7873
- perf: use deque for FIFO queues in sequence parallel, superoffload, and compile by @giulio-leone in https://github.com/deepspeedai/DeepSpeed/pull/7880
- Fix: only add parameter with grads to parameter group by @delock in https://github.com/deepspeedai/DeepSpeed/pull/7869
- Fix no-grad grad-fn lookup in ZeRO hook counting on PyTorch 2.3 (#7830) by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7841
- Fix import deepspeed crash on PyTorch v2.3 + Python 3.12 by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7875
- XPU use stock pytorch instead of Intel Extension for PyTorch by @delock in https://github.com/deepspeedai/DeepSpeed/pull/7877
- Remove amp() from abstract accelerator by @delock in https://github.com/deepspeedai/DeepSpeed/pull/7879
- Add document section explaining autocast nesting by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7883
- Fix hook count performance regression from v0.18.5 by @tohtana in https://github.com/deepspeedai/DeepSpeed/pull/7886
New Contributors
- @ksugama made their first contribution in https://github.com/deepspeedai/DeepSpeed/pull/7828
- @phalani-paladugu made their first contribution in https://github.com/deepspeedai/DeepSpeed/pull/7800
- @Bias92 made their first contribution in https://github.com/deepspeedai/DeepSpeed/pull/7853
- @haosenwang1018 made their first contribution in https://github.com/deepspeedai/DeepSpeed/pull/7873
- @giulio-leone made their first contribution in https://github.com/deepspeedai/DeepSpeed/pull/7880
Full Changelog: https://github.com/deepspeedai/DeepSpeed/compare/v0.18.6...v0.18.7