Download Latest Version v0.2.4 source code.tar.gz (5.5 MB)
Email in envelope

Get an email when there's a new version of slime LLM

Home / v0.2.4
Name Modified Size InfoDownloads / Week
Parent folder
README.md 2026-03-29 5.5 kB
v0.2.4 source code.tar.gz 2026-03-29 5.5 MB
v0.2.4 source code.zip 2026-03-29 5.8 MB
Totals: 3 Items   11.3 MB 1

v0.2.4 is here! Thanks to everyone who contributed to this release.

Major Updates

In addition to a broad set of bug fixes and stability improvements, v0.2.4 brings several major updates:

  • Profiling and observability improvements Added a rollout trace timeline viewer and W&B reporting for dynamic ITL / TTFT percentile metrics.
  • Router stack unified on sgl-router Consolidated the router stack onto sgl-router and removed slime-router.
  • Expanded multimodal and model support Improved support for GLM-4.6V / GLM4V, Multimodal OPD, and Qwen3.5-related workflows.

Other Notable Changes

  • Fixed CUDA IPC cache leaks during weight updates
  • Fixed SP/CP gradient inflation in FLA layers

What's Changed

New Contributors

Full Changelog: https://github.com/THUDM/slime/compare/v0.2.3...v0.2.4

Source: README.md, updated 2026-03-29