Name | Modified | Size | Downloads / Week |
---|---|---|---|
README.md | 2025-05-22 | 2.7 kB | |
v0.5.10_ Qwen3 MOE support, Sparsemax kernel, bug fixes source code.tar.gz | 2025-05-22 | 3.7 MB | |
v0.5.10_ Qwen3 MOE support, Sparsemax kernel, bug fixes source code.zip | 2025-05-22 | 3.8 MB | |
Totals: 3 Items | 7.5 MB | 0 |
What's Changed
- fix zip bug by @KareemMusleh in https://github.com/linkedin/Liger-Kernel/pull/702
- [dpo] set default average_log_prob to False by @cyr0930 in https://github.com/linkedin/Liger-Kernel/pull/693
- Rank build status lower by @momochen in https://github.com/linkedin/Liger-Kernel/pull/707
- Add support for Qwen3 MoE models by @chiwanpark in https://github.com/linkedin/Liger-Kernel/pull/706
- Fix qwen3_moe flaky convergence test by @vaibhavjindal in https://github.com/linkedin/Liger-Kernel/pull/710
- Fix empty Medusa head tensors by @chiwanpark in https://github.com/linkedin/Liger-Kernel/pull/698
- Sparsemax by @AndreSlavescu in https://github.com/linkedin/Liger-Kernel/pull/687
- fix: remove docstring imports in transformer patches by @NanoCode012 in https://github.com/linkedin/Liger-Kernel/pull/712
- Increase tests timeout to 45 mins by @vaibhavjindal in https://github.com/linkedin/Liger-Kernel/pull/718
- fix modal tests by @shivam15s in https://github.com/linkedin/Liger-Kernel/pull/719
- Visualizer Update by @AndreSlavescu in https://github.com/linkedin/Liger-Kernel/pull/717
- Sparsemax Documentation by @AndreSlavescu in https://github.com/linkedin/Liger-Kernel/pull/716
- Element-wise DyT that is faster than the original LigerDyT by @mdy666 in https://github.com/linkedin/Liger-Kernel/pull/673
- GRPO loss kernel written entirely in Triton, saving 46G of memory by @mdy666 in https://github.com/linkedin/Liger-Kernel/pull/672
- Make FLCE compatible with FSDP and PEFT by @astefanutti in https://github.com/linkedin/Liger-Kernel/pull/674
- Fix incorrect module patching when using LoRA with modules_to_save by @BenasdTW in https://github.com/linkedin/Liger-Kernel/pull/632
- [XPU] Changed how XPU discovery works during `setup.py` by @Egor-Krivov in https://github.com/linkedin/Liger-Kernel/pull/720
- Fix to publish docs on pushes to main branch by @shimizust in https://github.com/linkedin/Liger-Kernel/pull/722
- Release 0.5.10 by @shimizust in https://github.com/linkedin/Liger-Kernel/pull/725
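For readers unfamiliar with the Sparsemax operation named in the list above, here is a minimal plain-Python sketch of the standard sort-and-threshold formulation of sparsemax (the Euclidean projection of a score vector onto the probability simplex). It is a reference illustration only and is not the Triton kernel added in this release; the function name `sparsemax` and its list-based interface are assumptions made for this example.

```python
def sparsemax(z):
    """Return the sparsemax of a list of scores.

    Illustrative reference only (hypothetical helper, not Liger-Kernel code):
    finds the threshold tau such that sum(max(z_i - tau, 0)) == 1.
    """
    z_sorted = sorted(z, reverse=True)
    cumsum = 0.0
    k = 0              # size of the support (number of non-zero outputs)
    support_sum = 0.0  # sum of the k largest scores
    for j, zj in enumerate(z_sorted, start=1):
        cumsum += zj
        # The j-th largest score stays in the support while this holds.
        if 1.0 + j * zj > cumsum:
            k = j
            support_sum = cumsum
    tau = (support_sum - 1.0) / k
    # Scores at or below the threshold are clipped to exactly zero.
    return [max(zi - tau, 0.0) for zi in z]


if __name__ == "__main__":
    print(sparsemax([1.0, 2.0, 3.0]))
    print(sparsemax([0.5, 0.5]))
```

Unlike softmax, the output can contain exact zeros, which is why the operation is useful for producing sparse attention or probability distributions.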
New Contributors
- @KareemMusleh made their first contribution in https://github.com/linkedin/Liger-Kernel/pull/702
- @cyr0930 made their first contribution in https://github.com/linkedin/Liger-Kernel/pull/693
- @NanoCode012 made their first contribution in https://github.com/linkedin/Liger-Kernel/pull/712
- @mdy666 made their first contribution in https://github.com/linkedin/Liger-Kernel/pull/673
- @astefanutti made their first contribution in https://github.com/linkedin/Liger-Kernel/pull/674
- @Egor-Krivov made their first contribution in https://github.com/linkedin/Liger-Kernel/pull/720
Full Changelog: https://github.com/linkedin/Liger-Kernel/compare/v0.5.9...v0.5.10