
Name	Modified	Size	Downloads / Week
Patch release 4.55.1 source code.tar.gz	2025-08-13	18.9 MB	0
Patch release 4.55.1 source code.zip	2025-08-13	24.0 MB	0
README.md	2025-08-13	1.2 kB	0
Totals: 3 Items		43.0 MB	0

Patch release 4.55.1:

Mostly focused on stabilizing MXFP4 for the GPT-OSS model!

Bug Fixes & Improvements

Idefics2, Idefics3, SmolVLM – Fix tensor device issue (#39975) by @qgallouedec
Merge conflicts – Fix merge conflicts from previous changes by @vasqu
MXFP4 / CPU device_map – Default to dequantize when CPU is in device_map (#39993) by @MekkCyber
GPT Big Code – Fix attention scaling (#40041) by @vasqu
Windows compatibility – Resolve Triton version check compatibility (#39986) by @Tsumugii24 @MekkCyber
Gemma3n model – Add missing None default values for get_placeholder_mask (#39991, #40024) by @Znerual
Fuyu model – Fix broken image inference (#39915) by @Isotr0py
PerceptionLM – Fix missing video inputs (#39971) by @shuminghu
Idefics – Fix device mismatch (#39981) by @zucchini-nlp
Triton kernels – Remove triton_kernels dependency in favor of included kernels (#39926) by @SunMarc
GPT-OSS MXFP4 – Enable on older hardware (sm75+) (#39940) by @matthewdouglas @SunMarc
MXFP4 quantizer – Allow CPU inference with dequantize option (#39953) by @returnL
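
The MXFP4 entries above describe a fallback: dequantize when CPU appears in the device_map, and run the quantized path on sm75 or newer GPUs. A minimal sketch of that decision follows. The names `cpu_in_device_map`, `pick_mxfp4_mode`, and `MIN_CAPABILITY` are hypothetical and are not taken from the library, and the choice to dequantize on GPUs below sm75 is an assumption rather than something the notes state.

```python
from typing import Mapping, Optional, Tuple, Union

# Hypothetical helpers for illustration only; not the library's implementation.
MIN_CAPABILITY = (7, 5)  # "sm75+" as stated in the release note

DeviceMap = Union[str, Mapping[str, object], None]


def cpu_in_device_map(device_map: DeviceMap) -> bool:
    """Return True when the map explicitly places something on the CPU."""
    if device_map is None:
        return False
    if isinstance(device_map, str):
        # A string such as "auto" does not name the CPU explicitly.
        return device_map == "cpu"
    return any(str(target) == "cpu" for target in device_map.values())


def pick_mxfp4_mode(
    device_map: DeviceMap,
    capability: Optional[Tuple[int, int]],
) -> str:
    """Choose between the quantized path and dequantized weights.

    `capability` is the GPU compute capability as (major, minor),
    or None when no GPU is available.
    """
    if cpu_in_device_map(device_map):
        return "dequantize"
    # Assumption: hardware below sm75, or no GPU at all, also dequantizes.
    if capability is None or capability < MIN_CAPABILITY:
        return "dequantize"
    return "mxfp4"
```

For example, `pick_mxfp4_mode({"model.layers.0": 0, "lm_head": "cpu"}, (9, 0))` selects the dequantized path because one module is mapped to the CPU, while `pick_mxfp4_mode("auto", (7, 5))` keeps the quantized path.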

Source: README.md, updated 2025-08-13