FlashInfer - Browse /v0.6.6 at SourceForge.net

The interactive file manager requires Javascript. Please enable it or use sftp or scp.
You may still browse the files here.

Name	Modified	Size	InfoDownloads / Week
Parent folder
flashinfer_jit_cache-0.6.6+cu130-cp39-abi3-manylinux_2_28_aarch64.whl	2026-03-11	1.8 GB	0
flashinfer_jit_cache-0.6.6+cu130-cp39-abi3-manylinux_2_28_x86_64.whl	2026-03-11	1.8 GB	0
flashinfer_jit_cache-0.6.6+cu129-cp39-abi3-manylinux_2_28_aarch64.whl	2026-03-11	1.6 GB	0
flashinfer_jit_cache-0.6.6+cu129-cp39-abi3-manylinux_2_28_x86_64.whl	2026-03-11	1.6 GB	0
flashinfer_jit_cache-0.6.6+cu128-cp39-abi3-manylinux_2_28_aarch64.whl	2026-03-11	1.2 GB	0
flashinfer_jit_cache-0.6.6+cu128-cp39-abi3-manylinux_2_28_x86_64.whl	2026-03-11	1.2 GB	0
flashinfer_cubin-0.6.6-py3-none-any.whl	2026-03-11	267.7 MB	0
flashinfer_python-0.6.6-py3-none-any.whl	2026-03-11	7.8 MB	0
flashinfer_python-0.6.6.tar.gz	2026-03-11	5.3 MB	0
README.md	2026-03-10	2.7 kB	0
Release v0.6.6 source code.tar.gz	2026-03-10	2.9 MB	0
Release v0.6.6 source code.zip	2026-03-10	3.8 MB	0
Totals: 12 Items		9.5 GB	0

What's Changed

fix: move ArtifactPath/CheckSumHash imports inside gen_moe_utils_modu… by @dierksen in https://github.com/flashinfer-ai/flashinfer/pull/2681
Enable sm120f compilation by @kahyunnam in https://github.com/flashinfer-ai/flashinfer/pull/2650
Ensure -gencode flags are in deterministic order (for ccache) by @benbarsdell in https://github.com/flashinfer-ai/flashinfer/pull/2674
int16 Block-Scaled State and Stochastic Rounding for SSU (mamba) by @ishovkun in https://github.com/flashinfer-ai/flashinfer/pull/2645
feat: add pool+indices support to gated_delta_rule_decode_pretranspose (bf16 path) by @kaixih in https://github.com/flashinfer-ai/flashinfer/pull/2619
chore: replace bare print() with logging across the package by @esmeetu in https://github.com/flashinfer-ai/flashinfer/pull/2648
fix: reduce smem allocation for tinygemm2 kernel in SM120 by @jimmyzho in https://github.com/flashinfer-ai/flashinfer/pull/2670
[chore] bench_moe_deepseek.py allows adjusting expert distribution by @rosenrodt in https://github.com/flashinfer-ai/flashinfer/pull/2678
feat: add support for more MLA head dimensions by @hypdeb in https://github.com/flashinfer-ai/flashinfer/pull/2677
[fp8_blockwise]Fix int32 overflow in TRTLLM fused MoE activation kernel by @charlotte12l in https://github.com/flashinfer-ai/flashinfer/pull/2642
Give knam codeowner override for Qwen3.5 (gdn) related directories by @kahyunnam in https://github.com/flashinfer-ai/flashinfer/pull/2680
HOTFIX: Skip mamba Stochastic Rounding tests on sm_120 by @ishovkun in https://github.com/flashinfer-ai/flashinfer/pull/2699
chore: Update CODEOWNERS by @flashinfer-bot in https://github.com/flashinfer-ai/flashinfer/pull/2712
feat: support mxfp4 & mxfp8 entrypoint for blackwell cutedsl dense gemm by @b8zhong in https://github.com/flashinfer-ai/flashinfer/pull/2660
Undo fix to AutoTuner find_nearest_profile by @danisereb in https://github.com/flashinfer-ai/flashinfer/pull/2697
Experiment Add @kahyunnam as co-owner for several files by @aleozlx in https://github.com/flashinfer-ai/flashinfer/pull/2713
chore: Update CODEOWNERS by @flashinfer-bot in https://github.com/flashinfer-ai/flashinfer/pull/2719
Implement cutlass_fused_moe mxfp8 by @zianglih in https://github.com/flashinfer-ai/flashinfer/pull/2581

New Contributors

@benbarsdell made their first contribution in https://github.com/flashinfer-ai/flashinfer/pull/2674
@charlotte12l made their first contribution in https://github.com/flashinfer-ai/flashinfer/pull/2642
@zianglih made their first contribution in https://github.com/flashinfer-ai/flashinfer/pull/2581

Full Changelog: https://github.com/flashinfer-ai/flashinfer/compare/v0.6.5...v0.6.6

Source: README.md, updated 2026-03-10

FlashInfer Files

FlashInfer: Kernel Library for LLM Serving

What's Changed

New Contributors

FlashInfer Files

FlashInfer: Kernel Library for LLM Serving

Get an email when there's a new version of FlashInfer

What's Changed

New Contributors