Name | Modified | Size | Downloads / Week |
---|---|---|---|
Parent folder | |||
README.md | 2025-06-18 | 1.7 kB | |
v3.3.3 source code.tar.gz | 2025-06-18 | 3.2 MB | |
v3.3.3 source code.zip | 2025-06-18 | 3.8 MB | |
Totals: 3 Items | 7.0 MB | 0 |
Neuron backend update.
What's Changed
- Remove useless packages by @yuanwu2017 in https://github.com/huggingface/text-generation-inference/pull/3253
- Bump neuron SDK version by @dacorvo in https://github.com/huggingface/text-generation-inference/pull/3260
- Perf opt by @sywangyi in https://github.com/huggingface/text-generation-inference/pull/3256
- [gaudi] Vlm rebase and issue fix in benchmark test by @sywangyi in https://github.com/huggingface/text-generation-inference/pull/3263
- Move the _update_cos_sin_cache into get_cos_sin by @yuanwu2017 in https://github.com/huggingface/text-generation-inference/pull/3254
- [Gaudi] Remove optimum-habana by @yuanwu2017 in https://github.com/huggingface/text-generation-inference/pull/3261
- [gaudi] HuggingFaceM4/idefics2-8b issue fix by @sywangyi in https://github.com/huggingface/text-generation-inference/pull/3264
- [Gaudi] Enable Qwen3_moe model by @yuanwu2017 in https://github.com/huggingface/text-generation-inference/pull/3244
- [Gaudi]Fix the integration-test issues by @yuanwu2017 in https://github.com/huggingface/text-generation-inference/pull/3265
- [Gaudi] use pad_token_id to pad input id by @sywangyi in https://github.com/huggingface/text-generation-inference/pull/3268
- chore: prepare release 3.3.3 by @dacorvo in https://github.com/huggingface/text-generation-inference/pull/3269
- [gaudi] Refine logging for Gaudi warmup by @regisss in https://github.com/huggingface/text-generation-inference/pull/3222
- doc: fix README by @dacorvo in https://github.com/huggingface/text-generation-inference/pull/3271
Full Changelog: https://github.com/huggingface/text-generation-inference/compare/v3.3.2...v3.3.3