Text Generation Inference Files

Large Language Model Text Generation Inference

This is an exact mirror of the Text Generation Inference project, hosted at https://github.com/huggingface/text-generation-inference. SourceForge is not affiliated with Text Generation Inference. For more information, see the SourceForge Open Source Mirror Directory.

The interactive file manager requires Javascript. Please enable it or use sftp or scp.
You may still browse the files here.

Name	Modified	Size	InfoDownloads / Week
Parent folder
README.md	2025-05-30	1.0 kB	0
v3.3.2 source code.tar.gz	2025-05-30	3.2 MB	0
v3.3.2 source code.zip	2025-05-30	3.8 MB	1
Totals: 3 Items		7.0 MB	1

Gaudi improvements.

What's Changed

upgrade to new vllm extension ops(fix issue in exponential bucketing) by @sywangyi in https://github.com/huggingface/text-generation-inference/pull/3239
Nix: switch to hf-nix by @danieldk in https://github.com/huggingface/text-generation-inference/pull/3240
Add Qwen3 by @yuanwu2017 in https://github.com/huggingface/text-generation-inference/pull/3229
fp8 compressed_tensors w8a8 support by @sywangyi in https://github.com/huggingface/text-generation-inference/pull/3242
[Gaudi] Fix the OOM issue of Llama-4-Scout-17B-16E-Instruct by @yuanwu2017 in https://github.com/huggingface/text-generation-inference/pull/3245
Fix the Llama-4-Maverick-17B-128E crash issue by @yuanwu2017 in https://github.com/huggingface/text-generation-inference/pull/3246
Prepare for 3.3.2 by @danieldk in https://github.com/huggingface/text-generation-inference/pull/3249

Full Changelog: https://github.com/huggingface/text-generation-inference/compare/v3.3.1...v3.3.2

Source: README.md, updated 2025-05-30

Other Useful Business Software

Our Free Plans just got better! | Auth0 Icon

Our Free Plans just got better! | Auth0

With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now

Simplify IT and security with a single endpoint management platform Icon

Simplify IT and security with a single endpoint management platform

Automate the hardest parts of IT

NinjaOne automates the hardest parts of IT, delivering visibility, security, and control over all endpoints for more than 20,000 customers. The NinjaOne automated endpoint management platform is proven to increase productivity, reduce security risk, and lower costs for IT teams and managed service providers. The company seamlessly integrates with a wide range of IT and security technologies. NinjaOne is obsessed with customer success and provides free and unlimited onboarding, training, and support.

Learn More

Recommended Projects

Artificial Intelligence Controller
AICI: Prompts as (Wasm) Programs
DeSmuME: Nintendo DS emulator
DeSmuME is a Nintendo DS emulator
7-Zip
A free file archiver for extremely high compression
KeePass
A lightweight and easy-to-use password manager
Apache OpenOffice
The free and Open Source productivity suite