Hugging Face Transformer Files

CPU/GPU inference server for Hugging Face transformer models

This is an exact mirror of the Hugging Face Transformer project, hosted at https://github.com/ELS-RD/transformer-deploy. SourceForge is not affiliated with Hugging Face Transformer. For more information, see the SourceForge Open Source Mirror Directory.

Name	Modified	Size	Downloads / Week
add GPU quantization support.tar.gz	2021-12-08	3.4 MB	0
add GPU quantization support.zip	2021-12-08	3.4 MB	0
README.md	2021-12-08	957 Bytes	0
Totals: 3 Items		6.9 MB	0

Support INT8 quantization on GPU
Add an end-to-end quantization tutorial
Add the QDQRoberta model
Switch to ONNX opset 13
Refactor TensorRT engine creation
Fix bugs
Add auth token support (for private Hugging Face repositories)
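
For readers unfamiliar with the term, the general idea behind INT8 quantization can be sketched in a few lines of plain Python. This is an illustration only and is not taken from transformer-deploy: the function names `quantize_int8` and `dequantize_int8` are hypothetical, and the symmetric per-tensor scheme shown here is an assumption chosen for simplicity, not necessarily the scheme the project uses.

```python
def quantize_int8(values):
    """Symmetric per-tensor quantization (illustrative, hypothetical helper).

    Maps floats to integers in [-127, 127] using a single scale factor
    derived from the largest absolute value in the input.
    """
    max_abs = max((abs(v) for v in values), default=0.0)
    # Avoid a zero scale when the input is empty or all zeros.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    quantized = [max(-127, min(127, round(v / scale))) for v in values]
    return quantized, scale


def dequantize_int8(quantized, scale):
    """Map the integers back to approximate float values."""
    return [q * scale for q in quantized]


weights = [0.5, -1.0, 0.25, 0.0]
q, scale = quantize_int8(weights)
restored = dequantize_int8(q, scale)

for original, approx in zip(weights, restored):
    print(f"{original:+.4f} -> {approx:+.4f}")
```

Each restored value differs from the original by at most about half the scale factor, which is the precision given up in exchange for storing and computing with 8-bit integers.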

What's Changed

Update triton by @pommedeterresautee in https://github.com/ELS-RD/transformer-deploy/pull/11
fix README.md by @pommedeterresautee in https://github.com/ELS-RD/transformer-deploy/pull/13
Fix install errors by @sam-writer in https://github.com/ELS-RD/transformer-deploy/pull/20
Add auth token by @sam-writer in https://github.com/ELS-RD/transformer-deploy/pull/19
Support GPU INT-8 quantization by @pommedeterresautee in https://github.com/ELS-RD/transformer-deploy/pull/15

New Contributors

@sam-writer made their first contribution in https://github.com/ELS-RD/transformer-deploy/pull/20

Full Changelog: https://github.com/ELS-RD/transformer-deploy/compare/v0.1.1...v0.2.0

Source: README.md, updated 2021-12-08
