v0.2.0

Name                                  Modified     Size
add GPU quantization support.tar.gz   2021-12-08   3.4 MB
add GPU quantization support.zip      2021-12-08   3.4 MB
README.md                             2021-12-08   957 Bytes
Totals: 3 items, 6.9 MB, 0 downloads / week

  • support INT8 GPU quantization
  • add a tutorial on performing quantization end to end
  • add the QDQRoberta model
  • switch to ONNX opset 13
  • refactor TensorRT engine creation
  • fix bugs
  • add auth token support (for private Hugging Face repositories)
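
For readers unfamiliar with the "QDQ" in QDQRoberta, the following is a minimal sketch of a quantize/dequantize round trip with symmetric, per-tensor INT8 scaling. It is an illustration only and is not taken from this project's code: the function names (`quantize`, `dequantize`, `qdq`) and the choice of a single per-tensor scale are assumptions made for the example.

```python
from typing import List, Tuple

# Assumption: symmetric INT8 range [-127, 127], so that zero maps exactly to 0.
INT8_MAX = 127


def quantize(values: List[float]) -> Tuple[List[int], float]:
    """Map floats to INT8 codes using one scale for the whole tensor."""
    amax = max(abs(v) for v in values)
    # Guard against an all-zero tensor, where any scale would do.
    scale = amax / INT8_MAX if amax > 0 else 1.0
    codes = [max(-INT8_MAX, min(INT8_MAX, round(v / scale))) for v in values]
    return codes, scale


def dequantize(codes: List[int], scale: float) -> List[float]:
    """Map INT8 codes back to approximate float values."""
    return [c * scale for c in codes]


def qdq(values: List[float]) -> List[float]:
    """Quantize then dequantize: the float values an INT8 kernel would see."""
    codes, scale = quantize(values)
    return dequantize(codes, scale)


if __name__ == "__main__":
    weights = [-1.0, 0.0, 0.25, 1.0]
    codes, scale = quantize(weights)
    print("codes:", codes)
    print("scale:", scale)
    print("round trip:", dequantize(codes, scale))
```

Each value that comes back from the round trip differs from the original by at most about half of `scale`, which is the precision given up in exchange for 8-bit storage and arithmetic.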

Full Changelog: https://github.com/ELS-RD/transformer-deploy/compare/v0.1.1...v0.2.0

Source: README.md, updated 2021-12-08