v0.4.0
Name                                     Modified     Size        Downloads / Week
Add GPT-2 acceleration support.tar.gz    2022-02-08   3.5 MB
Add GPT-2 acceleration support.zip       2022-02-08   3.5 MB
README.md                                2022-02-08   283 Bytes
Totals: 3 Items                                       6.9 MB      0
  • add support for decoder-based models (GPT-2) on both ONNX Runtime and TensorRT
  • refactor and simplify Triton configuration generation
  • add GPT-2 model documentation (notebook)
  • fix CPU quantization benchmark (it was not using the quantized model)
  • fix Sentence Transformers bug
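
To make the Triton configuration item above more concrete, here is a minimal sketch of how a model configuration in Triton's protobuf text format (`config.pbtxt`) might be generated from a few parameters. It is not the project's actual code: the helper names (`_tensor_block`, `build_triton_config`), the model name, the tensor names, and the choice of `TYPE_INT32` inputs are all assumptions made for illustration. Only the field names and the enum values (`onnxruntime_onnx`, `tensorrt_plan`, `KIND_GPU`, `KIND_CPU`, `TYPE_*`) come from Triton's model configuration format.

```python
from typing import List, Tuple

# (tensor name, Triton data type, dims); a hypothetical convenience alias.
TensorSpec = Tuple[str, str, List[int]]


def _tensor_block(spec: TensorSpec) -> str:
    """Render one input/output entry in protobuf text format."""
    name, data_type, dims = spec
    dims_txt = ", ".join(str(d) for d in dims)
    return (
        "  {\n"
        f'    name: "{name}"\n'
        f"    data_type: {data_type}\n"
        f"    dims: [{dims_txt}]\n"
        "  }"
    )


def build_triton_config(
    model_name: str,
    platform: str,
    inputs: List[TensorSpec],
    outputs: List[TensorSpec],
    use_gpu: bool = True,
) -> str:
    """Build the text of a config.pbtxt file (illustrative helper, not a library API)."""
    kind = "KIND_GPU" if use_gpu else "KIND_CPU"
    inputs_txt = ",\n".join(_tensor_block(s) for s in inputs)
    outputs_txt = ",\n".join(_tensor_block(s) for s in outputs)
    return (
        f'name: "{model_name}"\n'
        f'platform: "{platform}"\n'
        # 0 disables Triton-managed batching, so every dimension is listed explicitly.
        "max_batch_size: 0\n"
        f"input [\n{inputs_txt}\n]\n"
        f"output [\n{outputs_txt}\n]\n"
        f"instance_group [\n  {{\n    count: 1\n    kind: {kind}\n  }}\n]\n"
    )


# Example: a GPT-2 style decoder served through ONNX Runtime.
# Tensor names and input types are assumed; 50257 is the GPT-2 vocabulary size.
config = build_triton_config(
    model_name="gpt2_onnx_model",
    platform="onnxruntime_onnx",
    inputs=[
        ("input_ids", "TYPE_INT32", [-1, -1]),
        ("attention_mask", "TYPE_INT32", [-1, -1]),
    ],
    outputs=[("output", "TYPE_FP32", [-1, -1, 50257])],
)
print(config)
```

Passing `platform="tensorrt_plan"` would produce the equivalent configuration for a TensorRT engine, so a single function can cover both backends mentioned in this release.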
Source: README.md, updated 2022-02-08