Name | Modified | Size | Downloads / Week |
---|---|---|---|
Add GPT-2 acceleration support.tar.gz | 2022-02-08 | 3.5 MB | |
Add GPT-2 acceleration support.zip | 2022-02-08 | 3.5 MB | |
README.md | 2022-02-08 | 283 Bytes | |
Totals: 3 Items | | 6.9 MB | 0 |
- add support for decoder-based models (GPT-2) on both ONNX Runtime and TensorRT
- refactor and simplify Triton configuration generation
- add GPT-2 model documentation (notebook)
- fix CPU quantization benchmark (it was not using the quantized model)
- fix sentence-transformers bug