C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Bolt is a deep learning library with high performance
Making large AI models cheaper, faster and more accessible
OpenVINO™ Toolkit repository
Fast and customizable framework for automatic ML model creation
Neural Network Compression Framework for enhanced OpenVINO
An easy-to-use LLMs quantization package with user-friendly apis
Transformers4Rec is a flexible and efficient library