Qualcomm Innovation Center (QuIC) is at the forefront of enabling low-power inference at the edge through its pioneering model-efficiency research. QuIC has a mission to help migrate the ecosystem toward fixed-point inference. With this goal, QuIC presents the AI Model Efficiency Toolkit (AIMET) - a library that provides advanced quantization and compression techniques for trained neural network models. AIMET enables neural networks to run more efficiently on fixed-point AI hardware accelerators. Quantized inference is significantly faster than floating point inference. For example, models that we’ve run on the Qualcomm® Hexagon™ DSP rather than on the Qualcomm® Kryo™ CPU have resulted in a 5x to 15x speedup. Plus, an 8-bit model also has a 4x smaller memory footprint relative to a 32-bit model. However, often when quantizing a machine learning model (e.g., from 32-bit floating point to an 8-bit fixed point value), the model accuracy is sacrificed.

Features

  • Equalize weight tensors to reduce amplitude variation across channels
  • Tensor-decomposition technique to split a large layer into two smaller ones
  • Corrects shift in layer outputs introduced due to quantization
  • Removes redundant input channels from a layer and reconstructs layer weights
  • Use quantization sim to train the model further to improve accuracy
  • Automatically selects how much to compress each layer in the model

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow AIMET

AIMET Web Site

Other Useful Business Software
Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
Try Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of AIMET!

Additional Project Details

Programming Language

Python

Related Categories

Python Machine Learning Software, Python Neural Network Libraries, Python LLM Inference Tool

Registered

2022-08-19