XNNPACK is a highly optimized, low-level neural network inference library developed by Google for accelerating deep learning workloads across a variety of hardware architectures, including ARM, x86, WebAssembly, and RISC-V. Rather than serving as a standalone ML framework, XNNPACK provides high-performance computational primitives (convolutions, pooling, activation functions, arithmetic operations, and more) that are integrated into higher-level frameworks such as TensorFlow Lite, PyTorch Mobile, ONNX Runtime, TensorFlow.js, and MediaPipe. The library is written in C/C++ and designed for portability and performance, leveraging platform-specific instruction sets (e.g., ARM NEON, x86 AVX, WebAssembly SIMD) for optimized execution. It operates on NHWC tensors and allows flexible striding along the channel dimension, so channel-split and concatenation operations can be handled without extra copies.
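As a rough illustration of the operator-level API, the sketch below creates and runs a single sigmoid operator over a batch of channel vectors, using a channel stride wider than the channel count to mimic reading a channel-split slice of a larger tensor in place. It follows the create/setup/run/delete lifecycle of older public revisions of `xnnpack.h`; recent releases split setup into separate reshape/setup calls, so treat the exact signatures as version-dependent assumptions rather than the current API.

```c
#include <stdio.h>

#include <xnnpack.h>      /* XNNPACK operator API */
#include <pthreadpool.h>  /* XNNPACK's threading abstraction */

int main(void) {
  /* One-time library initialization (NULL = default allocator). */
  if (xnn_initialize(NULL) != xnn_status_success) {
    fprintf(stderr, "failed to initialize XNNPACK\n");
    return 1;
  }

  /* Thread pool for multi-threaded execution; 0 = one thread per core. */
  pthreadpool_t threadpool = pthreadpool_create(0);

  enum { BATCH = 4, CHANNELS = 8, STRIDE = 16 };
  /* STRIDE > CHANNELS demonstrates the flexible channel stride: each
   * row holds CHANNELS valid values followed by padding, as when
   * operating on a channel slice of a wider tensor without copying. */
  float input[BATCH * STRIDE];
  float output[BATCH * STRIDE];
  for (int i = 0; i < BATCH * STRIDE; i++) {
    input[i] = (float) i / (BATCH * STRIDE) - 0.5f;
  }

  /* Create a sigmoid operator over CHANNELS channels with explicit
   * input/output strides (signature from older xnnpack.h revisions). */
  xnn_operator_t sigmoid_op = NULL;
  if (xnn_create_sigmoid_nc_f32(
          CHANNELS, /* channels */
          STRIDE,   /* input stride */
          STRIDE,   /* output stride */
          0,        /* flags */
          &sigmoid_op) != xnn_status_success) {
    fprintf(stderr, "failed to create sigmoid operator\n");
    return 1;
  }

  /* Bind the operator to concrete buffers and a batch size ... */
  xnn_setup_sigmoid_nc_f32(sigmoid_op, BATCH, input, output, threadpool);
  /* ... then execute it on the thread pool. */
  xnn_run_operator(sigmoid_op, threadpool);

  printf("output[0] = %f\n", output[0]);

  xnn_delete_operator(sigmoid_op);
  pthreadpool_destroy(threadpool);
  return 0;
}
```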
Features
- Cross-platform neural network inference backend optimized for ARM, x86, WebAssembly, and RISC-V
- High-performance implementations of 2D convolution, pooling, activation, and quantization operators
- Supports both FP32 and INT8 inference with per-channel quantization (see the quantization sketch after this list)
- Efficient NHWC tensor layout with flexible channel stride
- Integrates with frameworks such as TensorFlow Lite, TensorFlow.js, PyTorch Mobile, ONNX Runtime, and MediaPipe (see the delegate sketch after this list)
- Multi-threaded and vectorized operator implementations
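To make the per-channel INT8 point concrete, here is a small framework-agnostic sketch (not XNNPACK API) of symmetric per-channel weight quantization: each output channel gets its own scale, so one channel with large weights does not degrade the precision of the others. Real backends also handle zero points, rounding modes, and bias requantization.

```c
#include <math.h>
#include <stddef.h>
#include <stdint.h>

/* Symmetric per-channel INT8 quantization of weights laid out as
 * [channels][per_channel]. Each channel c gets its own scale
 * scale[c] = max(|w|) / 127, and q = round(w / scale[c]) clamped to
 * [-127, 127]. Illustrative sketch only. */
static void quantize_per_channel(const float *weights, size_t channels,
                                 size_t per_channel, int8_t *q,
                                 float *scale) {
  for (size_t c = 0; c < channels; c++) {
    const float *w = weights + c * per_channel;
    float max_abs = 0.0f;
    for (size_t i = 0; i < per_channel; i++) {
      max_abs = fmaxf(max_abs, fabsf(w[i]));
    }
    scale[c] = max_abs > 0.0f ? max_abs / 127.0f : 1.0f;
    for (size_t i = 0; i < per_channel; i++) {
      const float v = roundf(w[i] / scale[c]);
      q[c * per_channel + i] = (int8_t) fmaxf(-127.0f, fminf(127.0f, v));
    }
  }
}
```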
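Most applications reach these kernels through a host framework rather than the raw operator API. Below is a minimal sketch of enabling the XNNPACK delegate via TensorFlow Lite's C API; `model.tflite` and the thread count are placeholders, and header paths and option fields may vary between TFLite releases.

```c
#include <stdio.h>

#include "tensorflow/lite/c/c_api.h"
#include "tensorflow/lite/delegates/xnnpack/xnnpack_delegate.h"

int main(void) {
  /* "model.tflite" is a placeholder path for any TFLite flatbuffer. */
  TfLiteModel *model = TfLiteModelCreateFromFile("model.tflite");
  if (model == NULL) {
    fprintf(stderr, "failed to load model\n");
    return 1;
  }

  /* Route supported operators to XNNPACK kernels. */
  TfLiteXNNPackDelegateOptions xnnpack_opts =
      TfLiteXNNPackDelegateOptionsDefault();
  xnnpack_opts.num_threads = 4; /* assumption: tune per device */
  TfLiteDelegate *delegate = TfLiteXNNPackDelegateCreate(&xnnpack_opts);

  TfLiteInterpreterOptions *options = TfLiteInterpreterOptionsCreate();
  TfLiteInterpreterOptionsAddDelegate(options, delegate);

  TfLiteInterpreter *interpreter = TfLiteInterpreterCreate(model, options);
  TfLiteInterpreterAllocateTensors(interpreter);

  /* ... fill input tensors, then run inference on XNNPACK kernels ... */
  TfLiteInterpreterInvoke(interpreter);

  TfLiteInterpreterDelete(interpreter);
  TfLiteInterpreterOptionsDelete(options);
  TfLiteXNNPackDelegateDelete(delegate);
  TfLiteModelDelete(model);
  return 0;
}
```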