FlashMLA: Efficient Multi-head Latent Attention Kernels
C++ library for high performance inference on NVIDIA GPUs
The Compute Library is a set of computer vision and machine learning
Toolkit for making machine learning and data analysis applications
Clean and efficient FP8 GEMM kernels with fine-grained scaling
oneAPI Deep Neural Network Library (oneDNN)
Runtime extension of Proximus enabling Deployment on AMD Ryzen™ AI
A C++ standalone library for machine learning
Deep learning inference framework optimized for mobile platforms
Fast and user-friendly runtime for transformer inference
Machine learning with Gaussian kernels.
Calculates similarity between neighborhoods of two vertices in a graph
Azul OS version dev(Linux) IA
A Tailored Small Linux for Beagleboard-xm
Computer vision and image processing library for Qt.