TT-NN operator library, and TT-Metalium low level kernel programming
C++ library for high performance inference on NVIDIA GPUs
FlashMLA: Efficient Multi-head Latent Attention Kernels
oneAPI Deep Neural Network Library (oneDNN)
Toolkit for making machine learning and data analysis applications
Runtime extension of Proximus enabling Deployment on AMD Ryzen™ AI
A C++ standalone library for machine learning
Deep learning inference framework optimized for mobile platforms
Computer vision and image processing library for Qt.