C-based Application Programming Interface (API)
C++ library for high-performance inference on NVIDIA GPUs
ONNX Runtime: cross-platform, high-performance ML inferencing
Visual SLAM/odometry package based on NVIDIA-accelerated cuVSLAM
FlashMLA: Efficient Multi-head Latent Attention Kernels
Thin, unified, C++-flavored wrappers for the CUDA APIs
GPU DataFrame Library
oneAPI Deep Neural Network Library (oneDNN)
A GPU-accelerated library containing highly optimized building blocks
Lightning-fast C++/CUDA neural network framework
The C++ parallel algorithms library
YOLO ROS: Real-Time Object Detection for ROS
Polyhedral compiler for expressing fast and portable data algorithms
A fast open framework for deep learning
CUDA-enabled machine learning library for recurrent neural networks