CV-CUDA™ is an open-source, GPU-accelerated library
Lightning fast C++/CUDA neural network framework
ONNX-TensorRT: TensorRT backend for ONNX
CUDA Templates for Linear Algebra Subroutines
Diffusion model (SD, Flux, Wan, Qwen Image, Z-Image, ...) inference
C++ library for high performance inference on NVIDIA GPUs
Clean and efficient FP8 GEMM kernels with fine-grained scaling
Fast LLM speculative inference server for consumer hardware
Open Source Computer Vision Library
QVAC Fabric: cross-platform LLM inference and fine-tuning
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
RAPIDS Machine Learning Library
LLM inference in C/C++
ArrayFire, a general purpose GPU library
GPU-accelerated decision optimization
Instant neural graphics primitives: lightning fast NeRF and more
Easy-to-use deep learning framework with 3 key features
Fast C++ library for GPU linear algebra & scientific computing
Transformer-related optimization, including BERT, GPT
A High Performance Library for Sequence Processing and Generation
Real-time collision detection and multi-physics simulation for VR
Implements a reference architecture for creating information systems
A C++ standalone library for machine learning
YOLO ROS: Real-Time Object Detection for ROS
Tool for feature selection using the JMI metric and multiple GPUs