Performance meets Productivity
cuda-oxide is an experimental Rust-to-CUDA compiler
The CUDA target for Numba
Thin, unified, C++-flavored wrappers for the CUDA APIs
A NumPy-compatible array library accelerated by CUDA
Lightning fast C++/CUDA neural network framework
CUDA Core Compute Libraries
Build an automated pipeline that converts CUDA APIs into Numba
A Python framework for accelerated simulation, data generation
OpenCV wrapper for .NET
C++ library for high performance inference on NVIDIA GPUs
Development repository for the Triton language and compiler
Geometric deep learning extension library for PyTorch
Jittor is a high-performance deep learning framework
Fast Differentiable Tensor Library in JavaScript & TypeScript with Bun
A language for fast, portable data-parallel computation
GPU DataFrame Library
The official SuiteSparse library: a suite of sparse matrix algorithms
A fast compiler cache
NVIDIA GPU Operator creates/configures/manages GPUs atop Kubernetes
Multi-platform high-performance compute language extension for Rust
Unified Model Serving Framework
ArrayFire, a general purpose GPU library
Library for efficient similarity search and clustering dense vectors
An open source library for GPU-accelerated robot learning