Performance meets Productivity
cuda-oxide is an experimental Rust-to-CUDA compiler
The CUDA target for Numba
Thin, unified, C++-flavored wrappers for the CUDA APIs
A NumPy-compatible array library accelerated by CUDA
Lightning fast C++/CUDA neural network framework
CUDA Core Compute Libraries
Build an automated pipeline that converts CUDA APIs into Numba
A Python framework for accelerated simulation, data generation
OpenCV wrapper for .NET
Geometric deep learning extension library for PyTorch
Jittor is a high-performance deep learning framework
Fast Differentiable Tensor Library in JavaScript & TypeScript with Bun
A language for fast, portable data-parallel computation
The official SuiteSparse library: a suite of sparse matrix algorithms
A fast compiler cache
NVIDIA GPU Operator creates/configures/manages GPUs atop Kubernetes
Multi-platform high-performance compute language extension for Rust
Unified Model Serving Framework
ArrayFire, a general purpose GPU library
Library for efficient similarity search and clustering dense vectors
An open source library for GPU-accelerated robot learning
Ready-to-run Docker images containing Jupyter applications
2D and 3D Face alignment library build using pytorch
A set of Docker images for training and serving models in TensorFlow