CUDA Core Compute Libraries
A high-performance, zero-overhead, extensible Python compiler
Fast Differentiable Tensor Library in JavaScript & TypeScript with Bun
User interface for recording and managing ETW traces
FlashMLA: Efficient Multi-head Latent Attention Kernels
C++ library for high-performance inference on NVIDIA GPUs
A GPU-accelerated library containing highly optimized building blocks
Samples that demonstrate how to build graphics-intensive applications
Kotlin Multiplatform bindings to Skia
Math library using HLSL syntax with multiplatform SIMD support
Cross-platform, customizable ML solutions for live and streaming media
ArrayFire, a general-purpose GPU library
High-performance neural network inference framework for mobile
MNN is a blazing-fast, lightweight deep learning framework
A language for fast, portable data-parallel computation
Real-time image and video processing library similar to GPUImage
oneAPI Deep Neural Network Library (oneDNN)
Khronos Vulkan, OpenGL, and OpenGL ES Conformance Tests
Official inference framework for 1-bit LLMs
Visual SLAM/odometry package based on NVIDIA-accelerated cuVSLAM
A multimedia framework based on Qt and FFmpeg
High-performance C++ library for building Wayland compositors
Portable suite of libraries and tools for building client applications
Structure of Arrays of multiple types