Performance meets Productivity
CV-CUDA™ is an open-source, GPU accelerated library
C++ and Python support for the CUDA Quantum programming model
Accelerated libraries for quantum-classical computing built on CUDA-Q
cuda-oxide is an experimental Rust-to-CUDA compiler
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
CUDA programming in Julia
The CUDA target for Numba
Thin, unified, C++-flavored wrappers for the CUDA APIs
How to optimize some algorithm in cuda
A NumPy-compatible array library accelerated by CUDA
Lightning fast C++/CUDA neural network framework
CUDA Core Compute Libraries
Build an automated pipeline that converts CUDA APIs into Numba
Machine Learning Containers for NVIDIA Jetson and JetPack-L4T
The best AI Aimbot for Fortnite, Valorant, CS2, R6, COD, Apex, & more
ONNX-TensorRT: TensorRT backend for ONNX
A Python framework for accelerated simulation, data generation
RandomX, KawPow, CryptoNight, AstroBWT and GhostRider unified miner
CUDA Templates for Linear Algebra Subroutines
Solve puzzles. Learn CUDA
Solving the Satoshi Puzzle
Clean and efficient FP8 GEMM kernels with fine-grained scaling
Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference
Extensible, Efficient Quantum Algorithm Design for Humans