A NumPy-compatible array library accelerated by CUDA
CuPy is highly compatible with NumPy, serving as a drop-in replacement in most cases.
CuPy is very easy to install through pip or through precompiled binary packages called wheels for recommended environments. It also makes writing a custom CUDA kernel very easy, requiring only a small code snippet of C++.
...It allows to easily perform tensor decomposition, tensor learning and tensor algebra. Its backend system allows to seamlessly perform computation with NumPy, PyTorch, JAX, TensorFlow, CuPy or Paddle, and run methods at scale on CPU or GPU.