Production-grade client-side tracing, profiling, and analysis
FlashMLA: Efficient Multi-head Latent Attention Kernels
C++ library for high performance inference on NVIDIA GPUs
Coroutine-based concurrency library for PHP
oneAPI Deep Neural Network Library (oneDNN)
A C++ standalone library for machine learning
Event-driven network library for multi-threaded Linux server in C++11
OSE RTOS host simulator (windows and Linux/POSIX)
Linear algebra and solver library using CUDA, OpenCL, and OpenMP