Running large language models on a single GPU
C++ library for high performance inference on NVIDIA GPUs
Fast and memory-efficient exact attention
OpenMLDB is an open-source machine learning database
A GPU-accelerated library containing highly optimized building blocks
Simple and distributed Machine Learning
Lightweight inference library for ONNX files, written in C++
Fast Forward Computer Vision (and other ML workloads!)
Feature selection and deep learning modeling for omic biomarker study
Efficient approximate nearest neighbor search algorithm collections
Ultra-fast matching engine written in Java based on LMAX Disruptor
An industrial deep learning framework for high-dimension sparse data
Three different software tools for phenotyping plant root images
The Janelia Automated Animal Behavior Annotator