C++ library for high-performance inference on NVIDIA GPUs
Alibaba's high-performance LLM inference engine for diverse apps
C++ Discord API Bot Library - D++ is lightweight and scalable
Pika is a Redis-compatible database
FlashMLA: Efficient Multi-head Latent Attention Kernels
C++-based high-performance parallel environment execution engine
A modern replacement for Redis and Memcached
A lightweight header-only library for using Keras (TensorFlow) models
An open source SQL database designed to process time series data
A fast and sensitive gapped read aligner
Official inference framework for 1-bit LLMs
A lightweight, lightning-fast, in-process vector database
A hybrid thread/fiber task scheduler written in C++11
The repository contains Google's robots.txt parser
OpenMLDB is an open-source machine learning database
A scalable inference server for models optimized with OpenVINO
Library for reading and writing large multi-dimensional arrays
FAIR Sequence Modeling Toolkit 2
Visual SLAM/odometry package based on NVIDIA-accelerated cuVSLAM
A GPU-accelerated library containing highly optimized building blocks
QVAC Fabric: cross-platform LLM inference and fine-tuning
Mooncake is the serving platform for Kimi
XLS: Accelerated HW Synthesis
HTTP/WebSocket server C++14 library
Industrial-grade RPC framework used throughout Baidu