The universal tool suite for vector database management
A vector index built on TurboQuant, written in Rust with Python
A distributed system for embedding-based vector retrieval
Implementation of TurboQuant (ICLR 2026)
Accessible large language models via k-bit quantization for PyTorch
From-scratch PyTorch implementation of Google's TurboQuant
AIMET is a library that provides advanced quantization and compression
A machine learning package built for humans
C++ library for high performance inference on NVIDIA GPUs
Plain python implementations of basic machine learning algorithms
Libraries for applying sparsification recipes to neural networks
Minimal and clean examples of machine learning algorithms
Neural Network Compression Framework for enhanced OpenVINO
Weaviate is a cloud-native, modular, real-time vector search engine
Toolkit for making machine learning and data analysis applications
An implementation of a deep learning recommendation model (DLRM)
A kernel library written in tilelang
A gallery that showcases on-device ML/GenAI use cases
Bolt is a deep learning library with high performance
Tensor library for machine learning
A @ClickHouse fork that supports high-performance vector search
Open-Source AI Camera. Empower any camera/CCTV
Modern columnar data format for ML and LLMs implemented in Rust
Build AI-powered semantic search applications
Open-source large language model family from Tencent Hunyuan