The universal tool suite for vector database management
Implementation of TurboQuant (ICLR 2026)
AIMET is a library that provides advanced quantization and compression
A distributed system for embedding-based vector retrieval
A machine learning package built for humans
From-scratch PyTorch implementation of Google's TurboQuant
Accessible large language models via k-bit quantization for PyTorch
Libraries for applying sparsification recipes to neural networks
Plain Python implementations of basic machine learning algorithms
C++ library for high performance inference on NVIDIA GPUs
Minimal and clean examples of machine learning algorithms
Weaviate is a cloud-native, modular, real-time vector search engine
A gallery that showcases on-device ML/GenAI use cases
An implementation of a deep learning recommendation model (DLRM)
Neural Network Compression Framework for enhanced OpenVINO
Toolkit for making machine learning and data analysis applications
Open-source large language model family from Tencent Hunyuan
A ClickHouse fork that supports high-performance vector search
Open-Source AI Camera. Empower any camera/CCTV
Use PEFT or full-parameter training to CPT/SFT/DPO/GRPO 600+ LLMs
A unified library of SOTA model optimization techniques
Tensor library for machine learning
Fast inference engine for Transformer models
Bolt is a deep learning library with high performance
Machine learning on FPGAs using HLS