Tensor search for humans
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
A library for accelerating Transformer models on NVIDIA GPUs
LMDeploy is a toolkit for compressing, deploying, and serving LLMs
Ready-to-use OCR with 80+ supported languages
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods
Superduper: Integrate AI models and machine learning workflows
PyTorch domain library for recommendation systems
Unified Model Serving Framework
Standardized Serverless ML Inference Platform on Kubernetes
High-quality, fast, modular reference implementation of SSD in PyTorch
OpenMMLab Model Deployment Framework
Framework that is dedicated to making neural data processing
Serve machine learning models within a Docker container
Framework for Accelerating LLM Generation with Multiple Decoding Heads
Lightweight anchor-free object detection model
CPU/GPU inference server for Hugging Face transformer models