Browse free open source Python LLM Inference Tools and projects below. Use the toggles on the left to filter open source Python LLM Inference Tools by OS, license, language, programming language, and project status.
A Unified Library for Parameter-Efficient Learning
Adversarial Robustness Toolbox (ART) - Python Library for ML security
An easy-to-use LLMs quantization package with user-friendly apis
The unofficial python package that returns response of Google Bard
Unified Model Serving Framework
Deploy a ML inference service on a budget in 10 lines of code
Sparsity-aware deep learning inference runtime for CPUs
DoWhy is a Python library for causal inference
Library for OCR-related tasks powered by Deep Learning
Python Package for ML-Based Heterogeneous Treatment Effects Estimation
Database system for building simpler and faster AI-powered application
Open platform for training, serving, and evaluating language models
Gaussian processes in TensorFlow
Low-latency REST API for serving text-embeddings
Build your chatbot within minutes on your favorite device
LLMFlows - Simple, Explicit and Transparent LLM Apps
Easiest and laziest way for building multi-agent LLMs applications
Toolbox of models, callbacks, and datasets for AI/ML researchers
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT method
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
OpenMMLab Model Deployment Framework
OpenMMLab Video Perception Toolbox
Framework for Accelerating LLM Generation with Multiple Decoding Heads
Neural Network Compression Framework for enhanced OpenVINO
Lightweight Python library for adding real-time multi-object tracking