Browse free open source Python LLM Inference Tools and projects below. Use the toggles on the left to filter open source Python LLM Inference Tools by OS, license, language, programming language, and project status.
Openai style api for open large language models
An easy-to-use LLMs quantization package with user-friendly apis
Images to inference with no labeling
Deploy a ML inference service on a budget in 10 lines of code
PyTorch library of curated Transformer models and their components
MII makes low-latency and high-throughput inference possible
DoWhy is a Python library for causal inference
Python Package for ML-Based Heterogeneous Treatment Effects Estimation
Gaussian processes in TensorFlow
GPU environment management and cluster orchestration
CPU/GPU inference server for Hugging Face transformer models
Low-latency REST API for serving text-embeddings
Build your chatbot within minutes on your favorite device
Standardized Serverless ML Inference Platform on Kubernetes
LLM training code for MosaicML foundation models
LLMFlows - Simple, Explicit and Transparent LLM Apps
A Pythonic framework to simplify AI service building
Toolbox of models, callbacks, and datasets for AI/ML researchers
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT method
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
OpenMMLab Video Perception Toolbox
Framework for Accelerating LLM Generation with Multiple Decoding Heads
Official inference library for Mistral models
OpenFieldAI is an AI based Open Field Test Rodent Tracker
Operating LLMs in production