Browse free open source Python LLM Inference Tools and projects below. Use the toggles on the left to filter open source Python LLM Inference Tools by OS, license, language, programming language, and project status.
Create HTML profiling reports from pandas DataFrame objects
Run 100B+ language models at home, BitTorrent-style
Phi-3.5 for Mac: Locally-run Vision and Language Models
PyTorch extensions for fast R&D prototyping and Kaggle farming
Implementation of "Tree of Thoughts
High quality, fast, modular reference implementation of SSD in PyTorch
Library for serving Transformers models on Amazon SageMaker
Serve machine learning models within a Docker container
Toolkit for allowing inference and serving with MXNet in SageMaker
An MLOps framework to package, deploy, monitor and manage models
Sequence-to-sequence framework, focused on Neural Machine Translation
Libraries for applying sparsification recipes to neural networks
A library to communicate with ChatGPT, Claude, Copilot, Gemini
A toolkit to optimize ML models for deployment for Keras & TensorFlow
Probabilistic reasoning and statistical analysis in TensorFlow
Large Language Model Text Generation Inference
Pytorch domain library for recommendation systems
A library for accelerating Transformer models on NVIDIA GPUs
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
Open-source tool designed to enhance the efficiency of workloads
Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere
Tensor search for humans
A computer vision framework to create and deploy apps in minutes
Multilingual Automatic Speech Recognition with word-level timestamps