Browse free open source Python LLM Inference Tools and projects below. Use the toggles on the left to filter open source Python LLM Inference Tools by OS, license, language, programming language, and project status.
PyTorch extensions for fast R&D prototyping and Kaggle farming
Simplifies the local serving of AI models from any source
A unified framework for scalable computing
Implementation of "Tree of Thoughts
A lightweight vision library for performing large object detection
High quality, fast, modular reference implementation of SSD in PyTorch
Library for serving Transformers models on Amazon SageMaker
Serve machine learning models within a Docker container
Toolkit for allowing inference and serving with MXNet in SageMaker
An MLOps framework to package, deploy, monitor and manage models
Efficient few-shot learning with Sentence Transformers
Sequence-to-sequence framework, focused on Neural Machine Translation
Libraries for applying sparsification recipes to neural networks
Integrate, train and manage any AI models and APIs with your database
A library to communicate with ChatGPT, Claude, Copilot, Gemini
A toolkit to optimize ML models for deployment for Keras & TensorFlow
Probabilistic reasoning and statistical analysis in TensorFlow
Large Language Model Text Generation Inference
Pytorch domain library for recommendation systems
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
Open-source tool designed to enhance the efficiency of workloads
Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere
Tensor search for humans
A computer vision framework to create and deploy apps in minutes
Framework that is dedicated to making neural data processing