Browse free open source Python LLM Inference Tools and projects below. Use the toggles on the left to filter open source Python LLM Inference Tools by OS, license, language, programming language, and project status.
Run Local LLMs on Any Device. Open-source
Lightweight anchor-free object detection model
Everything you need to build state-of-the-art foundation models
Ready-to-use OCR with 80+ supported languages
Training and deploying machine learning models on Amazon SageMaker
Bring the notion of Model-as-a-Service to life
Uncover insights, surface problems, monitor, and fine tune your LLM
Library for OCR-related tasks powered by Deep Learning
A library for accelerating Transformer models on NVIDIA GPUs
Gaussian processes in TensorFlow
GPU environment management and cluster orchestration
A unified framework for scalable computing
The official Python client for the Huggingface Hub
Standardized Serverless ML Inference Platform on Kubernetes
OpenMMLab Model Deployment Framework
Operating LLMs in production
Phi-3.5 for Mac: Locally-run Vision and Language Models
Single-cell analysis in Python
A toolkit to optimize ML models for deployment for Keras & TensorFlow
A high-throughput and memory-efficient inference and serving engine
Openai style api for open large language models
A Unified Library for Parameter-Efficient Learning
Deep learning optimization library: makes distributed training easy
State-of-the-art diffusion models for image and audio generation
Python Package for ML-Based Heterogeneous Treatment Effects Estimation