Browse free open source Python LLM Inference Tools and projects below. Use the toggles on the left to filter open source Python LLM Inference Tools by OS, license, language, programming language, and project status.
The unofficial python package that returns response of Google Bard
Sparsity-aware deep learning inference runtime for CPUs
Database system for building simpler and faster AI-powered application
Open platform for training, serving, and evaluating language models
CPU/GPU inference server for Hugging Face transformer models
Standardized Serverless ML Inference Platform on Kubernetes
OpenMMLab Video Perception Toolbox
Bring the notion of Model-as-a-Service to life
A high-performance ML model serving framework, offers dynamic batching
Lightweight Python library for adding real-time multi-object tracking
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
State-of-the-art Parameter-Efficient Fine-Tuning
High quality, fast, modular reference implementation of SSD in PyTorch
Training and deploying machine learning models on Amazon SageMaker
Replace OpenAI GPT with another LLM in your app
Tensor search for humans
Multilingual Automatic Speech Recognition with word-level timestamps
A graphical manager for ollama that can manage your LLMs
Training & Implementation of chatbots leveraging GPT-like architecture
Openai style api for open large language models
A Unified Library for Parameter-Efficient Learning
Adversarial Robustness Toolbox (ART) - Python Library for ML security
Images to inference with no labeling
Deploy a ML inference service on a budget in 10 lines of code
Gaussian processes in TensorFlow