Browse free open source LLM Inference tools and projects for Linux below.
Port of OpenAI's Whisper model in C/C++
User-friendly AI Interface
Port of Facebook's LLaMA model in C/C++
Run Local LLMs on Any Device. Open-source
High-performance neural network inference framework for mobile
ONNX Runtime: cross-platform, high performance ML inferencing
OpenVINO™ Toolkit repository
LLM.swift is a simple and readable library
C++ library for high performance inference on NVIDIA GPUs
Everything you need to build state-of-the-art foundation models
MNN is a blazing fast, lightweight deep learning framework
The free, Open Source alternative to OpenAI, Claude and others
Lightweight inference library for ONNX files, written in C++
Protect and discover secrets using Gitleaks
Fast inference engine for Transformer models
Open standard for machine learning interoperability
Uncover insights, surface problems, monitor, and fine tune your LLM
A RWKV management and startup tool, full automation, only 8MB
Training and deploying machine learning models on Amazon SageMaker
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
State-of-the-art diffusion models for image and audio generation
Gaussian processes in TensorFlow
GPU environment management and cluster orchestration
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
Operating LLMs in production