Browse free open source LLM Inference tools and projects for Mac below.
Port of OpenAI's Whisper model in C/C++
Port of Facebook's LLaMA model in C/C++
User-friendly AI Interface
Run Local LLMs on Any Device. Open-source
ONNX Runtime: cross-platform, high performance ML inferencing
High-performance neural network inference framework for mobile
Data manipulation and transformation for audio signal processing
The free, Open Source alternative to OpenAI, Claude and others
OpenVINO™ Toolkit repository
LLM.swift is a simple and readable library
Everything you need to build state-of-the-art foundation models
Lightweight inference library for ONNX files, written in C++
Protect and discover secrets using Gitleaks
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Fast inference engine for Transformer models
An Open-Source Programming Framework for Agentic AI
Training and deploying machine learning models on Amazon SageMaker
A set of Docker images for training and serving models in TensorFlow
Bring the notion of Model-as-a-Service to life
Uncover insights, surface problems, monitor, and fine tune your LLM
State-of-the-art diffusion models for image and audio generation
Gaussian processes in TensorFlow
GPU environment management and cluster orchestration
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
OpenMMLab Model Deployment Framework