MII makes low-latency and high-throughput inference possible
High-performance neural network inference framework for mobile
Unofficial (Golang) Go bindings for the Hugging Face Inference API
ONNX Runtime: cross-platform, high performance ML inferencing
Powering Amazon custom machine learning chips
AIMET is a library that provides advanced quantization and compression
High quality, fast, modular reference implementation of SSD in PyTorch
Replace OpenAI GPT with another LLM in your app
LLM training code for MosaicML foundation models
Library for serving Transformers models on Amazon SageMaker
PArallel Distributed Deep LEarning: Machine Learning Framework
Integrate, train and manage any AI models and APIs with your database
Bayesian inference with probabilistic programming
OpenMLDB is an open-source machine learning database
Database system for building simpler and faster AI-powered application
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
LLMs and Machine Learning done easily
An Open-Source Programming Framework for Agentic AI
On-device Speech Recognition for Apple Silicon
Standardized Serverless ML Inference Platform on Kubernetes
LLMs as Copilots for Theorem Proving in Lean
Deep Learning API and Server in C++14 support for Caffe, PyTorch
Build Production-ready Agentic Workflow with Natural Language
On-device AI across mobile, embedded and edge for PyTorch