Port of Facebook's LLaMA model in C/C++
ONNX Runtime: cross-platform, high performance ML inferencing
High-performance neural network inference framework for mobile
C++ library for high performance inference on NVIDIA GPUs
Protect and discover secrets using Gitleaks
Everything you need to build state-of-the-art foundation models
Open-Source AI Camera. Empower any camera/CCTV
PArallel Distributed Deep LEarning: Machine Learning Framework
The free, Open Source alternative to OpenAI, Claude and others
Set of comprehensive computer vision & machine intelligence libraries
Neural Network Compression Framework for enhanced OpenVINO
A set of Docker images for training and serving models in TensorFlow
Official inference library for Mistral models
A unified framework for scalable computing
A general-purpose probabilistic programming system
Library for serving Transformers models on Amazon SageMaker
LLM.swift is a simple and readable library
Superduper: Integrate AI models and machine learning workflows
Unified Model Serving Framework
AIMET is a library that provides advanced quantization and compression
Powering Amazon custom machine learning chips
Standardized Serverless ML Inference Platform on Kubernetes
A GPU-accelerated library containing highly optimized building blocks
Run serverless GPU workloads with fast cold starts on bare-metal
An MLOps framework to package, deploy, monitor and manage models