Port of Facebook's LLaMA model in C/C++
User-friendly AI Interface
The free, Open Source alternative to OpenAI, Claude and others
A set of Docker images for training and serving models in TensorFlow
Official inference library for Mistral models
Deep Learning API and Server in C++14 with support for Caffe, PyTorch
CPU/GPU inference server for Hugging Face transformer models
Deploy an ML inference service on a budget in 10 lines of code