Official inference library for Mistral models
Port of OpenAI's Whisper model in C/C++
A library to communicate with ChatGPT, Claude, Copilot, Gemini
A Pythonic framework to simplify AI service building
LLM training code for MosaicML foundation models
An RWKV management and startup tool, fully automated, only 8MB
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
Bring the notion of Model-as-a-Service to life
Protect and discover secrets using Gitleaks
Private Open AI on Kubernetes
A general-purpose probabilistic programming system
GPU environment management and cluster orchestration
Simplifies the local serving of AI models from any source
Library for serving Transformers models on Amazon SageMaker
Open platform for training, serving, and evaluating language models
Unified Model Serving Framework
Fast inference engine for Transformer models
State-of-the-art diffusion models for image and audio generation
A unified framework for scalable computing
Powering Amazon custom machine learning chips
Open-Source AI Camera. Empower any camera/CCTV
PyTorch library of curated Transformer models and their components
Lightweight Python library for adding real-time multi-object tracking
A high-performance ML model serving framework offering dynamic batching
Deep Learning API and Server in C++14 with support for Caffe, PyTorch