Port of OpenAI's Whisper model in C/C++
Port of Facebook's LLaMA model in C/C++
Run Local LLMs on Any Device. Open-source
User-friendly AI Interface
ONNX Runtime: cross-platform, high performance ML inferencing
High-performance neural network inference framework for mobile
OpenVINO™ Toolkit repository
Everything you need to build state-of-the-art foundation models
The free, Open Source alternative to OpenAI, Claude and others
Protect and discover secrets using Gitleaks
C++ library for high performance inference on NVIDIA GPUs
MNN is a blazing fast, lightweight deep learning framework
Open standard for machine learning interoperability
Training and deploying machine learning models on Amazon SageMaker
Uncover insights, surface problems, monitor, and fine tune your LLM
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
A RWKV management and startup tool, full automation, only 8MB
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Fast inference engine for Transformer models
The official Python client for the Huggingface Hub
Gaussian processes in TensorFlow
A library for accelerating Transformer models on NVIDIA GPUs
Open-Source AI Camera. Empower any camera/CCTV
Operating LLMs in production
LLM.swift is a simple and readable library