Port of OpenAI's Whisper model in C/C++
Port of Facebook's LLaMA model in C/C++
Run Local LLMs on Any Device. Open-source
User-friendly AI Interface
ONNX Runtime: cross-platform, high performance ML inferencing
High-performance neural network inference framework for mobile
OpenVINO™ Toolkit repository
Everything you need to build state-of-the-art foundation models
The free, Open Source alternative to OpenAI, Claude and others
Protect and discover secrets using Gitleaks
Training and deploying machine learning models on Amazon SageMaker
Uncover insights, surface problems, monitor, and fine tune your LLM
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
Bring the notion of Model-as-a-Service to life
A RWKV management and startup tool, full automation, only 8MB
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Fast inference engine for Transformer models
The official Python client for the Huggingface Hub
Gaussian processes in TensorFlow
A library for accelerating Transformer models on NVIDIA GPUs
Open-Source AI Camera. Empower any camera/CCTV
Operating LLMs in production
LLM.swift is a simple and readable library
Run serverless GPU workloads with fast cold starts on bare-metal
Phi-3.5 for Mac: Locally-run Vision and Language Models