Port of OpenAI's Whisper model in C/C++
Port of Facebook's LLaMA model in C/C++
Run Local LLMs on Any Device. Open-source
User-friendly AI Interface
OpenVINO™ Toolkit repository
ONNX Runtime: cross-platform, high performance ML inferencing
High-performance neural network inference framework for mobile
The free, Open Source alternative to OpenAI, Claude and others
C++ library for high performance inference on NVIDIA GPUs
Protect and discover secrets using Gitleaks
MNN is a blazing fast, lightweight deep learning framework
Everything you need to build state-of-the-art foundation models
A high-throughput and memory-efficient inference and serving engine
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Pure C++ implementation of several models for real-time chatting
Data manipulation and transformation for audio signal processing
Open-Source AI Camera. Empower any camera/CCTV
Open standard for machine learning interoperability
The official Python client for the Huggingface Hub
Standardized Serverless ML Inference Platform on Kubernetes
Fast inference engine for Transformer models
Run serverless GPU workloads with fast cold starts on bare-metal
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
Phi-3.5 for Mac: Locally-run Vision and Language Models
Training and deploying machine learning models on Amazon SageMaker