ONNX Runtime: cross-platform, high-performance ML inferencing
High-performance neural network inference framework for mobile
Run Local LLMs on Any Device. Open-source
User-friendly AI Interface
The free, Open Source alternative to OpenAI, Claude and others
OpenMLDB is an open-source machine learning database
Unified Model Serving Framework
PArallel Distributed Deep LEarning: Machine Learning Framework
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
Run serverless GPU workloads with fast cold starts on bare-metal
Deep Learning API and Server in C++14 with support for Caffe, PyTorch
Standardized Serverless ML Inference Platform on Kubernetes
MNN is a blazing fast, lightweight deep learning framework
An MLOps framework to package, deploy, monitor and manage models
AI interface for tinkerers (Ollama, Haystack RAG, Python)
The official Python client for the Hugging Face Hub
20+ high-performance LLMs with recipes to pretrain, finetune at scale
Private Open AI on Kubernetes
Operating LLMs in production
Superduper: Integrate AI models and machine learning workflows
LLM training code for MosaicML foundation models
Easy-to-use deep learning framework with 3 key features
Open platform for training, serving, and evaluating language models
OpenMMLab Model Deployment Framework
llama.go is like llama.cpp in pure Golang