High-performance neural network inference framework for mobile
ONNX Runtime: cross-platform, high performance ML inferencing
A unified framework for scalable computing
An Open-Source Programming Framework for Agentic AI
Lightweight Python library for adding real-time multi-object tracking
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
Large Language Model Text Generation Inference
Easiest and laziest way for building multi-agent LLMs applications
Pytorch domain library for recommendation systems
Private Open AI on Kubernetes
Bring the notion of Model-as-a-Service to life
Phi-3.5 for Mac: Locally-run Vision and Language Models
OpenVINO™ Toolkit repository
Multilingual Automatic Speech Recognition with word-level timestamps
Build Production-ready Agentic Workflow with Natural Language
MII makes low-latency and high-throughput inference possible
A GPU-accelerated library containing highly optimized building blocks
Deep Learning API and Server in C++14 support for Caffe, PyTorch
Open platform for training, serving, and evaluating language models
High quality, fast, modular reference implementation of SSD in PyTorch
A computer vision framework to create and deploy apps in minutes
Self-contained Machine Learning and Natural Language Processing lib
Serve machine learning models within a Docker container
llama.go is like llama.cpp in pure Golang
The deep learning toolkit for speech-to-text