Deep Learning API and Server in C++14 support for Caffe, PyTorch
LLMs and Machine Learning done easily
Build Production-ready Agentic Workflow with Natural Language
Phi-3.5 for Mac: Locally-run Vision and Language Models
State-of-the-art diffusion models for image and audio generation
Openai style api for open large language models
Images to inference with no labeling
A Pythonic framework to simplify AI service building
On-device AI across mobile, embedded and edge for PyTorch
Data manipulation and transformation for audio signal processing
Everything you need to build state-of-the-art foundation models
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
Libraries for applying sparsification recipes to neural networks
An easy-to-use LLMs quantization package with user-friendly apis
Single-cell analysis in Python
Private Open AI on Kubernetes
Adversarial Robustness Toolbox (ART) - Python Library for ML security
Operating LLMs in production
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
On-device Speech Recognition for Apple Silicon
OpenMMLab Model Deployment Framework
Neural Network Compression Framework for enhanced OpenVINO
Sparsity-aware deep learning inference runtime for CPUs
Large Language Model Text Generation Inference
Training and deploying machine learning models on Amazon SageMaker