Uncover insights, surface problems, monitor, and fine tune your LLM
Pytorch domain library for recommendation systems
A toolkit to optimize ML models for deployment for Keras & TensorFlow
Library for serving Transformers models on Amazon SageMaker
Phi-3.5 for Mac: Locally-run Vision and Language Models
Neural Network Compression Framework for enhanced OpenVINO
Multilingual Automatic Speech Recognition with word-level timestamps
High quality, fast, modular reference implementation of SSD in PyTorch
A high-performance ML model serving framework, offers dynamic batching
Framework that is dedicated to making neural data processing
Toolbox of models, callbacks, and datasets for AI/ML researchers
Lightweight anchor-free object detection model
Implementation of model parallel autoregressive transformers on GPUs
Sequence-to-sequence framework, focused on Neural Machine Translation
A computer vision framework to create and deploy apps in minutes
Guide to deploying deep-learning inference networks
Toolkit for allowing inference and serving with MXNet in SageMaker
CPU/GPU inference server for Hugging Face transformer models
Deploy a ML inference service on a budget in 10 lines of code