State-of-the-art Parameter-Efficient Fine-Tuning
Python Package for ML-Based Heterogeneous Treatment Effects Estimation
A library for accelerating Transformer models on NVIDIA GPUs
Library for OCR-related tasks powered by Deep Learning
Probabilistic reasoning and statistical analysis in TensorFlow
Gaussian processes in TensorFlow
Training and deploying machine learning models on Amazon SageMaker
Easy-to-use Speech Toolkit including Self-Supervised Learning model
Deep learning optimization library: makes distributed training easy
A unified framework for scalable computing
Powering Amazon custom machine learning chips
Open-source tool designed to enhance the efficiency of workloads
LLM training code for MosaicML foundation models
Optimizing inference proxy for LLMs
Neural Network Compression Framework for enhanced OpenVINO
Build your chatbot within minutes on your favorite device
Open platform for training, serving, and evaluating language models
MII makes low-latency and high-throughput inference possible
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
GPU environment management and cluster orchestration
Phi-3.5 for Mac: Locally-run Vision and Language Models
Libraries for applying sparsification recipes to neural networks
An easy-to-use LLMs quantization package with user-friendly apis
A library to communicate with ChatGPT, Claude, Copilot, Gemini
Sparsity-aware deep learning inference runtime for CPUs