Probabilistic reasoning and statistical analysis in TensorFlow
A unified framework for scalable computing
Powering Amazon custom machine learning chips
Deep learning optimization library: makes distributed training easy
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
Open platform for training, serving, and evaluating language models
Libraries for applying sparsification recipes to neural networks
An easy-to-use LLMs quantization package with user-friendly apis
A library to communicate with ChatGPT, Claude, Copilot, Gemini
Optimizing inference proxy for LLMs
Neural Network Compression Framework for enhanced OpenVINO
Openai style api for open large language models
Sparsity-aware deep learning inference runtime for CPUs
Large Language Model Text Generation Inference
Images to inference with no labeling
Efficient few-shot learning with Sentence Transformers
Pytorch domain library for recommendation systems
Visual Instruction Tuning: Large Language-and-Vision Assistant
20+ high-performance LLMs with recipes to pretrain, finetune at scale
Adversarial Robustness Toolbox (ART) - Python Library for ML security
GPU environment management and cluster orchestration
Phi-3.5 for Mac: Locally-run Vision and Language Models
A Unified Library for Parameter-Efficient Learning
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT method
Uplift modeling and causal inference with machine learning algorithms