Open-source tool designed to enhance the efficiency of workloads
A Unified Library for Parameter-Efficient Learning
Superduper: Integrate AI models and machine learning workflows
An MLOps framework to package, deploy, monitor and manage models
Deep learning optimization library: makes distributed training easy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs
Neural Network Compression Framework for enhanced OpenVINO
Build your chatbot within minutes on your favorite device
Official inference library for Mistral models
Create HTML profiling reports from pandas DataFrame objects
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
Probabilistic reasoning and statistical analysis in TensorFlow
Phi-3.5 for Mac: Locally-run Vision and Language Models
Libraries for applying sparsification recipes to neural networks
Gaussian processes in TensorFlow
Single-cell analysis in Python
MII makes low-latency and high-throughput inference possible
Sparsity-aware deep learning inference runtime for CPUs
Large Language Model Text Generation Inference
Easiest and laziest way for building multi-agent LLMs applications
Easy-to-use Speech Toolkit including Self-Supervised Learning model
20+ high-performance LLMs with recipes to pretrain, finetune at scale
Data manipulation and transformation for audio signal processing
A Pythonic framework to simplify AI service building
Adversarial Robustness Toolbox (ART) - Python Library for ML security