Trainable models and NN optimization tools
AIMET is a library that provides advanced quantization and compression
Simplifies the local serving of AI models from any source
Standardized Serverless ML Inference Platform on Kubernetes
Official inference library for Mistral models
20+ high-performance LLMs with recipes to pretrain, finetune at scale
Phi-3.5 for Mac: Locally-run Vision and Language Models
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
Integrate, train and manage any AI models and APIs with your database
Large Language Model Text Generation Inference
A unified framework for scalable computing
Efficient few-shot learning with Sentence Transformers
Deep learning optimization library: makes distributed training easy
Uplift modeling and causal inference with machine learning algorithms
PyTorch library of curated Transformer models and their components
Unified Model Serving Framework
An MLOps framework to package, deploy, monitor and manage models
Create HTML profiling reports from pandas DataFrame objects
A set of Docker images for training and serving models in TensorFlow
Single-cell analysis in Python
Neural Network Compression Framework for enhanced OpenVINO
A Pythonic framework to simplify AI service building
DoWhy is a Python library for causal inference
Superduper: Integrate AI models and machine learning workflows
Low-latency REST API for serving text-embeddings