Training and deploying machine learning models on Amazon SageMaker
Run Local LLMs on Any Device. Open-source
Port of Facebook's LLaMA model in C/C++
Single-cell analysis in Python
DoWhy is a Python library for causal inference
Python Package for ML-Based Heterogeneous Treatment Effects Estimation
A high-throughput and memory-efficient inference and serving engine
FlashInfer: Kernel Library for LLM Serving
Gaussian processes in TensorFlow
Everything you need to build state-of-the-art foundation models
Uplift modeling and causal inference with machine learning algorithms
A Pythonic framework to simplify AI service building
Optimizing inference proxy for LLMs
A unified framework for scalable computing
Pytorch domain library for recommendation systems
Large Language Model Text Generation Inference
Neural Network Compression Framework for enhanced OpenVINO
Integrate, train and manage any AI models and APIs with your database
Operating LLMs in production
Superduper: Integrate AI models and machine learning workflows
20+ high-performance LLMs with recipes to pretrain, finetune at scale
The official Python client for the Huggingface Hub
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
Trainable models and NN optimization tools
Official inference library for Mistral models