Gaussian processes in TensorFlow
Single-cell analysis in Python
Library for serving Transformers models on Amazon SageMaker
Open-source tool designed to enhance the efficiency of workloads
Bring the notion of Model-as-a-Service to life
MII makes low-latency and high-throughput inference possible
A Pythonic framework to simplify AI service building
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT method
Tensor search for humans
Multilingual Automatic Speech Recognition with word-level timestamps
Unified Model Serving Framework
Pytorch domain library for recommendation systems
PyTorch extensions for fast R&D prototyping and Kaggle farming
A lightweight vision library for performing large object detection
Create HTML profiling reports from pandas DataFrame objects
LMDeploy is a toolkit for compressing, deploying, and serving LLMs
Neural Network Compression Framework for enhanced OpenVINO
Standardized Serverless ML Inference Platform on Kubernetes
Easy-to-use Speech Toolkit including Self-Supervised Learning model
Phi-3.5 for Mac: Locally-run Vision and Language Models
Superduper: Integrate AI models and machine learning workflows
A high-performance ML model serving framework, offers dynamic batching
Images to inference with no labeling
A toolkit to optimize ML models for deployment for Keras & TensorFlow
OpenMMLab Model Deployment Framework