Standardized Serverless ML Inference Platform on Kubernetes
AIMET is a library that provides advanced quantization and compression
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT method
Multilingual Automatic Speech Recognition with word-level timestamps
Pytorch domain library for recommendation systems
PyTorch extensions for fast R&D prototyping and Kaggle farming
A lightweight vision library for performing large object detection
Create HTML profiling reports from pandas DataFrame objects
Superduper: Integrate AI models and machine learning workflows
Bring the notion of Model-as-a-Service to life
LMDeploy is a toolkit for compressing, deploying, and serving LLMs
Phi-3.5 for Mac: Locally-run Vision and Language Models
Tensor search for humans
A high-performance ML model serving framework, offers dynamic batching
Easy-to-use Speech Toolkit including Self-Supervised Learning model
Images to inference with no labeling
OpenMMLab Model Deployment Framework
High quality, fast, modular reference implementation of SSD in PyTorch
Serve machine learning models within a Docker container
Framework that is dedicated to making neural data processing
Framework for Accelerating LLM Generation with Multiple Decoding Heads
Toolbox of models, callbacks, and datasets for AI/ML researchers
Lightweight anchor-free object detection model
Sequence-to-sequence framework, focused on Neural Machine Translation
Toolkit for allowing inference and serving with MXNet in SageMaker