Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT method
Multilingual Automatic Speech Recognition with word-level timestamps
Unified Model Serving Framework
Pytorch domain library for recommendation systems
PyTorch extensions for fast R&D prototyping and Kaggle farming
A lightweight vision library for performing large object detection
Create HTML profiling reports from pandas DataFrame objects
Bring the notion of Model-as-a-Service to life
LMDeploy is a toolkit for compressing, deploying, and serving LLMs
Phi-3.5 for Mac: Locally-run Vision and Language Models
Tensor search for humans
Superduper: Integrate AI models and machine learning workflows
A high-performance ML model serving framework, offers dynamic batching
Standardized Serverless ML Inference Platform on Kubernetes
Images to inference with no labeling
OpenMMLab Model Deployment Framework
High quality, fast, modular reference implementation of SSD in PyTorch
Serve machine learning models within a Docker container
Framework that is dedicated to making neural data processing
Framework for Accelerating LLM Generation with Multiple Decoding Heads
Toolbox of models, callbacks, and datasets for AI/ML researchers
Lightweight anchor-free object detection model
Sequence-to-sequence framework, focused on Neural Machine Translation
Toolkit for allowing inference and serving with MXNet in SageMaker
CPU/GPU inference server for Hugging Face transformer models