A high-performance ML model serving framework, offers dynamic batching
Powering Amazon custom machine learning chips
Fast inference engine for Transformer models
Open-Source AI Camera. Empower any camera/CCTV
A unified framework for scalable computing
Database system for building simpler and faster AI-powered application
Replace OpenAI GPT with another LLM in your app
Unified Model Serving Framework
A Pythonic framework to simplify AI service building
The unofficial python package that returns response of Google Bard
Lightweight Python library for adding real-time multi-object tracking
State-of-the-art diffusion models for image and audio generation
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
GPU environment management and cluster orchestration
LLM training code for MosaicML foundation models
High quality, fast, modular reference implementation of SSD in PyTorch
PyTorch library of curated Transformer models and their components
Open platform for training, serving, and evaluating language models
Serve machine learning models within a Docker container
A library for accelerating Transformer models on NVIDIA GPUs
Implementation of "Tree of Thoughts
Library for serving Transformers models on Amazon SageMaker
Guide to deploying deep-learning inference networks
Deploy a ML inference service on a budget in 10 lines of code