A high-throughput and memory-efficient inference and serving engine
Everything you need to build state-of-the-art foundation models
Create HTML profiling reports from pandas DataFrame objects
Replace OpenAI GPT with another LLM in your app
Deep learning optimization library: makes distributed training easy
A unified framework for scalable computing
Open platform for training, serving, and evaluating language models
Unified Model Serving Framework
PyTorch extensions for fast R&D prototyping and Kaggle farming
Sequence-to-sequence framework, focused on Neural Machine Translation