Open platform for training, serving, and evaluating language models
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models
An easy-to-use LLM quantization package with user-friendly APIs
Visual Instruction Tuning: Large Language-and-Vision Assistant
PyTorch library of curated Transformer models and their components
A library for accelerating Transformer models on NVIDIA GPUs
Run any Llama 2 model locally with a Gradio UI on GPU or CPU from anywhere