Trainable models and NN optimization tools
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods
Visual Instruction Tuning: Large Language-and-Vision Assistant
A high-performance ML model serving framework that offers dynamic batching
Open platform for training, serving, and evaluating language models
Run any Llama 2 model locally with a Gradio UI on GPU or CPU from anywhere