Trainable models and NN optimization tools
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT method
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
AI interface for tinkerers (Ollama, Haystack RAG, Python)
Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere
Visual Instruction Tuning: Large Language-and-Vision Assistant
A high-performance ML model serving framework, offers dynamic batching
Open platform for training, serving, and evaluating language models