Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT method
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
Trainable models and NN optimization tools
AI interface for tinkerers (Ollama, Haystack RAG, Python)
A high-performance ML model serving framework, offers dynamic batching
Visual Instruction Tuning: Large Language-and-Vision Assistant
Open platform for training, serving, and evaluating language models
Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere