Trainable models and NN optimization tools
Visual Instruction Tuning: Large Language-and-Vision Assistant
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
A high-performance ML model serving framework, offers dynamic batching
AI interface for tinkerers (Ollama, Haystack RAG, Python)
Open platform for training, serving, and evaluating language models
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT method
Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere