Inference Llama 2 in one file of pure C
A high-performance ML model serving framework, offers dynamic batching
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon
Tools like web browser, computer access and code runner for LLMs
Ongoing research training transformer models at scale
Codes for "Chameleon: Plug-and-Play Compositional Reasoning