Port of Facebook's LLaMA model in C/C++
Connect home devices into a powerful cluster to accelerate LLM
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT method
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
LLM training code for MosaicML foundation models
Run Local LLMs on Any Device. Open-source
Self-hosted, community-driven, local OpenAI compatible API
A Pythonic framework to simplify AI service building
Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere
Operating LLMs in production
Easiest and laziest way for building multi-agent LLMs applications
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
PyTorch library of curated Transformer models and their components
llama.go is like llama.cpp in pure Golang