Run AI models locally on your machine with Node.js bindings for llama
Python bindings for llama.cpp
Fully private LLM chatbot that runs entirely in a browser
The official Meta Llama 3 GitHub site
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Utilities intended for use with Llama models
Set of tools to assess and improve LLM security
Port of Facebook's LLaMA model in C/C++
Unifying 3D Mesh Generation with Language Models
Llama Chinese community, real-time aggregation
VS Code extension for LLM-assisted code/text completion
Vim plugin for LLM-assisted code/text completion
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon
Chinese Llama-3 LLMs developed from Meta Llama 3
Instruction-tuning LLM with Chinese Medical Knowledge
Llama 2 inference in one file of pure C
Inference code for CodeLlama models
Run models like Kimi-K2.5, GLM-5, DeepSeek, gpt-oss, Gemma, Qwen etc.
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
LLM training code for MosaicML foundation models
An elegant PyTorch implementation of transformers
Distribute and run LLMs with a single file
Open-source, high-performance AI model with advanced reasoning
Chat with private and local large language models
Alibaba's high-performance LLM inference engine for diverse apps