A course of learning LLM inference serving on Apple Silicon
Schema-Guided Reasoning (SGR) has agentic system design
Here comes a selection of technology stacks and tool repositories
LLM Frontend for Power Users
LLM training in simple, raw C/CUDA
TT-NN operator library, and TT-Metalium low level kernel programming
Port of Facebook's LLaMA model in C/C++
Official code repo for the O'Reilly Book
Drag & drop UI to build your customized LLM flow
Run Local LLMs on Any Device. Open-source
Distribute and run LLMs with a single file
Fast and efficient unstructured data extraction
Fully automatic censorship removal for language models
An elegent pytorch implement of transformers
Your Second Brain supercharged by Generative AI
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
LLM inference in C/C++
The PHP Agentic Framework to build production-ready AI driven apps
Scalable data pre processing and curation toolkit for LLMs
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
A lightweight vLLM implementation built from scratch
Emscripten: An LLVM-to-WebAssembly Compiler
950 line, minimal, extensible LLM inference engine built from scratch
Fast Multimodal LLM on Mobile Devices