A course on LLM inference serving on Apple Silicon
Agentic system design with Schema-Guided Reasoning (SGR)
A selection of technology stacks and tool repositories
LLM Frontend for Power Users
LLM training in simple, raw C/CUDA
TT-NN operator library, and TT-Metalium low level kernel programming
Port of Facebook's LLaMA model in C/C++
Run Local LLMs on Any Device. Open-source
Official code repo for the O'Reilly Book
Drag & drop UI to build your customized LLM flow
Distribute and run LLMs with a single file
Fast and efficient unstructured data extraction
Fully automatic censorship removal for language models
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
An elegant PyTorch implementation of transformers
Your Second Brain supercharged by Generative AI
The PHP Agentic Framework to build production-ready AI driven apps
Scalable data preprocessing and curation toolkit for LLMs
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Emscripten: An LLVM-to-WebAssembly Compiler
C#/.NET binding of llama.cpp, including LLaMA/GPT model inference
A lightweight vLLM implementation built from scratch
950-line, minimal, extensible LLM inference engine built from scratch
Fast Multimodal LLM on Mobile Devices
A simple, easy-to-hack GraphRAG implementation