Inference Llama 2 in one file of pure C
Port of Facebook's LLaMA model in C/C++
Run Local LLMs on Any Device. Open-source
Integrate cutting-edge LLM technology quickly and easily into your app
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Ongoing research training transformer models at scale
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
LLM training in simple, raw C/CUDA
Open-source large language model family from Tencent Hunyuan
Python bindings for the Transformer models implemented in C/C++
Implements a reference architecture for creating information systems