Run Llama 2 inference in one file of pure C
Tools like web browser, computer access and code runner for LLMs
Code for the paper "Evaluating Large Language Models Trained on Code"
Get up and running with Llama 2 and other large language models
Port of Facebook's LLaMA model in C/C++
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
A guidance language for controlling large language models
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Run Local LLMs on Any Device. Open-source
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Inference code for CodeLlama models
Integrate cutting-edge LLM technology quickly and easily into your app
Emscripten: An LLVM-to-WebAssembly Compiler
Ongoing research training transformer models at scale
Qwen3-Coder is the code version of Qwen3
Distribute and run LLMs with a single file
Self-hosted, community-driven, local OpenAI compatible API
C#/.NET binding of llama.cpp, including LLaMA/GPT model inference
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
LLM training in simple, raw C/CUDA
Open-source large language model family from Tencent Hunyuan
Vector database plugin for Postgres, written in Rust
Database system for building simpler and faster AI-powered applications
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Python bindings for the Transformer models implemented in C/C++