Port of Facebook's LLaMA model in C/C++
Python bindings for llama.cpp
Qwen3 is the large language model series developed by the Qwen team
GLM-4 series: Open Multilingual Multimodal Chat LMs
Chinese LLaMA & Alpaca large language models + local CPU/GPU training
JetBrains’ 4B-parameter code model for completions