GLM-4.5: Open-source LLM for intelligent agents by Z.ai
LLM training in simple, raw C/CUDA
Diversity-driven optimization and large-model reasoning ability
Text-space optimizer that trains reusable natural-language skills
ChatGLM3 series: Open Bilingual Chat LLMs | Open Source Bilingual Chat
A lightweight vLLM implementation built from scratch
TONL (Token-Optimized Notation Language)
Run PyTorch LLMs locally on servers, desktop and mobile
Inference Llama 2 in one file of pure C
Distributed LLM and StableDiffusion inference
Llama 2 Everywhere (L2E)
Efficient MoE reasoning model for coding and math workloads