Find the local LLM that actually runs and performs best
ChatGLM2-6B: An Open Bilingual Chat LLM
Agentic, Reasoning, and Coding (ARC) foundation models
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
A.S.E (AICGSecEval) is a repository-level AI-generated code security
Code for the paper "Evaluating Large Language Models Trained on Code"
LongBench v2 and LongBench (ACL '25 & '24)
Leaderboard Comparing LLM Performance at Producing Hallucinations
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
LLM inference in C/C++
High-speed Large Language Model Serving for Local Deployment
ChatGLM3 series: Open Bilingual Chat LLMs
Benchmark LLMs by fighting in Street Fighter 3
Implement a CPU from scratch and play with large model deployments
Advanced language and coding AI model
A high-performance ML model serving framework, offers dynamic batching
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Run AI models locally on your machine with Node.js bindings for llama
157 models, 30 providers, one command to find what runs on hardware
The official repo of Qwen chat & pretrained large language model
Capable of understanding text, audio, vision, video
Open-source model for program synthesis
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Unleashing 10,000+ Word Generation from Long Context LLMs