GLM-4 is a family of open models from ZhipuAI that spans base, chat, and reasoning variants at both 32B and 9B scales, with long-context support and practical local-deployment options. The GLM-4-32B-0414 models are trained on ~15T high-quality data (including substantial synthetic reasoning data), then post-trained with preference alignment, rejection sampling, and reinforcement learning to improve instruction following, coding, function calling, and agent-style behaviors. The GLM-Z1-32B-0414 line adds deeper mathematical, coding, and logical reasoning via extended reinforcement learning and pairwise ranking feedback, while GLM-Z1-Rumination-32B-0414 introduces a “rumination” mode that performs longer, tool-using deep research for complex, open-ended tasks. A lightweight GLM-Z1-9B-0414 brings many of these techniques to a smaller model, targeting strong reasoning under tight resource budgets.

Features

  • Model lineup: GLM-4-32B (Base/Chat), GLM-Z1-32B (Reasoning), GLM-Z1-Rumination-32B, and GLM-(Z1)-9B variants
  • Long context: 32K native; guidance for YaRN rope scaling to reach up to 128K (and specific Z1 settings)
  • Training pipeline: 15T pretraining plus preference alignment, rejection sampling, and RL to boost chat, code, and tool use
  • Reasoning focus: Z1 models strengthened for math/code/logic; Rumination model supports deep, tool-assisted research workflows
  • Implementations available/merged for vLLM, transformers, and llama.cpp; OpenAI-style API examples and prompt templates included
  • Fine-tuning support with example scripts and requirements; guidance for resource-constrained inference and quantization

Project Samples

Project Activity

See All Activity >

License

Apache License V2.0

Follow GLM-4

GLM-4 Web Site

Other Useful Business Software
MongoDB Atlas runs apps anywhere Icon
MongoDB Atlas runs apps anywhere

Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
Start Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of GLM-4!

Additional Project Details

Operating Systems

Linux

Programming Language

Python

Related Categories

Python Large Language Models (LLM), Python AI Models

Registered

2025-10-04