GLM-4 is a family of open models from ZhipuAI that spans base, chat, and reasoning variants at both 32B and 9B scales, with long-context support and practical local-deployment options. The GLM-4-32B-0414 models are pretrained on ~15T tokens of high-quality data (including substantial synthetic reasoning data), then post-trained with preference alignment, rejection sampling, and reinforcement learning to improve instruction following, coding, function calling, and agent-style behaviors. The GLM-Z1-32B-0414 line adds deeper mathematical, coding, and logical reasoning via extended reinforcement learning and pairwise ranking feedback, while GLM-Z1-Rumination-32B-0414 introduces a “rumination” mode that performs longer, tool-using deep research for complex, open-ended tasks. The lightweight GLM-Z1-9B-0414 brings many of these techniques to a smaller model, targeting strong reasoning under tight resource budgets.
Features
- Model lineup: GLM-4-32B (Base/Chat), GLM-Z1-32B (Reasoning), GLM-Z1-Rumination-32B, plus GLM-4-9B and GLM-Z1-9B variants
- Long context: 32K native, with guidance for YaRN RoPE scaling up to 128K and specific settings for the Z1 models (see the RoPE-scaling sketch after this list)
- Training pipeline: ~15T-token pretraining plus preference alignment, rejection sampling, and RL to boost chat, coding, and tool use
- Reasoning focus: Z1 models strengthened for math/code/logic; Rumination model supports deep, tool-assisted research workflows
- Inference support available or merged upstream for vLLM, transformers, and llama.cpp; OpenAI-style API examples and prompt templates included (see the request sketch after this list)
- Fine-tuning support with example scripts and requirements; guidance for resource-constrained inference and quantization (see the quantized-loading sketch after this list)
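
As a rough illustration of the long-context guidance, the sketch below extends the native 32K window by setting a YaRN `rope_scaling` entry before loading the model with transformers. The repo id, scaling factor, and exact key names are assumptions here; the officially recommended values (including the Z1-specific settings) should be taken from the model cards.

```python
# Minimal sketch: extending GLM-4's native 32K context via YaRN RoPE scaling.
# The rope_scaling keys/values below are assumptions -- check the model card
# for the officially recommended settings before relying on them.
from transformers import AutoConfig, AutoModelForCausalLM, AutoTokenizer

model_id = "THUDM/GLM-4-32B-0414"  # assumed Hugging Face repo id

config = AutoConfig.from_pretrained(model_id)
# YaRN scaling with factor 4.0 stretches the 32K training window toward ~128K.
config.rope_scaling = {
    "type": "yarn",
    "factor": 4.0,
    "original_max_position_embeddings": 32768,
}

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    config=config,
    torch_dtype="auto",
    device_map="auto",
)
```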
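
The OpenAI-style API examples typically target a local vLLM server exposing its OpenAI-compatible endpoint. The following is a minimal sketch of that pattern; the serve command, port, model id, and sampling settings are assumptions, not the repo's exact examples.

```python
# Minimal sketch of the OpenAI-style usage pattern, assuming a local vLLM
# server started with something like:
#   vllm serve THUDM/GLM-4-32B-0414 --tensor-parallel-size 2
# (model id and flags are assumptions; see the repo's deployment docs).
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8000/v1",  # default vLLM OpenAI-compatible endpoint
    api_key="EMPTY",                      # vLLM does not require a real key by default
)

response = client.chat.completions.create(
    model="THUDM/GLM-4-32B-0414",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Write a Python function that reverses a string."},
    ],
    temperature=0.6,
    max_tokens=512,
)
print(response.choices[0].message.content)
```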
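
For resource-constrained inference, one common option (not necessarily the repo's exact recipe) is to load the 9B reasoning variant in 4-bit with bitsandbytes via transformers. The sketch below assumes the `THUDM/GLM-Z1-9B-0414` repo id and its bundled chat template; the quantization settings are illustrative.

```python
# Minimal sketch of resource-constrained inference: 4-bit loading of the 9B
# reasoning variant with bitsandbytes. Repo id and generation settings are
# assumptions; the repository's own scripts are the reference.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_id = "THUDM/GLM-Z1-9B-0414"  # assumed Hugging Face repo id

quant_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
    bnb_4bit_quant_type="nf4",
)

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=quant_config,
    device_map="auto",
)

messages = [{"role": "user", "content": "Prove that the square root of 2 is irrational."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
outputs = model.generate(inputs, max_new_tokens=1024)
print(tokenizer.decode(outputs[0][inputs.shape[1]:], skip_special_tokens=True))
```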