CodeGeeX2 is the second-generation multilingual code generation model from ZhipuAI, built on the ChatGLM2-6B architecture and further pretrained on 600B code tokens. Compared with the first generation, it delivers a substantial boost in programming ability across many languages: despite having only 6B parameters, it outperforms even larger models such as StarCoder-15B on benchmarks like HumanEval-X. The model handles code generation, translation, summarization, debugging, and comment generation, and supports over 100 programming languages. With Multi-Query Attention, Flash Attention, and quantization options, CodeGeeX2 achieves faster generation and lightweight deployment, requiring as little as 6GB of GPU memory at INT4 precision. Its backend powers the CodeGeeX plugins for VS Code, JetBrains IDEs, and other editors, giving developers interactive AI assistance with features such as infilling and cross-file completion.
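A minimal sketch of querying the model through Hugging Face `transformers`. The `THUDM/codegeex2-6b` checkpoint name and the `# language:` prompt convention follow the project's published usage, but treat the exact identifiers and generation settings here as assumptions rather than a definitive recipe:

```python
def build_prompt(language: str, instruction: str) -> str:
    """CodeGeeX2 prompts begin with a language tag comment, then the task."""
    return f"# language: {language}\n# {instruction}\n"


def generate(prompt: str, model_id: str = "THUDM/codegeex2-6b",
             max_new_tokens: int = 256) -> str:
    """Run one completion; requires `transformers` and a CUDA GPU."""
    # Deferred import: transformers and the checkpoint are heavy dependencies.
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
    model = AutoModel.from_pretrained(
        model_id, trust_remote_code=True
    ).half().cuda().eval()

    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    outputs = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(outputs[0], skip_special_tokens=True)
```

A call would then look like `generate(build_prompt("Python", "write a bubble sort function"))`; the language tag steers the model toward the requested language.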
## Features
- Trained on 600B code tokens with significant multilingual coding improvements
- Surpasses larger models (e.g., StarCoder-15B) on benchmarks like HumanEval-X
- Efficient inference with Flash Attention and Multi-Query Attention
- Supports 100+ programming languages with infilling and cross-file completion
- Lightweight deployment with INT4 quantization needing ~6GB GPU memory
- Integrates into editors (VS Code, JetBrains IDEs such as PyCharm and WebStorm, etc.) via the CodeGeeX plugin
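The ~6GB INT4 figure above can be sanity-checked with back-of-the-envelope arithmetic; the few gigabytes of runtime overhead mentioned in the comment are an illustrative assumption, not a measured value:

```python
def weight_memory_gb(n_params: float, bits_per_param: int) -> float:
    """Memory needed just to hold the weights, in gigabytes (1 GB = 1e9 bytes)."""
    return n_params * bits_per_param / 8 / 1e9


int4_weights = weight_memory_gb(6e9, 4)    # 3.0 GB for the weights alone
fp16_weights = weight_memory_gb(6e9, 16)   # 12.0 GB at half precision
# Activations, the KV cache, and framework overhead add a few GB on top of
# the weight footprint, which is how an INT4 deployment of a 6B-parameter
# model lands near the ~6GB mark, versus well over 12GB at FP16.
```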