VisualGLM-6B is an open-source multimodal conversational language model developed by ZhipuAI that supports both images and text in Chinese and English. It builds on the ChatGLM-6B backbone, with 6.2 billion language parameters, and incorporates a BLIP2-Qformer visual module to connect vision and language. In total, the model has 7.8 billion parameters. Trained on a large bilingual dataset — including 30 million high-quality Chinese image-text pairs from CogView and 300 million English pairs — VisualGLM-6B is designed for image understanding, description, and question answering. Fine-tuning on long visual QA datasets further aligns the model’s responses with human preferences. The repository provides inference APIs, command-line demos, web demos, and efficient fine-tuning options like LoRA, QLoRA, and P-tuning. It also supports quantization down to INT4, enabling local deployment on consumer GPUs with as little as 6.3 GB VRAM.

Features

  • 7.8B parameter multimodal conversational model (6.2B language + vision module)
  • Supports Chinese and English image-based dialogue
  • Pretrained on 330M bilingual image-text pairs for strong alignment
  • Fine-tuning support via LoRA, QLoRA, and P-tuning for domain-specific tasks
  • Efficient INT4 quantization allows inference with only 6.3 GB GPU memory
  • Provides CLI demos, web demos, and REST API deployment options

Project Samples

Project Activity

See All Activity >

License

Apache License V2.0

Follow VisualGLM-6B

VisualGLM-6B Web Site

Other Useful Business Software
AI-generated apps that pass security review Icon
AI-generated apps that pass security review

Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
Try Retool free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of VisualGLM-6B!

Additional Project Details

Operating Systems

Linux

Programming Language

Python, Unix Shell

Related Categories

Unix Shell Large Language Models (LLM), Unix Shell AI Models, Python Large Language Models (LLM), Python AI Models

Registered

2025-10-04