VisualGLM-6B is an open-source multimodal conversational language model developed by ZhipuAI that accepts images and text in both Chinese and English. It builds on the ChatGLM-6B backbone, which contributes 6.2 billion language parameters, and uses a BLIP2-Qformer visual module to bridge vision and language, for 7.8 billion parameters in total. The model is pretrained on a large bilingual dataset (30 million high-quality Chinese image-text pairs from CogView plus 300 million English pairs) and targets image understanding, description, and question answering. Fine-tuning on long visual QA datasets further aligns its responses with human preferences. The repository provides inference APIs, command-line demos, web demos, and parameter-efficient fine-tuning options such as LoRA, QLoRA, and P-tuning. It also supports quantization down to INT4, which allows local deployment on consumer GPUs with as little as 6.3 GB of VRAM.
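
The memory figure can be sanity-checked with a back-of-the-envelope estimate of weight storage. The sketch below takes the parameter counts quoted above and assumes, purely for illustration, that only the 6.2B language parameters are quantized to INT4 while the remaining parameters stay in FP16. It ignores activations, the attention cache, and framework overhead, so it is a rough estimate rather than a measurement, and the helper name `weight_memory_gb` is hypothetical.

```python
# Rough weight-memory estimate; not a measurement of real GPU usage.
GB = 1e9  # decimal gigabytes

def weight_memory_gb(n_params: float, bits: int) -> float:
    """Memory needed to store n_params weights at the given bit width."""
    return n_params * bits / 8 / GB

TOTAL_PARAMS = 7.8e9      # total parameters, as quoted above
LANGUAGE_PARAMS = 6.2e9   # ChatGLM-6B language parameters, as quoted above
OTHER_PARAMS = TOTAL_PARAMS - LANGUAGE_PARAMS  # vision side (by subtraction)

# Everything stored in FP16 (16 bits per weight).
fp16_total = weight_memory_gb(TOTAL_PARAMS, 16)

# Assumption: language weights in INT4, everything else left in FP16.
int4_mixed = (weight_memory_gb(LANGUAGE_PARAMS, 4)
              + weight_memory_gb(OTHER_PARAMS, 16))

print(f"FP16 weights:             {fp16_total:.1f} GB")
print(f"INT4 language + FP16 rest: {int4_mixed:.1f} GB")
```

Under that assumption the mixed-precision estimate comes out close to the 6.3 GB quoted above, while full FP16 weights alone would need roughly 15.6 GB. Actual usage depends on image size, sequence length, and runtime overhead.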

Features

  • 7.8B parameter multimodal conversational model (6.2B language + vision module)
  • Supports Chinese and English image-based dialogue
  • Pretrained on 330M bilingual image-text pairs (30M Chinese, 300M English) for vision-language alignment
  • Fine-tuning support via LoRA, QLoRA, and P-tuning for domain-specific tasks
  • INT4 quantization allows inference with as little as 6.3 GB of GPU memory
  • Provides CLI demos, web demos, and REST API deployment options
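
As an illustration of the inference API, here is a minimal sketch of a single-turn image query through Hugging Face Transformers. It assumes the weights are published under the model id `THUDM/visualglm-6b` and that the model's remote code exposes `chat` and `quantize` methods, as the upstream README describes. None of this is stated on this page, so treat the names as assumptions and check the repository before relying on them. The wrapper `describe_image` is a hypothetical helper, and running it requires a CUDA GPU and a large download.

```python
from pathlib import Path

# Assumed Hugging Face model id; not stated on this page.
MODEL_ID = "THUDM/visualglm-6b"

def describe_image(image_path, prompt="Describe this image.", quant_bits=None):
    """Ask the model one question about one image (hypothetical helper)."""
    path = Path(image_path)
    if not path.is_file():
        # Fail early, before the expensive model load.
        raise FileNotFoundError(f"image not found: {path}")

    # Imported lazily so this module loads without transformers installed.
    from transformers import AutoModel, AutoTokenizer

    # trust_remote_code is needed because the model class ships with the weights.
    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID, trust_remote_code=True)
    model = AutoModel.from_pretrained(MODEL_ID, trust_remote_code=True)
    if quant_bits is not None:
        # Assumption: quantize(4) or quantize(8) comes from the remote code.
        model = model.quantize(quant_bits)
    model = model.half().cuda().eval()

    # Assumption: chat() takes an image path and returns (response, history).
    response, _history = model.chat(tokenizer, str(path), prompt, history=[])
    return response
```

A call such as `describe_image("photo.jpg", quant_bits=4)` would then load the quantized model and return its answer as a string. For multi-turn dialogue, the returned history would be passed back into the next `chat` call instead of being discarded.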

License

Apache License V2.0

Additional Project Details

Operating Systems

Linux

Programming Language

Python, Unix Shell

Related Categories

Unix Shell Large Language Models (LLM), Unix Shell AI Models, Python Large Language Models (LLM), Python AI Models

Registered

2025-10-04