GLM-4.6V

GLM-4.6V

Zhipu AI
MedGemma

MedGemma

Google DeepMind
+
+

Related Products

  • LM-Kit.NET
    25 Ratings
    Visit Website
  • Google AI Studio
    11 Ratings
    Visit Website
  • Vertex AI
    944 Ratings
    Visit Website
  • Google Cloud Speech-to-Text
    375 Ratings
    Visit Website
  • Picsart Enterprise
    27 Ratings
    Visit Website
  • AthenaHQ
    33 Ratings
    Visit Website
  • Fathom
    7,370 Ratings
    Visit Website
  • MobiPDF (formerly PDF Extra)
    6,519 Ratings
    Visit Website
  • CallHub
    424 Ratings
    Visit Website
  • Docmosis
    48 Ratings
    Visit Website

About

GLM-4.6V is a state-of-the-art open source multimodal vision-language model from the Z.ai (GLM-V) family designed for reasoning, perception, and action. It ships in two variants: a full-scale version (106B parameters) for cloud or high-performance clusters, and a lightweight “Flash” variant (9B) optimized for local deployment or low-latency use. GLM-4.6V supports a native context window of up to 128K tokens during training, enabling it to process very long documents or multimodal inputs. Crucially, it integrates native Function Calling, meaning the model can take images, screenshots, documents, or other visual media as input directly (without manual text conversion), reason about them, and trigger tool calls, bridging “visual perception” with “executable action.” This enables a wide spectrum of capabilities; interleaved image-and-text content generation (for example, combining document understanding with text summarization or generation of image-annotated responses).

About

MedGemma is a collection of Gemma 3 variants that are trained for performance on medical text and image comprehension. Developers can use MedGemma to accelerate building healthcare-based AI applications. MedGemma currently comes in two variants: a 4B multimodal version and a 27B text-only version. MedGemma 4B utilizes a SigLIP image encoder that has been specifically pre-trained on a variety of de-identified medical data, including chest X-rays, dermatology images, ophthalmology images, and histopathology slides. Its LLM component is trained on a diverse set of medical data, including radiology images, histopathology patches, ophthalmology images, and dermatology images. MedGemma 4B is available in both pre-trained (suffix: -pt) and instruction-tuned (suffix -it) versions. The instruction-tuned version is a better starting point for most applications.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Developers, researchers, and AI engineers wanting a solution to build agents that understand images and text, manipulate documents or UIs, and generate complex image-text outputs

Audience

Healthcare AI developers wanting a solution offering fine-tunable models for medical text and image comprehension tasks

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

Pricing

Free
Free Version
Free Trial

Pricing

No information available.
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Zhipu AI
Founded: 2023
China
chat.z.ai/

Company Information

Google DeepMind
Founded: 2010
United Kingdom
deepmind.google/models/gemma/medgemma/

Alternatives

GPT-5.2

GPT-5.2

OpenAI

Alternatives

CodeGemma

CodeGemma

Google
GLM-4.1V

GLM-4.1V

Zhipu AI
PaliGemma 2

PaliGemma 2

Google
GLM-4.5V-Flash

GLM-4.5V-Flash

Zhipu AI
Gemma

Gemma

Google
Qwen3-VL

Qwen3-VL

Alibaba
Gemma 3n

Gemma 3n

Google DeepMind
Qwen3.5

Qwen3.5

Alibaba

Categories

Categories

Integrations

Claude Code
Cline
Dr7.ai
Gemma 2
Gemma 3
Gemma 4
Hugging Face
Kilo Code
OpenRouter
Roo Code
Sup AI
Vertex AI

Integrations

Claude Code
Cline
Dr7.ai
Gemma 2
Gemma 3
Gemma 4
Hugging Face
Kilo Code
OpenRouter
Roo Code
Sup AI
Vertex AI
Claim GLM-4.6V and update features and information
Claim GLM-4.6V and update features and information
Claim MedGemma and update features and information
Claim MedGemma and update features and information