GLM-4.1V

GLM-4.1V

Zhipu AI
+
+

Related Products

  • LM-Kit.NET
    29 Ratings
    Visit Website
  • Striven
    233 Ratings
    Visit Website
  • Kognition
    2 Ratings
    Visit Website
  • LTX
    181 Ratings
    Visit Website
  • Google AI Studio
    26 Ratings
    Visit Website
  • Gemini Enterprise Agent Platform
    967 Ratings
    Visit Website
  • Rise Vision
    1,497 Ratings
    Visit Website
  • ActCAD Software
    401 Ratings
    Visit Website
  • Mentornity
    99 Ratings
    Visit Website
  • Planview Software Product Delivery
    2 Ratings
    Visit Website

About

Command A Vision is Cohere’s multimodal AI solution built for enterprise use that combines image understanding with language capabilities to drive business outcomes while keeping compute costs low; it extends the Command family by adding vision comprehension, allowing organizations to interpret and act on visual content in concert with text, and integrates into workplace systems to surface insights, boost productivity, and enable more intelligent search and discovery. The offering is positioned alongside Cohere’s broader AI stack and emphasizes putting AI to work in real-world workflows, helping teams unify multimodal signals, extract actionable meaning from images and associated metadata, and surface relevant business intelligence without excessive infrastructure overhead. Command A Vision excels at understanding and analyzing a wide range of visual and multilingual data, including charts, graphs, tables, and diagrams.

About

GLM-4.1V is a vision-language model, providing a powerful, compact multimodal model designed for reasoning and perception across images, text, and documents. The 9-billion-parameter variant (GLM-4.1V-9B-Thinking) is built on the GLM-4-9B foundation and enhanced through a specialized training paradigm using Reinforcement Learning with Curriculum Sampling (RLCS). It supports a 64k-token context window and accepts high-resolution inputs (up to 4K images, any aspect ratio), enabling it to handle complex tasks such as optical character recognition, image captioning, chart and document parsing, video and scene understanding, GUI-agent workflows (e.g., interpreting screenshots, recognizing UI elements), and general vision-language reasoning. In benchmark evaluations at the 10 B-parameter scale, GLM-4.1V-9B-Thinking achieved top performance on 23 of 28 tasks.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Enterprise teams and knowledge workers needing a tool to understand and operationalize visual and textual data together for smarter insights and decision support

Audience

Developers and AI researchers seeking a solution offering a vision-language model that balances size and capability, ideal for building multimodal agents, document/image analysis tools, or GUI-based automation workflows

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

Pricing

No information available.
Free Version
Free Trial

Pricing

Free
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Cohere AI
Founded: 2019
Canada
cohere.com/blog/command-a-vision

Company Information

Zhipu AI
Founded: 2023
China
chat.z.ai/

Alternatives

Command A+

Command A+

Cohere AI

Alternatives

GLM-4.6V

GLM-4.6V

Zhipu AI
Ray2

Ray2

Luma AI
Qwen3.5

Qwen3.5

Alibaba
Cohere

Cohere

Cohere AI
GLM-4.5V-Flash

GLM-4.5V-Flash

Zhipu AI
HunyuanOCR

HunyuanOCR

Tencent

Categories

Categories

Integrations

Claude Code
Cline
Kilo Code
OpenRouter
Roo Code
Sup AI

Integrations

Claude Code
Cline
Kilo Code
OpenRouter
Roo Code
Sup AI
Claim Command A Vision and update features and information
Claim Command A Vision and update features and information
Claim GLM-4.1V and update features and information
Claim GLM-4.1V and update features and information