GLM-4.6V

GLM-4.6V

Zhipu AI
+
+

Related Products

  • LM-Kit.NET
    29 Ratings
    Visit Website
  • Google AI Studio
    26 Ratings
    Visit Website
  • Gemini Enterprise Agent Platform
    967 Ratings
    Visit Website
  • Adobe Firefly
    25,003 Ratings
    Visit Website
  • Google Cloud Speech-to-Text
    365 Ratings
    Visit Website
  • Fathom
    7,661 Ratings
    Visit Website
  • MobiPDF (formerly PDF Extra)
    6,998 Ratings
    Visit Website
  • AthenaHQ
    38 Ratings
    Visit Website
  • CallHub
    426 Ratings
    Visit Website
  • Docmosis
    51 Ratings
    Visit Website

About

GLM-4.6V is a state-of-the-art open source multimodal vision-language model from the Z.ai (GLM-V) family designed for reasoning, perception, and action. It ships in two variants: a full-scale version (106B parameters) for cloud or high-performance clusters, and a lightweight “Flash” variant (9B) optimized for local deployment or low-latency use. GLM-4.6V supports a native context window of up to 128K tokens during training, enabling it to process very long documents or multimodal inputs. Crucially, it integrates native Function Calling, meaning the model can take images, screenshots, documents, or other visual media as input directly (without manual text conversion), reason about them, and trigger tool calls, bridging “visual perception” with “executable action.” This enables a wide spectrum of capabilities; interleaved image-and-text content generation (for example, combining document understanding with text summarization or generation of image-annotated responses).

About

Oxlo.ai is a privacy-first inference stack for agents, built to run frontier-class open-source models with unlimited agentic tool calls, secure failover, and zero data retention or training. It gives developers request-based access to curated open models through a unified HTTP API designed for predictable usage, low-latency inference, and clean integration into production systems. Teams can call models through OpenAI-compatible endpoints, switch from another provider by changing the base URL and API key, and keep support for streaming, function calling, JSON mode, vision models, embeddings, and image generation. Oxlo.ai supports more than 40 models across text, chat, reasoning, coding, image generation, audio, embeddings, computer vision, vision-language, speech-to-text, text-to-speech, long-context, and detection workflows.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Developers, researchers, and AI engineers wanting a solution to build agents that understand images and text, manipulate documents or UIs, and generate complex image-text outputs

Audience

AI infrastructure teams that need private, OpenAI-compatible access to open models for agents, RAG, coding, vision, audio, and production inference workflows

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

Pricing

Free
Free Version
Free Trial

Pricing

$80 per month
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Zhipu AI
Founded: 2023
China
chat.z.ai/

Company Information

Oxlo.ai
United Arab Emirates
www.oxlo.ai/

Alternatives

Alternatives

GPT-5.2

GPT-5.2

OpenAI
GLM-4.1V

GLM-4.1V

Zhipu AI
GLM-4.5V-Flash

GLM-4.5V-Flash

Zhipu AI
MiMo-V2.5

MiMo-V2.5

Xiaomi Technology

Categories

Categories

Integrations

Claude Code
Cline
DeepSeek-V3
DeepSeek-V4-Flash
FLUX.1
GLM-5
Gemma 3
Kimi K2.5
Kokoro TTS
Llama 3.2
Llama 3.3
Llama 4 Maverick
MiniMax M2.5
Mistral 7B
OpenAI
Qwen2.5
Qwen3
Stable Diffusion
Sup AI
gpt-oss-120b

Integrations

Claude Code
Cline
DeepSeek-V3
DeepSeek-V4-Flash
FLUX.1
GLM-5
Gemma 3
Kimi K2.5
Kokoro TTS
Llama 3.2
Llama 3.3
Llama 4 Maverick
MiniMax M2.5
Mistral 7B
OpenAI
Qwen2.5
Qwen3
Stable Diffusion
Sup AI
gpt-oss-120b
Claim GLM-4.6V and update features and information
Claim GLM-4.6V and update features and information
Claim Oxlo.ai and update features and information
Claim Oxlo.ai and update features and information