Holo3.1H Company
|
Qwen2.5-VLAlibaba
|
|||||
Related Products
|
||||||
About
Holo3.1 is H Company’s family of fast and local computer-use agents, built to operate across web, desktop, and mobile environments while integrating more smoothly into different agent frameworks and deployment targets. Based on the Qwen family, Holo3.1 improves robustness across the environments where computer-use agents are actually deployed, addressing the distribution shifts that appear across mobile devices, alternative agent harnesses, and different execution frameworks. The release expands Holo3’s capabilities beyond browser and desktop control, with major gains in mobile automation, including AndroidWorld improvements from 67% to 79.3% for the 35B-A3B model and from 58% to 71% for the smaller 4B and 9B variants. Holo3.1 also introduces native support for function-calling protocols in addition to structured JSON outputs, helping teams deploy the model inside third-party agent stacks with near-parity between function-calling and native execution.
|
About
Qwen2.5-VL is the latest vision-language model from the Qwen series, representing a significant advancement over its predecessor, Qwen2-VL. This model excels in visual understanding, capable of recognizing a wide array of objects, including text, charts, icons, graphics, and layouts within images. It functions as a visual agent, capable of reasoning and dynamically directing tools, enabling applications such as computer and phone usage. Qwen2.5-VL can comprehend videos exceeding one hour in length and can pinpoint relevant segments within them. Additionally, it accurately localizes objects in images by generating bounding boxes or points and provides stable JSON outputs for coordinates and attributes. The model also supports structured outputs for data like scanned invoices, forms, and tables, benefiting sectors such as finance and commerce. Available in base and instruct versions across 3B, 7B, and 72B sizes, Qwen2.5-VL is accessible through platforms like Hugging Face and ModelScope.
|
|||||
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
|||||
Audience
AI agent developers and enterprise automation teams that need computer-use models for browser, desktop, and mobile workflows with local deployment and flexible agent-framework integration
|
Audience
AI researchers, developers, and enterprises seeking a powerful vision-language model for advanced image analysis, document processing, and multimodal AI applications
|
|||||
Support
Phone Support
24/7 Live Support
Online
|
Support
Phone Support
24/7 Live Support
Online
|
|||||
API
Offers API
|
API
Offers API
|
|||||
Screenshots and Videos |
Screenshots and Videos |
|||||
Pricing
No information available.
Free Version
Free Trial
|
Pricing
Free
Free Version
Free Trial
|
|||||
Reviews/
|
Reviews/
|
|||||
Training
Documentation
Webinars
Live Online
In Person
|
Training
Documentation
Webinars
Live Online
In Person
|
|||||
Company InformationH Company
France
hcompany.ai/holo3.1
|
Company InformationAlibaba
Founded: 1999
China
qwenlm.github.io/blog/qwen2.5-vl/
|
|||||
Alternatives |
Alternatives |
|||||
|
|
|
|||||
|
|
|
|||||
|
|
|
|||||
|
|
|
|||||
Categories |
Categories |
|||||
Integrations
Alibaba Cloud
BLACKBOX AI
Hugging Face
JSON
LM-Kit.NET
ModelScope
Parasail
Qwen Studio
kluster.ai
|
Integrations
Alibaba Cloud
BLACKBOX AI
Hugging Face
JSON
LM-Kit.NET
ModelScope
Parasail
Qwen Studio
kluster.ai
|
|||||
|
|
|