Gemini 2.5 Computer UseGoogle
|
Holo3.1H Company
|
|||||
Related Products
|
||||||
About
Introducing the Gemini 2.5 Computer Use model, a specialized agent model built on top of Gemini 2.5 Pro’s visual reasoning capabilities, designed to interact directly with user interfaces (UIs). It is exposed via a new computer-use tool in the Gemini API, with inputs that include the user’s request, a screenshot of the UI environment, and a history of recent actions. The model generates function calls corresponding to UI actions like clicking, typing, or selecting, and may request user confirmation for higher-risk tasks. After each action is executed, a new screenshot and URL are fed back into the model to continue the loop until the task completes or is halted. It is optimized primarily for web browser control and shows promise for mobile UI interaction, though it is not yet suited for desktop OS-level control. In benchmarks across web and mobile control tasks, Gemini 2.5 Computer Use outperforms leading alternatives, delivering high accuracy at lower latency.
|
About
Holo3.1 is H Company’s family of fast and local computer-use agents, built to operate across web, desktop, and mobile environments while integrating more smoothly into different agent frameworks and deployment targets. Based on the Qwen family, Holo3.1 improves robustness across the environments where computer-use agents are actually deployed, addressing the distribution shifts that appear across mobile devices, alternative agent harnesses, and different execution frameworks. The release expands Holo3’s capabilities beyond browser and desktop control, with major gains in mobile automation, including AndroidWorld improvements from 67% to 79.3% for the 35B-A3B model and from 58% to 71% for the smaller 4B and 9B variants. Holo3.1 also introduces native support for function-calling protocols in addition to structured JSON outputs, helping teams deploy the model inside third-party agent stacks with near-parity between function-calling and native execution.
|
|||||
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
|||||
Audience
AI/agent developers and organizations needing a tool to interact with interfaces and automate tasks like form entry, navigation, and UI control
|
Audience
AI agent developers and enterprise automation teams that need computer-use models for browser, desktop, and mobile workflows with local deployment and flexible agent-framework integration
|
|||||
Support
Phone Support
24/7 Live Support
Online
|
Support
Phone Support
24/7 Live Support
Online
|
|||||
API
Offers API
|
API
Offers API
|
|||||
Screenshots and Videos |
Screenshots and Videos |
|||||
Pricing
Free
Free Version
Free Trial
|
Pricing
No information available.
Free Version
Free Trial
|
|||||
Reviews/
|
Reviews/
|
|||||
Training
Documentation
Webinars
Live Online
In Person
|
Training
Documentation
Webinars
Live Online
In Person
|
|||||
Company InformationGoogle
Founded: 1998
United States
blog.google/technology/google-deepmind/gemini-computer-use-model/
|
Company InformationH Company
France
hcompany.ai/holo3.1
|
|||||
Alternatives |
Alternatives |
|||||
|
|
|
|||||
|
|
|
|||||
|
|
|
|||||
|
|
|
|||||
Categories |
Categories |
|||||
Integrations
Gemini
Gemini 2.5 Pro
Gemini 3 Deep Think
Gemini Enterprise
Gemini Enterprise Agent Platform
Google AI Studio
JSON
|
Integrations
Gemini
Gemini 2.5 Pro
Gemini 3 Deep Think
Gemini Enterprise
Gemini Enterprise Agent Platform
Google AI Studio
JSON
|
|||||
|
|
|