Gobii
Gobii is a cloud-hosted platform that enables you to spin up fully managed browser-automation agents via API, allowing tasks like web-based research, form-filling, data extraction, and multi-step workflows to be automated at scale. These agents operate like “always-on employees” that can browse websites, even those without APIs, navigate dynamic content, handle JavaScript, and even rotate proxies automatically. Users can create agents, assign them prompts or tasks, and retrieve structured JSON outputs or live previews of the agent’s browser actions. Gobii supports synchronous and asynchronous task execution, secret handling for things like login credentials, schema-enforced output validation, and integrates with popular programming languages (Python, Node.js) for seamless implementation. The platform emphasises scalability (hundreds of tasks in parallel), enterprise-grade security (audit logs, proxies, task management), and a simple developer experience.
Learn more
Qwen2.5-VL
Qwen2.5-VL is the latest vision-language model from the Qwen series, representing a significant advancement over its predecessor, Qwen2-VL. This model excels in visual understanding, capable of recognizing a wide array of objects, including text, charts, icons, graphics, and layouts within images. It functions as a visual agent, capable of reasoning and dynamically directing tools, enabling applications such as computer and phone usage. Qwen2.5-VL can comprehend videos exceeding one hour in length and can pinpoint relevant segments within them. Additionally, it accurately localizes objects in images by generating bounding boxes or points and provides stable JSON outputs for coordinates and attributes. The model also supports structured outputs for data like scanned invoices, forms, and tables, benefiting sectors such as finance and commerce. Available in base and instruct versions across 3B, 7B, and 72B sizes, Qwen2.5-VL is accessible through platforms like Hugging Face and ModelScope.
Learn more
Jace
Meet your new AI assistant and focus on meaningful things. A groundbreaking digital assistant, JACE represents the future of AI agents, going beyond traditional uses of current AI chatbots like ChatGPT and their text-generation focus. Instead, JACE focuses on taking action in the digital world. It differs from existing AI-powered chatbots due to its complex cognitive architecture, which enables it to complete high-difficulty tasks. JACE can control and perform actions in the browser similarly to a human user, excelling in managing complex tasks that involve web automation, interaction, and direct communication. This is due to the development and training of Zeta Labs’ proprietary web-interaction model, AWA-1 (Autonomous Web Agent-1), which enables JACE to reliably execute tasks over long periods of time, effectively handling the challenges and inconsistencies commonly found in web interfaces.
Learn more
Lux
Lux is a powerful computer-use AI platform that enables agents to operate software just like a human user—clicking, typing, navigating, and completing tasks across any interface. It offers three execution modes—Tasker, Actor, and Thinker—giving developers the ability to choose between step-by-step precision, near-instant task execution, or long-form reasoning for complex workflows. Lux can autonomously perform actions such as crawling Amazon data, running automated QA tests, or extracting insights from Nasdaq’s insider activity pages. The platform makes it possible to prototype and deploy real computer-use agents in as little as 20 minutes using developer-friendly SDKs and templates. Its agents are built to understand vague goals, execute long-running operations, and interact naturally with human-facing software instead of relying solely on APIs. Lux represents a new paradigm where AI goes beyond reasoning and content generation to directly operate computers at scale.
Learn more