computer vision free download

Self-Operating Computer

A framework to enable multimodal models to operate a computer

The Self-Operating Computer Framework is an innovative system that enables multimodal models to autonomously operate a computer by interpreting the screen and executing mouse and keyboard actions to achieve specified objectives. This framework is compatible with various multimodal models and currently integrates with GPT-4o, o1, Gemini Pro Vision, Claude 3, and LLaVa.

1 Review

Downloads: 9 This Week

Last Update: 2025-02-28

See Project

AskUI Vision Agent

Enable AI to control your desktop, mobile and HMI devices

...The repository presents a feature overview, sample media, and frequent release notes, which show ongoing improvements such as CORS checks and other operational tweaks. The broader AskUI documentation covers the Python Vision Agent along with suite services and inference APIs, indicating a productized ecosystem rather than a single library. Community-curated lists also recognize Vision Agent as part of the broader “GUI agents” landscape, placing it among other computer-use agents.

Downloads: 11 This Week

Last Update: 2 days ago

See Project

Skyvern

Automate browser-based workflows with LLMs and Computer Vision

Skyvern uses a combination of computer vision and AI to understand content on a webpage, making it adaptable to any website. Skyvern takes instructions in natural language, allowing it to execute complex objectives with simple commands. Skyvern is an API-first product. Workflows execute in the cloud, allowing it to run hundreds of workflows at the same time. Skyvern's AI decisions come with built-in explanations, providing clear summaries and justifications for every action. ...

Downloads: 10 This Week

Last Update: 3 days ago

See Project

BotSharp

AI Multi-Agent Framework in .NET

...It opens up as much learning power as possible for your own robots and precisely control every step of the AI processing pipeline. BotSharp is an open source machine learning framework for AI Bot platform builder. This project involves natural language understanding, computer vision and audio processing technologies, and aims to promote the development and application of intelligent robot assistants in information systems. Out-of-the-box machine learning algorithms allow ordinary programmers to develop artificial intelligence applications faster and easier. It's written in C# running on .Net Core that is full cross-platform framework. ...

Downloads: 0 This Week

Last Update: 2025-10-17

See Project

OAGI Python SDK

Python SDK for the Computer Use model Lux, developed by OpenAGI

OAGI Python SDK is a Python client library for the Lux computer-use model that turns Lux into a programmable automation layer for operating human-facing software via vision and actions. It exposes the OAGI API in an ergonomic way, letting you trigger Lux in three main modes: Tasker for precise scripted sequences, Actor for fast one-shot tasks, and Thinker for open-ended, multi-step objectives.

Downloads: 6 This Week

Last Update: 2026-02-22

See Project

uvsim

This project has been renamed to oooark. Old file releases will still be available here. uvsim is a project focused on enabling algorithm development for unmanned systems. It is being constructed to provide an identical interface to simulations and h

Downloads: 0 This Week

Last Update: 2013-05-21

See Project

Intelligent Camera Controlling System

The project is aimed at automatic target following using a camera , a computer vision system and a microcontroller that moves the cam. The project should mainly work under linux and it might be ported into windows,

Downloads: 0 This Week

Last Update: 2013-04-19

See Project

Search Results for "computer vision"

Showing 7 open source projects for "computer vision"

Self-Operating Computer

AskUI Vision Agent

Skyvern

BotSharp

OAGI Python SDK

uvsim

Intelligent Camera Controlling System

Search Results for "computer vision"

Showing 7 open source projects for "computer vision"

Self-Operating Computer

AskUI Vision Agent

Skyvern

BotSharp

OAGI Python SDK

uvsim

Intelligent Camera Controlling System

Related Searches

Related Categories