A framework to enable multimodal models to operate a computer
...The framework supports features like Optical Character Recognition (OCR) and Set-of-Mark (SoM) prompting to enhance visual grounding capabilities. It is designed to be compatible with macOS, Windows, and Linux (with X server installed), and is released under the MIT license.
This project is a quest for conscious artificial intelligence. A number of prototypes will be developed as the project progresses.
This project has 2 subprojects:
Object Pascal based CAI NEURAL API - https://github.com/joaopauloschuler/neural-api
Python based K-CAI NEURAL API - https://github.com/joaopauloschuler/k-neural-api
A video from the first prototype has been made:
http://www.youtube.com/watch?v=qH-IQgYy9zg
Above video shows a popperian agent collecting mining ore from 3...
2D robotic simulator in Python 2.x, tested in 2.6 and 2.7, under Win7(64) and Ubuntu 10.04 (Lucid). Needs the matching version of Pygame installed! (www.pygame.org)
Download the .py file and two .bmp files:
-->back2_800_600 for background
-->any of the robo1.bmp through robo4.bmp for robot image (different colors).
The map of the "room" can be changed by modifying the list of obstacles.
Sakura is a Knowledge Navigator and User Interface for UNIX, which implements HyperMedia and its own windowing and packing system, both in the main program and in an extensive API for Tcl and other languages.
Deploy in 115+ regions with the modern database for every enterprise.
MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.