recognition free download

Self-Operating Computer

A framework to enable multimodal models to operate a computer

...Notably, it was the first known project to implement a multimodal model capable of viewing and controlling a computer screen. The framework supports features like Optical Character Recognition (OCR) and Set-of-Mark (SoM) prompting to enhance visual grounding capabilities. It is designed to be compatible with macOS, Windows, and Linux (with X server installed), and is released under the MIT license.

1 Review

Downloads: 2 This Week

Last Update: 2025-02-28

See Project

ILA - teachable voice assistant

ILA is a fully customizable and teachable voice assistant for Java

ILA stands for (kind of) intelligent, learning assistant and is a speech recognition system aka voice assistant very similar to Siri, Google Now and Cortana. ILA is fully customizable and you can teach her/him/it new things by yourself like executing system commands, opening web pages, programs and apps or just some basic conversation :-) ILA runs on Java und thus is compatible to Windows, Mac and Linux. It is designed to integrate with your home enviroment and for example build up your own, free and open Amazon Echo replacement ;-) Right now the key components of ILA are the open source speech recognition CMU Sphinx-4, Google (Speech Recognition/Text-To-Speech) and MaryTTS (Text-To-Speech). ...

4 Reviews

Downloads: 1 This Week

Last Update: 2018-07-23

See Project

Hemera - Intelligent System

Hemera is a Virtual Intelligent System aggregating some more advanced Artificial Intelligence Technologies (speech, speech recognition, form recognition, motion recognition ...); with applications in daily tasks, domotics and robotics ...

Downloads: 0 This Week

Last Update: 2015-01-21

See Project

PlateGatewayQt

PlateGatewayQt is an GNU GPL open source license plate recognition tool

Downloads: 0 This Week

Last Update: 2015-08-07

See Project

Open Pandora's Box

Pandora is an artificial intelligent web based bot

Pandora is an artificial intelligent web based bot written in Java. Pandora is a component based AI architecture including, database memory, XML, voice, voice rec, chat, IRC, HTTP, Wiktionary, Freebase, consciousness, language, GUI, applet, web, jsp, Android

1 Review

Downloads: 0 This Week

Last Update: 2013-11-20

See Project

openSMILE

SMILE = Speech & Music Interpretation by Large Space Extraction openSMILE is a fast, real-time (audio) feature extraction utility for automatic speech, music and paralinguistic recognition research developed originally at TUM in the scope of the EU-project SEMAINE, now maintained and supported by audEERING.

Downloads: 0 This Week

Last Update: 2014-11-27

See Project

sciSoccer

sciSoccer is a framework to develop strategies for soccer playing robots like RobotCup. It is composed by libraries for robot communication, video capturing, image recognition, strategy implementation and a graphical user interface.

Downloads: 0 This Week

Last Update: 2013-04-23

See Project

openEAR

openEAR is the Munich Open-Source Emotion and Affect Recognition Toolkit developed at the Technische Universität München (TUM). It provides efficient (audio) feature extraction algorithms implemented in C++, classfiers, and pre-trained models on well-known emotion databases. It is now maintained and supported by audEERING. Updates will follow soon.

4 Reviews

Downloads: 8 This Week

Last Update: 2015-08-06

See Project

Search Results for "recognition"

Showing 8 open source projects for "recognition"

Self-Operating Computer

ILA - teachable voice assistant

Hemera - Intelligent System

PlateGatewayQt

Open Pandora's Box

openSMILE

sciSoccer

openEAR

Search Results for "recognition"

Showing 8 open source projects for "recognition"

Self-Operating Computer

ILA - teachable voice assistant

Hemera - Intelligent System

PlateGatewayQt

Open Pandora's Box

openSMILE

sciSoccer

openEAR

Related Searches

Related Categories