Cross Audio-Visual Recognition using 3D Architectures
The input pipeline must be prepared by the user. This code provides an implementation of Coupled 3D Convolutional Neural Networks for audio-visual matching. Lip-reading is one specific application of this work. Audio-visual recognition (AVR) has been considered as a solution for speech recognition tasks when the audio is corrupted, as well as a visual recognition method used for speaker verification in multi-speaker scenarios. The approach of AVR systems is to leverage the...
Speech-recognition interface for a domotic system.
This product recognizes oral commands and translates them into orders for a domotic system.
This product does not implement a domotic system. It is an interface to be plugged into an existing domotic system.
Speech recognition is performed by an Arduino UNO board with an EasyVR shield.
The available oral commands are generated from a house description file in XML format. The oral commands have to be trained for a specific user. For this purpose, two interfaces are provided: a command line...
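As a rough illustration of generating commands from a house description, the sketch below walks an XML file and builds one phrase per room, device and action. The schema (the `house`, `room`, `device` and `action` elements and their `name` attributes), the phrase format, and the `generate_commands` helper are all assumptions made for this example; the project's actual file format is not shown here and may differ.

```python
import xml.etree.ElementTree as ET

# Hypothetical house description; the real schema used by the project may differ.
HOUSE_XML = """
<house>
  <room name="kitchen">
    <device name="light">
      <action name="turn on"/>
      <action name="turn off"/>
    </device>
  </room>
  <room name="bedroom">
    <device name="blind">
      <action name="open"/>
    </device>
  </room>
</house>
"""


def generate_commands(xml_text):
    """Return one command phrase per (room, device, action) triple."""
    root = ET.fromstring(xml_text)
    commands = []
    for room in root.iter("room"):
        for device in room.iter("device"):
            for action in device.iter("action"):
                # Assumed phrase format: "<action> <room> <device>"
                phrase = f'{action.get("name")} {room.get("name")} {device.get("name")}'
                commands.append(phrase)
    return commands


commands = generate_commands(HOUSE_XML)
```

Each generated phrase would then be a candidate for per-user training on the recognition hardware; how the phrases are actually uploaded and trained depends on the project's own tooling.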