Preprocess Sound before passing into Pocket Sphinx

Forum: Speech Recognition Theory

Creator: marcus obrien

Created: 2019-08-25

Updated: 2019-08-25

marcus obrien - 2019-08-25

I want to improve my voice recognition system on a Raspberry Pi 3 running Pocket Sphinx. It's pretty accurate with a limited recognition vocabulary, but it performs badly when there is other noise around.

Is there any way to pre-process sound before feeding it into Pocket Sphinx so that only the human voice gets through? I was thinking there might be some way to reduce the data being passed in, e.g. keeping only the frequency range of the human voice, removing white noise, and so on. I haven't found anything simple so far. I'm thinking of filtering and processing the voice recordings on an STM32, which has an Arm Cortex-M4 core with DSP instructions.
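
To make the "certain frequency" idea concrete, here is a minimal sketch of a speech-band filter built from two second-order (biquad) sections: a high-pass followed by a low-pass. Everything in it is an assumption rather than something from this thread: the 300-3400 Hz telephone band, the 16 kHz mono sample rate, and the function names are illustrative only. The coefficients follow the widely used Audio EQ Cookbook formulas with Q = 1/sqrt(2) (Butterworth response).

```python
import math

def biquad_coeffs(kind, cutoff_hz, sample_rate_hz, q=1.0 / math.sqrt(2.0)):
    """Return normalized (b0, b1, b2, a1, a2) for a low-pass or high-pass biquad."""
    w0 = 2.0 * math.pi * cutoff_hz / sample_rate_hz
    cos_w0 = math.cos(w0)
    alpha = math.sin(w0) / (2.0 * q)
    if kind == "lowpass":
        b0 = (1.0 - cos_w0) / 2.0
        b1 = 1.0 - cos_w0
    elif kind == "highpass":
        b0 = (1.0 + cos_w0) / 2.0
        b1 = -(1.0 + cos_w0)
    else:
        raise ValueError("kind must be 'lowpass' or 'highpass'")
    b2 = b0
    a0 = 1.0 + alpha
    a1 = -2.0 * cos_w0
    a2 = 1.0 - alpha
    # Divide through by a0 so the filter loop needs no division.
    return (b0 / a0, b1 / a0, b2 / a0, a1 / a0, a2 / a0)

def biquad_filter(samples, coeffs):
    """Apply one biquad section (direct form I) to a sequence of floats."""
    b0, b1, b2, a1, a2 = coeffs
    x1 = x2 = y1 = y2 = 0.0
    out = []
    for x in samples:
        y = b0 * x + b1 * x1 + b2 * x2 - a1 * y1 - a2 * y2
        x2, x1 = x1, x
        y2, y1 = y1, y
        out.append(y)
    return out

def speech_bandpass(samples, sample_rate_hz=16000, low_hz=300.0, high_hz=3400.0):
    """Keep roughly low_hz..high_hz (assumed band, not a Pocket Sphinx requirement)."""
    hp = biquad_coeffs("highpass", low_hz, sample_rate_hz)
    lp = biquad_coeffs("lowpass", high_hz, sample_rate_hz)
    return biquad_filter(biquad_filter(samples, hp), lp)

def tone(freq_hz, sample_rate_hz=16000, seconds=0.5):
    """Generate a unit-amplitude sine wave for testing."""
    n = int(sample_rate_hz * seconds)
    return [math.sin(2.0 * math.pi * freq_hz * i / sample_rate_hz) for i in range(n)]

def rms(samples):
    """Root-mean-square level of a sequence of floats."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))

if __name__ == "__main__":
    for freq in (50, 1000, 7000):
        x = tone(freq)
        y = speech_bandpass(x)
        # Skip the first samples so the filter start-up transient is ignored.
        print(freq, "Hz gain:", round(rms(y[2000:]) / rms(x[2000:]), 3))
```

A filter like this only attenuates energy outside the chosen band, such as mains hum or high-frequency hiss. Noise that overlaps the speech band (other voices, music, a TV) passes straight through, and narrowing the band may itself hurt accuracy if the acoustic model was trained on wider-band audio, so it is worth measuring recognition accuracy before and after. The same two-section structure should map onto the biquad cascade routines in CMSIS-DSP if the filtering is moved to the Cortex-M4.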

- Nickolay V. Shmyrev - 2019-08-25
  
  You already asked the same question at https://dsp.stackexchange.com/questions/60334/preprocess-sound-for-voice-recognition-in-pocket-sphinx-on-raspberry-pi-3
  