detect silence in raw file?

Speech Recognition Toolkit

Brought to you by: air, arthchan2003, awb, bhiksha, and 5 others



Forum: Help

Creator: wesamoth

Created: 2011-07-13

Updated: 2012-09-22

wesamoth - 2011-07-13

Hi,

I am currently using pocketsphinx to do speech recognition with a robot.
I receive a data stream containing 1-channel PCM sound (16 kHz), and I am able to
record this sound and decode it in batch mode with ps_decode_raw.
But since the sound is streamed, I would like to decode it in real time, and
therefore detect silences...

Is it possible to do such a thing? If so, how?

Thanks in advance for your answers.


Nickolay V. Shmyrev - 2011-07-13

For an example of continuous decoding, see
pocketsphinx/src/programs/continuous.c. The corresponding tool,
pocketsphinx_continuous, decodes a stream from the microphone or from a file
specified with the -infile option.
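
To illustrate the general idea behind continuous decoding, here is a minimal sketch of an utterance segmenter: it opens an utterance at the first speech frame and closes it after a run of silent frames. This is a generic state machine and not code taken from continuous.c; the names segmenter_t, segmenter_init and segmenter_push are hypothetical, and it assumes some other routine already classifies each frame as speech or silence.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical types for illustration; none of this is pocketsphinx API. */
typedef enum { SEG_IDLE, SEG_IN_SPEECH } seg_state_t;
typedef enum { SEG_NONE, SEG_UTT_START, SEG_UTT_END } seg_event_t;

typedef struct {
    seg_state_t state;
    size_t silent_run;     /* consecutive silent frames seen while in speech */
    size_t max_silent_run; /* number of silent frames that closes an utterance */
} segmenter_t;

void segmenter_init(segmenter_t *seg, size_t max_silent_run)
{
    assert(seg != NULL && max_silent_run > 0);
    seg->state = SEG_IDLE;
    seg->silent_run = 0;
    seg->max_silent_run = max_silent_run;
}

/* Feed one frame decision (non-zero = speech) and get back the event, if any. */
seg_event_t segmenter_push(segmenter_t *seg, int frame_is_speech)
{
    assert(seg != NULL);
    if (seg->state == SEG_IDLE) {
        if (frame_is_speech) {
            seg->state = SEG_IN_SPEECH;
            seg->silent_run = 0;
            return SEG_UTT_START;
        }
        return SEG_NONE;
    }
    /* In speech: any speech frame resets the silence counter. */
    if (frame_is_speech) {
        seg->silent_run = 0;
        return SEG_NONE;
    }
    if (++seg->silent_run >= seg->max_silent_run) {
        seg->state = SEG_IDLE;
        seg->silent_run = 0;
        return SEG_UTT_END;
    }
    return SEG_NONE;
}
```

In a real decoder loop you would presumably call ps_start_utt on SEG_UTT_START, pass each frame to ps_process_raw while in speech, and call ps_end_utt followed by ps_get_hyp on SEG_UTT_END. With 10 ms frames, a max_silent_run of 100 would correspond to about one second of silence.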


wesamoth - 2011-07-18

Correct me if I am wrong, but isn't the "-infile" option only available
since 0.7? I have to use VS2008 for compatibility reasons, so I can only use
0.6...

However, I will take a look at these functions, just to see if I can adapt them.


wesamoth - 2011-07-18

Thanks!
I just integrated and adapted the "ad_file_read" and "recognize_from_file"
functions from the 0.7 version and it works great now!
Bonus question: Is there a way to avoid writing into a file and decode
silence directly from a buffer?


Nickolay V. Shmyrev - 2011-07-18

"Is there a way to avoid writing into a file and decode silence directly
from a buffer?"

You can use the sphinxbase AD API to decode from buffers too. It is no different
from reading from a file.
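
To show what silence detection on an in-memory buffer can look like, here is a minimal sketch that classifies one frame of 16-bit PCM samples by its mean absolute amplitude. This is a plain energy threshold, not the algorithm sphinxbase uses internally; the function names and the threshold value are assumptions chosen for illustration, and a real threshold would have to be calibrated against the background noise.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper (not part of sphinxbase): mean absolute amplitude
 * of one frame of signed 16-bit PCM samples. */
int32_t frame_level(const int16_t *frame, size_t n)
{
    int64_t sum = 0;
    size_t i;

    assert(frame != NULL && n > 0);
    for (i = 0; i < n; i++) {
        int32_t s = frame[i];      /* widen first so -(-32768) cannot overflow */
        sum += (s < 0) ? -s : s;
    }
    return (int32_t)(sum / (int64_t)n);
}

/* Returns 1 if the frame level is below the threshold, 0 otherwise.
 * The threshold is an assumed tuning parameter. */
int frame_is_silent(const int16_t *frame, size_t n, int32_t threshold)
{
    return frame_level(frame, n) < threshold;
}
```

At 16 kHz, a 10 ms frame is 160 samples, so a streamed buffer can be walked in steps of 160 samples and each step passed to frame_is_silent.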


wesamoth - 2011-07-28

Perfect! I had some problems adapting all that stuff but in the end, it works
perfectly.

Thanks for your answers!

