Menu

Energy Levels for Effective Word Recognition

Help
2014-07-03
2014-07-04
  • Serotonergic

    Serotonergic - 2014-07-03

    Is there a recommended energy threshold required for sphinx speech recognition to work effectively (specifically for pocketsphinx_continuous, sphinx3_align and sphinx_pitch)?

     
    • Nickolay V. Shmyrev

      I'm not sure what energy threshold are you asking about, there is no such thing in pocketsphinx.

       
  • Serotonergic

    Serotonergic - 2014-07-03

    Thanks for the response. I mean is there a recommended threshold, e.g., SNR value, signal amplitude value, etc., for an input speech file, we know that pocketsphinx or sphinx3_align will struggle to pick up the words. We have noticed that pocketsphinx is not "recognizing" words if their energy values in the speech are "low". Any recommendation in this regard would be very helpful.

     
  • Serotonergic

    Serotonergic - 2014-07-04

    Thanks Pranav. From the second post I would be interested to know what the "high" and "low" values are (recommended). Yes, my data is clean.

     
    • Pranav Jawale

      Pranav Jawale - 2014-07-04

      There is nothing recommended (AFAIK), apart from avoiding clipping. As you
      can see even in WSJ, which is a standard database, they have done recording
      at as low as 0.05 amplitude (and there is no noise there).

       

Log in to post a comment.

MongoDB Logo MongoDB