Starting Pocket sphinx with small audio of numbers

Speech Recognition Toolkit

Brought to you by: air, arthchan2003, awb, bhiksha, and 5 others

This project can now be found here.

Starting Pocket sphinx with small audio of numbers

Forum: Help

Creator: abdul rashid

Created: 2017-06-01

Updated: 2017-06-07

abdul rashid - 2017-06-01

I am new to pocketsphinx and starting with the latest version on windows. I have audio of numbers which I am trying with pocketsphinx. The audio contains the words (eight eight four one six seven three six nine). but its shows as eighty four one six seven three six nine.
Any help/guidance is appreciated.

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
- Nickolay V. Shmyrev - 2017-06-02
  
  Speech recognition is not always accurate. For more details on how to make it accurate read http://cmusphinx.github.io/wiki/tutorialtuning
  
  If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

abdul rashid - 2017-06-07

I am trying to train model . Its a small audio and I am facing the Error "Failed to Align audio to transcript". Can you help me out to understand it. I am attaching the Audio file , the transcript and error deatails in text file.

eightysevendollars.wav

log.txt

x.transcription

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
- Nickolay V. Shmyrev - 2017-06-07
  
  You miss </s> in the x.transcription file before (x)
  
  If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

abdul rashid - 2017-06-07

Thanks . I added at the end but still it say "Failed to Align audio to transcript". I have attached the log.

New Log.txt

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
- Nickolay V. Shmyrev - 2017-06-07
  
  Your audio file is 8khz, you are trying to extract features with samprate 16000 and probably using 16-khz model, this is not a good idea. The command line is also different from tutorial, you miss argfile argument in sphinx_fe.
  
  If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

abdul rashid - 2017-06-07

I changed audio to 16khz. Also i am using -argfile en-us/feat.params as argument in sphinx_fe. Still having same error.

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
- Nickolay V. Shmyrev - 2017-06-07
  
  You can not do that, you need to use 8khz model and follow the tutorial properly.
  
  If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Log in to post a comment.