I am new to pocketsphinx and starting with the latest version on windows. I have audio of numbers which I am trying with pocketsphinx. The audio contains the words (eight eight four one six seven three six nine). but its shows as eighty four one six seven three six nine.
Any help/guidance is appreciated.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
I am trying to train model . Its a small audio and I am facing the Error "Failed to Align audio to transcript". Can you help me out to understand it. I am attaching the Audio file , the transcript and error deatails in text file.
Your audio file is 8khz, you are trying to extract features with samprate 16000 and probably using 16-khz model, this is not a good idea. The command line is also different from tutorial, you miss argfile argument in sphinx_fe.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
I am new to pocketsphinx and starting with the latest version on windows. I have audio of numbers which I am trying with pocketsphinx. The audio contains the words (eight eight four one six seven three six nine). but its shows as eighty four one six seven three six nine.
Any help/guidance is appreciated.
Speech recognition is not always accurate. For more details on how to make it accurate read http://cmusphinx.github.io/wiki/tutorialtuning
I am trying to train model . Its a small audio and I am facing the Error "Failed to Align audio to transcript". Can you help me out to understand it. I am attaching the Audio file , the transcript and error deatails in text file.
You miss
</s>
in thex.transcription
file before(x)
Thanks . I added at the end but still it say "Failed to Align audio to transcript". I have attached the log.
Your audio file is 8khz, you are trying to extract features with samprate 16000 and probably using 16-khz model, this is not a good idea. The command line is also different from tutorial, you miss argfile argument in sphinx_fe.
I changed audio to 16khz. Also i am using -argfile en-us/feat.params as argument in sphinx_fe. Still having same error.
You can not do that, you need to use 8khz model and follow the tutorial properly.