After running the entire sphinxtrain training, we are unable to test the WER with the decoder. This is not related to the training process, since according to our logs all previous steps are completed until this point.
What happens is that when running sphinxtrain -s decode run, we only get the output "MODULE: DECODE Decoding using models previously trained". It never continues to "Decoding 130 segments starting at 0 (part 1 of 1)". The decoder creates 2 files in the result folder, but not the align file. We are using the sphinxtrain-master version off of github for training.
I know for sure that the word-align.pl script exists on the machine, but it seems that the decoder is unable to access this. The path to the script is "/usr/local/lib/sphinxtrain/scripts/decode/word_align.pl".
Is there any way to give the decoder a pointer to where the script is? I've tried in vain to export to PATH.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
After running the entire sphinxtrain training, we are unable to test the WER with the decoder. This is not related to the training process, since according to our logs all previous steps are completed until this point.
What happens is that when running
sphinxtrain -s decode run
, we only get the output "MODULE: DECODE Decoding using models previously trained". It never continues to "Decoding 130 segments starting at 0 (part 1 of 1)". The decoder creates 2 files in the result folder, but not the align file. We are using the sphinxtrain-master version off of github for training.I know for sure that the word-align.pl script exists on the machine, but it seems that the decoder is unable to access this. The path to the script is "/usr/local/lib/sphinxtrain/scripts/decode/word_align.pl".
Is there any way to give the decoder a pointer to where the script is? I've tried in vain to export to PATH.
What is inside those files? What is inside decoding log in logdir/decode?
Attached are the logdir/decode and result files.
Ok, share testing/04/r5310980/u0980055.wav
I sent you a DM.
You didn't convert the training data into proper format. It should be 16khz 16bit mono pcm. You can convert the files with sph2pipe.
Because data is garbage, decoder goes out of memory and crashes.
We are training a model for 8kHz (specified in sphinx_train.cfg). Do we have to specify this when running the decoder aswell?
No, it picks the values from cfg file.
Upon further investigation I have found that some of the wav files are PCM MU-LAW, while others are PCM ALAW. Do you think this might be the problem?
EDIT:
I would like to add to this that the model that we've trained has very bad recognition, which is why testing the WER is so important.
Last edit: Simen H 2019-03-05