Only note that sphinx4 PLP extraction by default works with 8khz audio.
As for other feature type, you need to implement a tool to create MFC files
yourself. The format of the MFC file is very simple it's just the number of
floats followed by feature vector values.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
I've check that files, some files just have 4 B, it contained nothing, there
is no problem with audio file.
I ignore that files have error, But when I training I get error below:
ERROR: "backward.c", line 430: Failed to align audio to trancript: final state of the search is not reached
ERROR: "baum_welch.c", line 331: train/VI-277.areacode ignored
The contents of the file look wrong. Most likely you are decoding the audio
with the wrong sample rate. The sample rate must be 8khz the feature
extraction configuration inside the tool must be updated accordingly.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Hi,
I completed training model sphinx4 for digits with MFCC's feature, but now I
want to try another feature vector (like PLP...) but I don't know how.
I've searched it but I couldn't find out in anywhere.
Your help is appreciated.
Tks.
Specifically for PLP you can extract features with sphinx4 feature extraction
tool instead of make_feats.pl:
Only note that sphinx4 PLP extraction by default works with 8khz audio.
As for other feature type, you need to implement a tool to create MFC files
yourself. The format of the MFC file is very simple it's just the number of
floats followed by feature vector values.
Thanks nshmyrev! I can't find it in any document.
I want to compare accuracy by using different feature vector. Do you have any
suggestion?, I appreciated about that.
Thanks for your help!
Don't bother with that.
I've extracted features with sphinx4 feature extraction tool. but when I run
script/Runall.pl to training I got error below:
Can you tell me the way to handle it!
Tks.
Maybe something went wrong. Check that files were actually created, print
their contents with sphinx_cepview tool.
I've check that files, some files just have 4 B, it contained nothing, there
is no problem with audio file.
I ignore that files have error, But when I training I get error below:
I printed VI-277.areacode 's contents:
Thanks!
I found another error:
ERROR:
The contents of the file look wrong. Most likely you are decoding the audio
with the wrong sample rate. The sample rate must be 8khz the feature
extraction configuration inside the tool must be updated accordingly.