Hi, i'm doing a training with sphinxtrain and it stop at this point:
Sphinxtrain path: /usr/lib/sphinxtrain
Sphinxtrain binaries path: /usr/lib/sphinxtrain
Running the training
MODULE: 000 Computing feature from audio files
Extracting features from segments starting at (part 1 of 1)
Extracting features from segments starting at (part 1 of 1)
Feature extraction is done
MODULE: 00 verify training files
Phase 1: Checking to see if the dict and filler dict agrees with the phonelist file.
Found 19 words using 21 phones
Phase 2: Checking to make sure there are not duplicate entries in the dictionary
Phase 3: Check general format for the fileids file; utterance length (must be positive); files exist
Phase 4: Checking number of lines in the transcript file should match lines in fileids file
Phase 5: Determine amount of training data, see if n_tied_states seems reasonable.
Estimated Total Hours Training: 0.0347111111111111
This is a small amount of data, no comment at this time
Phase 6: Checking that all the words in the transcript are in the dictionary
Words in dictionary: 16
Words in filler dictionary: 3
Use of uninitialized value $_[0] in substitution (s///) at /usr/share/perl/5.22/File/Basename.pm line 180, <trn> line 57.
fileparse(): need a valid pathname at /usr/lib/sphinxtrain/scripts/00.verify/verify_all.pl line 352.</trn>
Can you help me,please?
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Hi, i'm doing a training with sphinxtrain and it stop at this point:
Sphinxtrain path: /usr/lib/sphinxtrain
Sphinxtrain binaries path: /usr/lib/sphinxtrain
Running the training
MODULE: 000 Computing feature from audio files
Extracting features from segments starting at (part 1 of 1)
Extracting features from segments starting at (part 1 of 1)
Feature extraction is done
MODULE: 00 verify training files
Phase 1: Checking to see if the dict and filler dict agrees with the phonelist file.
Found 19 words using 21 phones
Phase 2: Checking to make sure there are not duplicate entries in the dictionary
Phase 3: Check general format for the fileids file; utterance length (must be positive); files exist
Phase 4: Checking number of lines in the transcript file should match lines in fileids file
Phase 5: Determine amount of training data, see if n_tied_states seems reasonable.
Estimated Total Hours Training: 0.0347111111111111
This is a small amount of data, no comment at this time
Phase 6: Checking that all the words in the transcript are in the dictionary
Words in dictionary: 16
Words in filler dictionary: 3
Use of uninitialized value $_[0] in substitution (s///) at /usr/share/perl/5.22/File/Basename.pm line 180, <trn> line 57.
fileparse(): need a valid pathname at /usr/lib/sphinxtrain/scripts/00.verify/verify_all.pl line 352.</trn>
Can you help me,please?
Your input file etc/...train.transcription does not have proper format. You need to review it and make sure it is properly prepared.