I am trying to create a new acoustic model for my language, using the latest versions of sphinxtrain, sphinxbase and pocketsphinx. When I train my model, sphinxtrain stops in the "Training Context Independent models" module, just after "Phase 1: Cleaning up directories".
I get the following output:
Sphinxtrain path: /usr/local/lib/sphinxtrain
Sphinxtrain binaries path: /usr/local/libexec/sphinxtrain
Running the training
MODULE: 000 Computing feature from audio files
Extracting features from segments starting at (part 1 of 1)
Extracting features from segments starting at (part 1 of 1)
Feature extraction is done
MODULE: 00 verify training files
Phase 1: Checking to see if the dict and filler dict agrees with the phonelist file.
Found 23846 words using 46 phones
Phase 2: Checking to make sure there are not duplicate entries in the dictionary
Phase 3: Check general format for the fileids file; utterance length (must be positive); files exist
Phase 4: Checking number of lines in the transcript file should match lines in fileids file
Phase 5: Determine amount of training data, see if n_tied_states seems reasonable.
Estimated Total Hours Training: 4.30869166666667
This is a small amount of data, no comment at this time
Phase 6: Checking that all the words in the transcript are in the dictionary
Words in dictionary: 23841
Words in filler dictionary: 5
Phase 7: Checking that all the phones in the transcript are in the phonelist, and all phones in the phonelist appear at least once
MODULE: 0000 train grapheme-to-phoneme model
Skipped (set $CFG_G2P_MODEL = 'yes' to enable)
MODULE: 01 Train LDA transformation
Skipped (set $CFG_LDA_MLLT = 'yes' to enable)
MODULE: 02 Train MLLT transformation
Skipped (set $CFG_LDA_MLLT = 'yes' to enable)
MODULE: 05 Vector Quantization
Skipped for continuous models
MODULE: 10 Training Context Independent models for forced alignment and VTLN
Skipped: $ST::CFG_FORCEDALIGN set to 'no' in sphinx_train.cfg
Skipped: $ST::CFG_VTLN set to 'no' in sphinx_train.cfg
MODULE: 11 Force-aligning transcripts
Skipped: $ST::CFG_FORCEDALIGN set to 'no' in sphinx_train.cfg
MODULE: 12 Force-aligning data for VTLN
Skipped: $ST::CFG_VTLN set to 'no' in sphinx_train.cfg
MODULE: 20 Training Context Independent models
Phase 1: Cleaning up directories:
accumulator...logs...qmanager...models...
Phase 2: Flat initialize
Last edit: ham 2015-08-21
You need to make sure you installed sphinxbase correctly and configured the linker to load it (the LD_LIBRARY_PATH part).
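One way to check the LD_LIBRARY_PATH part is sketched below. It only looks for files named `libsphinxbase.so*` in the directories listed in `LD_LIBRARY_PATH`; that file name pattern and the helper name `find_sphinxbase` are assumptions made for illustration and are not part of sphinxbase or sphinxtrain. It does not look at the system linker cache, so an empty result does not by itself prove the library cannot be loaded.

```python
import os
from pathlib import Path


def find_sphinxbase(ld_library_path):
    """Return files named libsphinxbase.so* found in the given search path.

    ld_library_path is a string in the same format as the LD_LIBRARY_PATH
    environment variable (directories separated by os.pathsep).
    Empty entries and entries that are not directories are skipped.
    """
    hits = []
    for entry in ld_library_path.split(os.pathsep):
        if not entry:
            continue
        directory = Path(entry)
        if directory.is_dir():
            # Assumption: the shared library file name starts with libsphinxbase.so
            hits.extend(sorted(directory.glob("libsphinxbase.so*")))
    return hits


if __name__ == "__main__":
    search_path = os.environ.get("LD_LIBRARY_PATH", "")
    found = find_sphinxbase(search_path)
    if found:
        for lib in found:
            print("found", lib)
    else:
        print("no libsphinxbase.so* in LD_LIBRARY_PATH =", repr(search_path))
```

If nothing is found, the directory where sphinxbase was installed is probably missing from `LD_LIBRARY_PATH`. With the `/usr/local` prefix shown in the log above, that directory would likely be `/usr/local/lib`, but this is a guess based on the paths in the log.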
Thank you, sir. Now it works properly when I train an4, but it still does not work for my database. Note that I have only feature files, so I did not start training from comp_feat. I train my acoustic model using the following command: sphinxtrain -f verify cesm run. I get the following output:
MODULE: 20 Training Context Independent models
Phase 1: Cleaning up directories:
accumulator...logs...qmanager...models...
Phase 2: Flat initialize
Phase 3: Forward-Backward
Baum welch starting for 1 Gaussian(s), iteration: 1 (1 of 1)
Waiting for /home/ham/Bureau/cesm/model_parameters/cesm.ci_cont_flatinitial/mixture_weights
Waiting for /home/ham/Bureau/cesm/model_parameters/cesm.ci_cont_flatinitial/mixture_weights
Waiting for /home/ham/Bureau/cesm/model_parameters/cesm.ci_cont_flatinitial/mixture_weights
.
.
Best regards,
Last edit: ham 2015-09-03
You can share the database, including the logs, to get help with this issue. Pack it into an archive, upload it to Dropbox, and post the link here.
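Before packing everything, it may help to look through the logs yourself. The sketch below walks a log folder and prints every line that contains `ERROR` or `FATAL`. The folder name `logdir`, the `.log` extension, the two marker strings, and the helper name `scan_logs` are all assumptions for illustration; adjust them to whatever your training directory actually contains.

```python
from pathlib import Path


def scan_logs(log_root, markers=("ERROR", "FATAL")):
    """Yield (path, line_number, text) for log lines containing any marker."""
    for log_file in sorted(Path(log_root).rglob("*.log")):
        # errors="replace" keeps the scan going if a log has odd bytes in it
        with open(log_file, errors="replace") as handle:
            for number, line in enumerate(handle, start=1):
                if any(marker in line for marker in markers):
                    yield log_file, number, line.rstrip()


if __name__ == "__main__":
    # Hypothetical location: a "logdir" folder inside the training directory.
    for path, number, text in scan_logs("logdir"):
        print(f"{path}:{number}: {text}")
```

Any lines it prints are a good thing to quote in the thread together with the archive link.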
Yes, here is the link: https://www.dropbox.com/s/usjyvmpxrf4u97k/asr.tar.gz?dl=0
Thank you very much for all your help and useful information.