CMU Sphinx / Forums / Help: new model pocketspinx5 prealpha

Marcello - 2017-10-23

I am creating an acoustic model for Italian with about 300 words. I do not need continuous speech recognition, only keyword recognition. I use Ubuntu, and I created my new acoustic model in /usr/local/share/pocketsphinx/model/ara by creating the etc folder and the waw folder, which contains a speaker1 folder with a 16 kHz mono WAV file. After preparing the database I put it in the model folder /usr/local/share/pocketsphinx/model/ara and typed the command: sphinxtrain -t an4 setup. According to the tutorial (https://cmusphinx.github.io/wiki/tutorialam/) I should then have this new structure:

├─ etc
├─ feat
├─ logdir
├─ model_parameters
├─ model_architecture
├─ result
└─ wav

But this does not happen; only two files are created in etc: feat.params and sphinx_train.cfg. My starting structure is:

├─ etc
│ ├─ ara.dic
│ ├─ ara.phone
│ ├─ ara.filler
│ ├─ ara_train.fileids
│ ├─ ara_train.transcription
│ ├─ ara_test.fileids
│ ├─ ara_test.transcription
│ └─ ara.jsgf
└─ wav
   └─ speaker_1
      └─ file_1.wav
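
Since the post describes the recording as a mono WAV file, a quick format check on the audio can rule out one class of problems before training. The following is a minimal sketch using only Python's standard `wave` module; the function names `check_wav` and `check_wav_tree` are hypothetical, and the expected 16000 Hz mono format is an assumption based on the description above, not something sphinxtrain reports.

```python
import wave
from pathlib import Path


def check_wav(path, expected_rate=16000, expected_channels=1):
    """Return a list of format problems for one WAV file (empty if none)."""
    problems = []
    with wave.open(str(path), "rb") as wav:
        rate = wav.getframerate()
        channels = wav.getnchannels()
    if rate != expected_rate:
        problems.append(f"sample rate is {rate} Hz, expected {expected_rate} Hz")
    if channels != expected_channels:
        problems.append(f"{channels} channel(s), expected {expected_channels}")
    return problems


def check_wav_tree(wav_root):
    """Check every .wav file below wav_root; return {path: problems} for bad files."""
    report = {}
    for path in sorted(Path(wav_root).rglob("*.wav")):
        problems = check_wav(path)
        if problems:
            report[str(path)] = problems
    return report
```

Calling `check_wav_tree("/usr/local/share/pocketsphinx/model/ara/wav")` would list only the files whose header does not match the assumed format.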

I edited sphinx_train.cfg: I commented out the $CFG_LANGUAGEMODEL line, uncommented the DEC_CFG_GRAMMAR line, and set $CFG_INITIAL_DENSITIES and $CFG_FINAL_NUM_DENSITIES to 8. Now when I run sphinxtrain -t an4 run, the log file shows:

sphinx_fe.c(967): Processing all remaining utterances at position 0

INFO: sphinx_fe.c(787): Converting /usr/local/share/pocketsphinx/model/ara/wav/speaker1/file_1.wav to /usr/local/share/pocketsphinx/model/ara/feat/speaker1/file_1.mfc
ERROR: "sphinx_fe.c", line 118: Failed to open /usr/local/share/pocketsphinx/model/ara/wav/speaker1/file_1.wav: No such file or directory

But the WAV file exists in this directory!

My model is here: https://github.com/McNamara10/ara

Can you help me?
- Nickolay V. Shmyrev - 2017-10-23
  
  The folder with WAV files must be called "wav", not "waw" with a final 'w' as in your GitHub repository. When the computer tells you the file is missing, check for yourself; computers do not lie or try to fool you.
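
A mistake like this can be caught before running sphinxtrain by checking that every entry in the fileids file resolves to an audio file on disk. The sketch below assumes the layout from the tutorial, where each line of `etc/<name>.fileids` is a path relative to the `wav` folder without the file extension; the function name `find_missing_audio` and its parameters are hypothetical.

```python
from pathlib import Path


def find_missing_audio(model_dir, fileids_name, wav_dir="wav", extension="wav"):
    """Return the audio paths listed in a fileids file that do not exist on disk."""
    model_dir = Path(model_dir)
    fileids_path = model_dir / "etc" / fileids_name
    missing = []
    for line in fileids_path.read_text().splitlines():
        fileid = line.strip()
        if not fileid:
            continue  # skip blank lines
        audio = model_dir / wav_dir / f"{fileid}.{extension}"
        if not audio.is_file():
            missing.append(audio)
    return missing
```

For example, `find_missing_audio("/usr/local/share/pocketsphinx/model/ara", "ara_train.fileids")` would return a non-empty list while the audio folder is still named "waw".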
  

Marcello - 2017-10-23

Yes, I see it now. The error was stupid, sorry :). The conversion works now, but I have another error in the phone list. So once the run command completes without errors, will I have the new structure:

├─ etc
├─ feat
├─ logdir
├─ model_parameters
├─ model_architecture
├─ result
└─ wav

or do I need other operations?

- Nickolay V. Shmyrev - 2017-10-24
  
  It is hard to understand your question.
  

Sorry, but I do not understand: when I launch the command sphinxtrain -t ara setup, these folders are not created:

├─ model_parameters
├─ model_architecture
├─ result

as in the example at https://cmusphinx.github.io/wiki/tutorialam/

So I have converted my WAV file to MFC (with your help: the folder was named waw, and I renamed it to wav).
Now sphinxtrain -t ara run completes:

MODULE: 000 Computing feature from audio files  (2017-10-24 10:24)

Extracting features from segments starting at (part 1 of 1)

sphinx_fe Log File
completed
Extracting features from segments starting at (part 1 of 1)

sphinx_fe Log File
completed
Feature extraction is done

MODULE: 00 verify training files    (2017-10-24 10:24)

Phase 1: Checking to see if the dict and filler dict agrees with the phonelist file.
WARNING: The phonelist (/usr/local/share/pocketsphinx/model/ara/etc/ara.phone) does not define the phone SIL (required!)
Found 4 words using 6 phones
WARNING: This phone (SIL) occurs in the dictionary (/usr/local/share/pocketsphinx/model/ara/etc/ara.dic), but not in the phonelist (/usr/local/share/pocketsphinx/model/ara/etc/ara.phone)
passed
Phase 2: Checking to make sure there are not duplicate entries in the dictionary
passed
Phase 3: Check general format for the fileids file; utterance length (must be positive); files exist
passed
Phase 4: Checking number of lines in the transcript file should match lines in fileids file
passed
Phase 5: Determine amount of training data, see if n_tied_states seems reasonable.
Estimated Total Hours Training: 0.000425
This is a small amount of data, no comment at this time
WARNING
Phase 6: Checking that all the words in the transcript are in the dictionary
Words in dictionary: 1
Words in filler dictionary: 3
passed
Phase 7: Checking that all the phones in the transcript are in the phonelist, and all phones in the phonelist appear at least once
passed
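
The two SIL warnings in Phase 1 come from a mismatch between the phone list and the dictionaries. A rough way to find such mismatches by hand is sketched below. It is not the logic of the sphinxtrain verification script itself, only an illustration that assumes the usual text formats: one phone per line in the phone list, and a word followed by its phones on each dictionary line. The function names are hypothetical.

```python
def read_phonelist(path):
    """Read a phone list file with one phone per line."""
    with open(path, encoding="utf-8") as handle:
        return {line.strip() for line in handle if line.strip()}


def phones_in_dictionary(path):
    """Collect every phone used in a dictionary file ('WORD PH1 PH2 ...' per line)."""
    phones = set()
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            fields = line.split()
            phones.update(fields[1:])  # first field is the word itself
    return phones


def compare_phones(phonelist_path, dictionary_paths):
    """Return (phones used but not listed, phones listed but never used)."""
    listed = read_phonelist(phonelist_path)
    used = set()
    for path in dictionary_paths:
        used |= phones_in_dictionary(path)
    return sorted(used - listed), sorted(listed - used)
```

Running `compare_phones("etc/ara.phone", ["etc/ara.dic", "etc/ara.filler"])` on a setup like the one in the log would be expected to report SIL as used but not listed.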

I have looked at ara.html in my model folder (ara) and it looks correct. My new structure is here: https://github.com/McNamara10/model-ara-new/tree/master/ara

If this is correct, should I launch these commands:
perl scripts_pl/000.comp_feat/slave_feat.pl
perl scripts_pl/00.verify/verify_all.pl
perl scripts_pl/10.vector_quantize/slave.VQ.pl
perl scripts_pl/20.ci_hmm/slave_convg.pl
perl scripts_pl/30.cd_hmm_untied/slave_convg.pl
perl scripts_pl/40.buildtrees/slave.treebuilder.pl
perl scripts_pl/45.prunetree/slave-state-tying.pl
perl scripts_pl/50.cd_hmm_tied/slave_convg.pl
perl scripts_pl/90.deleted_interpolation/deleted_interpolation.pl
as in the example at https://cmusphinx.github.io/wiki/tutorialam/

And will these scripts create these files?
mdef
feat.params
mixture_weights
means
noisedict
transition_matrices
variances

Last edit: Marcello 2017-10-24

Marcello - 2017-10-25

Now I understand. I looked at the log and corrected the error, and the run command has now created the folders, but I get this error:
INFO: feat.c(715): Initializing feature stream to type: '1s_c_d_dd', ceplen=13, CMN='batch', VARNORM='no', AGC='none'
ERROR: "acmod.c", line 79: Folder '/usr/local/share/pocketsphinx/model/ara/model_parameters/ara.cd_cont_10' does not contain acoustic model definition 'mdef'
FATAL: "batch.c", line 822: PocketSphinx decoder init failed
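
Before starting the decoder it can help to check which model files are actually present in the folder named in the error. This is a minimal sketch; the list of expected file names is taken from the earlier post in this thread, and whether a given training configuration produces all of them is an assumption. The function name `missing_model_files` is hypothetical.

```python
from pathlib import Path

# File names as listed earlier in this thread; treat the list as an assumption.
EXPECTED_FILES = (
    "mdef",
    "feat.params",
    "mixture_weights",
    "means",
    "noisedict",
    "transition_matrices",
    "variances",
)


def missing_model_files(model_dir, expected=EXPECTED_FILES):
    """Return the expected file names that are not present in model_dir."""
    model_dir = Path(model_dir)
    return [name for name in expected if not (model_dir / name).is_file()]
```

For the folder in the error message, `missing_model_files("/usr/local/share/pocketsphinx/model/ara/model_parameters/ara.cd_cont_10")` would include "mdef" if the training stages that write it did not complete.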
