When training an acoustic model , I successfully prepared data , compile the requied packages , run the train script and since I am using all default configuration so there is no change on etc/sphinx_train.cfg file .
You can download file structure with the link below to see if I made any mistake , File Structure
Alis-MacBook-Pro:an4 aliakhtar$ sphinxtrain run
Sphinxtrain path: /usr/local/lib/sphinxtrain
Sphinxtrain binaries path: /usr/local/libexec/sphinxtrain
Running the training
MODULE: 000 Computing feature from audio files
Failed to open control file /Users/aliakhtar/Documents/tutorial/an4/etc/an4_train.fileids: No such file or directory at /usr/local/lib/sphinxtrain/scripts/000.comp_feat/make_feats.pl line 89.
Failed to open control file /Users/aliakhtar/Documents/tutorial/an4/etc/an4_test.fileids: No such file or directory at /usr/local/lib/sphinxtrain/scripts/000.comp_feat/make_feats.pl line 89.
Feature extraction is done
MODULE: 00 verify training files
Can not open the dictionary (/Users/aliakhtar/Documents/tutorial/an4/etc/an4.dic) at /usr/local/lib/sphinxtrain/scripts/00.verify/verify_all.pl line 58.
I am using macOS Sierra 10.12.6 , Can someone help me what I am doing wrong , I made my own an4 folder with language model in ARPA format ,
Last edit: Ali 2018-01-24
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
(ERROR: Not enough data for the training) Should I need to remove this error for successfull training or there are other things which I am missing . ??
If I am making a speech recogntion system that only recognize counting from 1 to 9 ? How much training data I need ?
If i sphinxtrain -t an4 setup , then (set CFG_CD_TRAIN to "no") , then run sphinxtrain run , I got these errors
Alis-MacBook-Pro:an4 aliakhtar$ sphinxtrain run
Sphinxtrain path: /usr/local/lib/sphinxtrain
Sphinxtrain binaries path: /usr/local/libexec/sphinxtrain
Running the training
Configuration (e.g. etc/sphinx_train.cfg) not defined
Compilation failed in require at /usr/local/lib/sphinxtrain/scripts/000.comp_feat/slave_feat.pl line 51.
BEGIN failed--compilation aborted at /usr/local/lib/sphinxtrain/scripts/000.comp_feat/slave_feat.pl line 51.
Alis-MacBook-Pro:an4 aliakhtar$
Last edit: Ali 2018-01-24
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
1 hour of recording for command and control for a single speaker
5 hours of recordings of 200 speakers for command and control for many speakers
10 hours of recordings for single speaker dictation
50 hours of recordings of 200 speakers for many speakers dictation
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Can anyone relpy on above post ?
Or can someone provide whole database folder that was successfully trained and tested so I can compile at my side and made by folder same like that ?
No, your model was not trained successfully. It is pretty clear that there was error in decoding.
Or can someone provide whole database folder that was successfully trained and tested so I can compile at my side and made by folder same like that ?
An4 link is provided in tutorial.
Another thing should I need to replace
$DEC_CFG_LANGUAGEMODEL = "$CFG_BASE_DIR/etc/${CFG_DB_NAME}.lm.DMP";
with
$DEC_CFG_LANGUAGEMODEL = "$CFG_BASE_DIR/etc/${CFG_DB_NAME}.lm";
on my sphinx_train.cfg Since I have a langugae model file with extension lm only on my database folder
Yes
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
When training an acoustic model , I successfully prepared data , compile the requied packages , run the train script and since I am using all default configuration so there is no change on
etc/sphinx_train.cfg
file .You can download file structure with the link below to see if I made any mistake ,
File Structure
Below is the picture of my root folder
tutorial folder root directory inside
When run the folowing command
sphinxtrain run
I got these error logs
I am using macOS Sierra 10.12.6 , Can someone help me what I am doing wrong , I made my own
an4
folder with language model in ARPA format ,Last edit: Ali 2018-01-24
Files in etc folder should have name an4_test, not a4_test like in your case. You miss letter 'n' in filename.
Got these logs after fixing name of the file ,
(ERROR: Not enough data for the training) Should I need to remove this error for successfull training or there are other things which I am missing . ??
If I am making a speech recogntion system that only recognize counting from 1 to 9 ? How much training data I need ?
If i
sphinxtrain -t an4 setup
, then (set CFG_CD_TRAIN to "no") , then runsphinxtrain run
, I got these errorsLast edit: Ali 2018-01-24
Data requirements are listed in tutorial
http://cmusphinx.github.io/wiki/tutorialam
Is my training successfully done ??
Last edit: Ali 2018-01-25
Can anyone relpy on above post ?
Or can someone provide whole database folder that was successfully trained and tested so I can compile at my side and made by folder same like that ?
Another thing should I need to replace
with
on my
sphinx_train.cfg
Since I have a langugae model file with extensionlm
only on my database folderThanks
Last edit: Ali 2018-01-28
No, your model was not trained successfully. It is pretty clear that there was error in decoding.
An4 link is provided in tutorial.
Yes
I was not compiling pocketsphinx
I am succusscfully train an acoustic model using an4 database that is provided
Now I need to train it on my own database
Special thank Nickolay V. Shmyrev , for replying my post
Last edit: Ali 2018-01-28
Hi,
Did you use Your format of files in databases or format from the database that is given in tutorial?