I got this error when I try to train the acoustic model
WARNING: This phone (SIL) occurs in the phonelist (/home/suranga/Downloads/Final_Dev/an4/etc/an4.phone), but not in any word in the transcription (/home/suranga/Downloads/Final_Dev/an4/etc/an4_train.transcription)
my phonemes file contain the SIL phoneme and there is no any usage of SIL in transcription file. (I use sinhala unicode transcription file no english words)
You have empty lines and extra UTF-8 BOF symbols in the file an4_test.transcription. You need to remove them. Number of lines must match the lines in fileids file exactly.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
i used notepad ++ encoding convert to utf-8 only to remove UTF-8 BOF
I removed whitespaces and build acoustic model.
sentenses error rate 90% and word error rate 10%
when I use it on netbeans,
my code,
and System.out.println(result.getResult()); line prints, <s> බ�ලුව </s>
what are those charactors (බ�ලුව) still I am wrong ?
no error, warning or exception in console and I expected result like <s> අම්මා </s>
( I think " බ�ලුව " are ANSI values correspond to unicode-8 )
here I have attached my acoustic model files ,language model and dictionary file
This is just an output in wrong encoding. You can change console encoding to utf-8 or output to file and open with text editor with encoding specification. You can also modify encoding to the one you need before you output the result.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
SpeechResultresult=recognizer.getResult();StringresultText=result.getHypothesis();PrintWriterwriter=newPrintWriter("the-file-name.txt","UTF-8");writer.println("The first line: "+resultText);writer.println("The second line සිංහල");writer.close();
and here I have attache my output file.
and convert the encording using notepad++ did not make human readable format
As you told me I create system variables for -Dfile.encoding=UTF-8. and restart the computer. then is work fine. Thank you vary much for your Kind help.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
i want to know , use same sentenses list with different speakers trainig audio files help to improve accuracy? or use different sentenses list with different speakers help to improve accuracy?
or both ?
Because I have 150 sentences (one hour audio) and I decided to get the recorde clips with different users for above sentenses.
Last edit: Suranga Premakumara 2015-12-12
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
I got this error when I try to train the acoustic model
WARNING: This phone (SIL) occurs in the phonelist (/home/suranga/Downloads/Final_Dev/an4/etc/an4.phone), but not in any word in the transcription (/home/suranga/Downloads/Final_Dev/an4/etc/an4_train.transcription)my phonemes file contain the
SILphoneme and there is no any usage of SIL in transcription file. (I use sinhala unicode transcription file no english words)Link for my filler file image : https://lh3.googleusercontent.com/-d0WgJbv6xfA/VmfXncKuk2I/AAAAAAAABGc/ZOU3TYx2m5E/s957-Ic42/filler%252520%2525282%252529.png
Each transcription in your training transcription file must start with
<s>and end with</s>.Last edit: Nickolay V. Shmyrev 2015-12-09
When I add
tags to my transcription file it gives me compile error ,And my transcription file , filler file , dictionary, language model , field files are attached here.ahh it seems okay... mistakenly I put same id twice. sorry for disturb you.
this error is okay now. thak for the help.
Last edit: Suranga Premakumara 2015-12-09
After this changes I got error ,
in my sphinx_train.cfg file i changed,
$CFG_N_TIED_STATES = 1
$CFG_N_TIED_STATES = 2
$CFG_N_TIED_STATES = 4
$CFG_N_TIED_STATES = 8
$CFG_N_TIED_STATES = 200
$CFG_N_TIED_STATES = 1000
** but error still there.
I have 1 hour training data **
In my decode log file shows Warning called,
WARN: "ms_mgau.c", line 145: -topn argument (4) invalid or > #density codewords (1); set to latterLast edit: Suranga Premakumara 2015-12-09
This is just a warning, it should not affect results. Alignment failed for some other reason which you need to find in the logs.
You can share the acoustic model training folder in order to get help on this issue.
Here I have attached my acoustic model, (not include wav and feat folder )
and there is no error or warning in log files.
Last edit: Suranga Premakumara 2015-12-09
You have empty lines and extra UTF-8 BOF symbols in the file an4_test.transcription. You need to remove them. Number of lines must match the lines in fileids file exactly.
i used notepad ++ encoding convert to utf-8 only to remove UTF-8 BOF
I removed whitespaces and build acoustic model.
sentenses error rate 90% and word error rate 10%
when I use it on netbeans,
my code,
and System.out.println(result.getResult()); line prints,
<s> බ�ලුව </s>what are those charactors (බ�ලුව) still I am wrong ?
no error, warning or exception in console and I expected result like
<s> අම්මා </s>( I think " à¶¶à·?ලුව " are ANSI values correspond to unicode-8 )
here I have attached my acoustic model files ,language model and dictionary file
Last edit: Suranga Premakumara 2015-12-10
This is just an output in wrong encoding. You can change console encoding to utf-8 or output to file and open with text editor with encoding specification. You can also modify encoding to the one you need before you output the result.
I use this code segment to write output to file,
and here I have attache my output file.
and convert the encording using notepad++ did not make human readable format
Last edit: Suranga Premakumara 2015-12-10
You can add -Dfile.encoding=UTF-8 to java options when you run your code to force it use UTF-8.
As you told me I create system variables for -Dfile.encoding=UTF-8. and restart the computer. then is work fine. Thank you vary much for your Kind help.
i want to know , use same sentenses list with different speakers trainig audio files help to improve accuracy? or use different sentenses list with different speakers help to improve accuracy?
or both ?
Because I have 150 sentences (one hour audio) and I decided to get the recorde clips with different users for above sentenses.
Last edit: Suranga Premakumara 2015-12-12
You need to use different sentences