My breath is being picked up as a "misunderstanding" instead of being filtered out and ignored. I checked the noisedict files. In \hmm\wsj1\ the file contains:
<s> SIL
</s> SIL
<sil> SIL
++NOISE++ +NOISE+
++BREATH++ +BREATH+
++SMACK++ +SMACK+
The noisedict files in \model\communicator\ and \libs\Communicator_semi_40.cd_semi_6000\ contain the following:
++AE++ AE
++AH++ AA
++BACKGROUND++ +BACKGROUND+
++BREATH++ +BREATH+
++COUGH++ +COUGH+
++EH++ EH
++ER++ ER
++LAUGH++ +LAUGH+
++MM++ M
++MUMBLE++ +BACKGROUND+
++NOISE++ +NOISE+
++OH++ OW
++SMACK++ +SMACK+
++UH++ +UH+
++UH_NOISE++ +UH+
++UM++ +UM+
++UM_NOISE++ +UM+
I see ++BREATH++ +BREATH+ in all of them, but my breath is still being picked up. Why?
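For anyone who wants to repeat the check above without reading the files by eye, here is a minimal sketch in Python. The function names (parse_noisedict, files_missing) are made up for illustration, and the two strings are just the file contents quoted in this post. Note that an entry being present only means the decoder is allowed to hypothesize that filler; it does not by itself show that the filler model matches a given sound.

```python
def parse_noisedict(text):
    """Map each filler word to its list of phones (one entry per line)."""
    entries = {}
    for line in text.splitlines():
        parts = line.split()
        if len(parts) < 2:
            continue  # skip blank or malformed lines
        entries[parts[0]] = parts[1:]
    return entries


def files_missing(word, dicts):
    """Return the names of the dictionaries that do not define `word`."""
    return [name for name, entries in dicts.items() if word not in entries]


# Contents of the two noisedict variants as quoted in the post.
WSJ1 = """\
<s> SIL
</s> SIL
<sil> SIL
++NOISE++ +NOISE+
++BREATH++ +BREATH+
++SMACK++ +SMACK+
"""

COMMUNICATOR = """\
++AE++ AE
++AH++ AA
++BACKGROUND++ +BACKGROUND+
++BREATH++ +BREATH+
++COUGH++ +COUGH+
++EH++ EH
++ER++ ER
++LAUGH++ +LAUGH+
++MM++ M
++MUMBLE++ +BACKGROUND+
++NOISE++ +NOISE+
++OH++ OW
++SMACK++ +SMACK+
++UH++ +UH+
++UH_NOISE++ +UH+
++UM++ +UM+
++UM_NOISE++ +UM+
"""

dicts = {
    "wsj1": parse_noisedict(WSJ1),
    "communicator": parse_noisedict(COMMUNICATOR),
}

# An empty list means every file defines the filler.
print(files_missing("++BREATH++", dicts))
```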
Not all the settings, just this one: "fillprob".
While searching for the answer, the decoder tries to insert fillers after each word. The insertion is penalized with a probability, so you can tune how often fillers appear. With a large filler probability, the decoder is more likely to choose a filler than a word from the grammar. With a smaller probability, it prefers grammar words.
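To make the trade-off described above concrete, here is a toy sketch in Python. It is not pocketsphinx's actual search: the function name pick_hypothesis and all the scores are invented for illustration, and a real decoder also applies language weights and considers many competing paths. It only shows how, in the log domain, a larger filler probability means a smaller penalty on the filler path.

```python
import math


def pick_hypothesis(word_acoustic, filler_acoustic, word_prob, fillprob):
    """Return 'filler' or 'word', whichever has the higher total log score.

    Acoustic scores are log-likelihoods (less negative means a better fit).
    word_prob is the probability the grammar assigns to the word; fillprob
    is the filler insertion probability. Both act as penalties once logged.
    """
    word_total = word_acoustic + math.log(word_prob)
    filler_total = filler_acoustic + math.log(fillprob)
    return "filler" if filler_total > word_total else "word"


# Invented scores: the sound fits the filler model slightly better
# than it fits the grammar word.
WORD_ACOUSTIC = -120.0
FILLER_ACOUSTIC = -115.0
WORD_PROB = 0.1

for fillprob in (1e-8, 1e-4, 0.1):
    choice = pick_hypothesis(WORD_ACOUSTIC, FILLER_ACOUSTIC, WORD_PROB, fillprob)
    print(fillprob, choice)
```

With these made-up numbers, a tiny filler probability lets the grammar word win even though the filler fits better acoustically, while 0.1 lets the filler win.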
Well, your breath probably does not sound like the breath the model was trained on. To get an idea, you would need to find out how breath sounds in the training database (WSJ, which is unfortunately not available for free). If you ask, David will probably give you examples. You can also try adjusting the filler probability, -fillprob. Larger values give fillers more preference.
First, who is David and how do I get hold of him?
Second, how would I set -fillprob? I'm on Windows XP, if that matters.
Thanks.
David is the author of pocketsphinx. See this post:
https://sourceforge.net/forum/message.php?msg_id=5748449
About -fillprob: in freeswitch, find the lines where the pocketsphinx arguments are set and add "-fillprob", "0.1" there.
I tried adding "-fillprob", "0.1", but that seemed to make phrase recognition worse.
What do all these settings mean?
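Since 0.1 made recognition worse, one option is to try a range of values and see where breath gets absorbed without hurting phrase recognition. Below is a small Python sketch that generates log-spaced candidates to try one at a time. The helper names are hypothetical, the upper end 0.1 is the value tried in this thread, and the lower end 1e-8 is only an assumed "very small" starting point, not a recommendation.

```python
import math


def log_spaced(low, high, count):
    """Return `count` values from low to high, evenly spaced in log10."""
    if count < 2:
        raise ValueError("count must be at least 2")
    lo, hi = math.log10(low), math.log10(high)
    step = (hi - lo) / (count - 1)
    return [10 ** (lo + i * step) for i in range(count)]


def as_arg_pairs(values):
    """Format each value as a ("-fillprob", "<value>") string pair."""
    return [("-fillprob", format(v, "g")) for v in values]


# Assumed sweep range: 1e-8 (assumption) up to 0.1 (value from the thread).
candidates = log_spaced(1e-8, 0.1, 8)
for flag, value in as_arg_pairs(candidates):
    print(flag, value)
```

Each pair would replace the "-fillprob", "0.1" entry in turn, followed by a rebuild and a test with the same phrases and the same breathing.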