In order to get a simple keyword-spotting solution with pocketsphinx, I
tried a simple grammar file. It seems
to give OK results for applications where robustness needn't be very
high. I would like to get some feedback on the
performance of this approach.
I've tried it with the hub4wsj_sc_8k model and pocketsphinx version 0.6.1. Please
note that the bestpath and nbest options need to be
turned off; otherwise it takes a very long time to decode.
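For readers who want to reproduce the setting above, here is a minimal sketch of how the two options might be switched off when calling pocketsphinx_batch. The helper name build_batch_command and all paths are hypothetical. The flag spellings (-bestpath, -nbest, -jsgf) are assumptions taken from the pocketsphinx command-line tools and should be checked against the help output of the installed version. A real run also needs the usual audio input options, which are left out here.

```python
def build_batch_command(hmm_dir, dict_path, jsgf_path, ctl_path, hyp_path):
    """Return an argument list for pocketsphinx_batch with bestpath
    rescoring and N-best output disabled (flag names are assumptions)."""
    return [
        "pocketsphinx_batch",
        "-hmm", hmm_dir,      # acoustic model directory, e.g. hub4wsj_sc_8k
        "-dict", dict_path,   # pronunciation dictionary
        "-jsgf", jsgf_path,   # JSGF grammar with keyword and garbage words
        "-ctl", ctl_path,     # control file listing the utterances
        "-hyp", hyp_path,     # output hypothesis file
        "-bestpath", "no",    # skip the lattice bestpath pass
        "-nbest", "0",        # do not generate N-best lists
    ]


if __name__ == "__main__":
    # Hypothetical paths, printed only; nothing is executed here.
    print(" ".join(build_batch_command(
        "model/hub4wsj_sc_8k", "kws.dic", "kws.jsgf", "test.ctl", "test.hyp")))
```

The list can be passed to subprocess.run once the remaining input options are filled in.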
My keyword is "HELLO SIMSIIM" and the grammar file is as follows:
Would like to get some feedback on performance of this approach.
It's OK, more or less what you need. You need to take care of garbage path
pruning here, though that is easier to do in KWS-specific code. We have our
own KWS implementation at Nexiwave based on pocketsphinx, but it's a different
ps_search class, not the fsg search.
I tested with a garbage model: I added all the phonemes to the dictionary and also to the JSGF grammar. Only phonemes are being recognized besides words. fsg_search takes a lot of time and hangs the system. Should I reduce the number of phones in the garbage model?
Anonymous - 2013-06-17
You should probably reduce the length of the processed
audio: split on shorter silences and process shorter amounts of speech at a time.
Last edit: Nickolay V. Shmyrev 2013-06-17
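One way to act on the advice above is to cut the audio into short pieces before decoding. The sketch below is only an illustration and is not pocketsphinx code: it uses a plain amplitude threshold on 16-bit PCM samples, ends a piece after a short silence, and also caps the piece length. The function name split_on_silence and all default values are assumptions chosen for the example.

```python
def split_on_silence(samples, rate, threshold=500, min_silence_s=0.2,
                     max_utt_s=5.0, frame_s=0.01):
    """Return (start, end) sample ranges of speech pieces.

    A piece ends after min_silence_s of quiet frames or when it reaches
    max_utt_s, whichever comes first. Trailing silence is not included.
    """
    frame = max(1, int(rate * frame_s))                 # samples per frame
    min_sil = max(1, int(round(min_silence_s / frame_s)))
    max_frames = max(1, int(round(max_utt_s / frame_s)))
    segments = []
    start = None        # frame index where the current piece began
    silent_run = 0      # consecutive quiet frames inside the piece
    n_frames = len(samples) // frame
    for i in range(n_frames):
        chunk = samples[i * frame:(i + 1) * frame]
        loud = max(abs(s) for s in chunk) >= threshold
        if start is None:
            if loud:
                start, silent_run = i, 0
            continue
        silent_run = 0 if loud else silent_run + 1
        too_long = (i + 1 - start) >= max_frames
        if silent_run >= min_sil or too_long:
            end = i + 1 - silent_run                    # drop trailing silence
            segments.append((start * frame, end * frame))
            start, silent_run = None, 0
    if start is not None:                               # flush the last piece
        end = n_frames - silent_run
        segments.append((start * frame, end * frame))
    return segments
```

Each returned range can then be decoded as its own utterance, so the search never has to process a long stretch of speech in one go. Lowering min_silence_s gives shorter pieces at the risk of cutting inside a keyword.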
Last edit: Nickolay V. Shmyrev 2013-09-16
In the grammar, G1 to G39 are garbage words that are mapped in the dictionary to the 39 base phonemes of the model.
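The grammar file itself is not shown in this thread, so the following is only a guess at its general shape, not the poster's actual file. The sketch generates one dictionary line per garbage word and a JSGF grammar that loops over either a garbage word or the keyword. The phone list is the standard 39-phone CMU set, assumed to match the model. The rule names and the grammar name kws are made up for the example, and dictionary entries for the keyword itself are not included.

```python
# The 39 base phones of the CMU set (assumed to match the acoustic model).
PHONES = ("AA AE AH AO AW AY B CH D DH EH ER EY F G HH IH IY JH K L M N NG "
          "OW OY P R S SH T TH UH UW V W Y Z ZH").split()


def garbage_dictionary(phones=PHONES):
    """Return dictionary lines that map G1..Gn to one phone each."""
    return ["G%d %s" % (i, p) for i, p in enumerate(phones, start=1)]


def keyword_grammar(keyword, n_garbage):
    """Return JSGF text: a loop over garbage words or the keyword."""
    garbage = " | ".join("G%d" % i for i in range(1, n_garbage + 1))
    lines = [
        "#JSGF V1.0;",
        "grammar kws;",
        "public <kws> = ( <garbage> | <keyword> )+;",
        "<keyword> = %s;" % keyword,
        "<garbage> = %s;" % garbage,
    ]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print("\n".join(garbage_dictionary()))
    print(keyword_grammar("HELLO SIMSIIM", len(PHONES)))
```

With a loop like this every stretch of non-keyword speech has to be explained by single-phone garbage words, which is why the output contains phone labels next to the keyword.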