I'm trying to understand how to tune pocketsphinx to very quickly recognize commands from an extremely small vocabulary. I've built a custom LM with the handful of phrases I want to recognize: SIGNAL LEFT, SIGNAL RIGHT, SIGNAL OFF.
I'm running this on a Raspberry Pi, and the goal is to get recognition happening in about 1 second. Sometimes it hits that, but other times it takes as long as 4 seconds. Eventually I'll want to trigger off a keyphrase, since it needs to be "always on", but first I want to tune recognition performance as much as possible.
Hello Remi
You need to provide the exact command line or code you are using and the output log of pocketsphinx to get help on this issue.
According to the experience of our users, recognition should be pretty fast:
https://www.element14.com/community/roadTestReviews/2166/l/roadtest-review-a-raspberry-pi-3-model-b-review
You can start experimenting with keyphrase spotting mode directly, there is no need to spend time on language models if you are not going to use them.
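As a sketch, a keyphrase spotting run against the test recording could look like the following (the threshold value is only a starting point to tune, and the keyphrase and file name are assumptions):

```sh
# Spot the keyphrase "signal" in the test recording.
# Lower -kws_threshold to catch more (but possibly false) detections;
# raise it to reduce false alarms.
pocketsphinx_continuous -infile systemtests.wav \
    -keyphrase "signal" -kws_threshold 1e-20
```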
Right now, I'm mostly using default settings. I've tried turning different search modes on and off, like disabling fwdflat or bestpath, but didn't get any real improvements. This run uses a 19-second audio file that contains several phrases, including a few that don't contain the keyphrase; it can be downloaded here: http://jetpackshark.com/systemtests.wav
Ok, and what is your CPU exactly? What is the output of
cat /proc/cpuinfo
?
You can try with a JSGF grammar instead:
It should be faster than an LM.
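A minimal JSGF grammar covering just the three phrases from the question might look like this (the grammar name and file name signals.gram are assumptions):

```
#JSGF V1.0;
grammar signals;
public <command> = SIGNAL ( LEFT | RIGHT | OFF );
```

It can then be passed to the decoder with `-jsgf signals.gram` in place of the `-lm` option.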
You can also try with the semi-continuous model:
https://sourceforge.net/projects/cmusphinx/files/Acoustic%20and%20Language%20Models/US%20English/cmusphinx-en-us-semi-5.1.tar.gz/download
It is going to be faster as well.
processor : 0
model name : ARMv6-compatible processor rev 7 (v6l)
BogoMIPS : 697.95
Features : half thumb fastmult vfp edsp java tls
CPU implementer : 0x41
CPU architecture: 7
CPU variant : 0x0
CPU part : 0xb76
CPU revision : 7
Hardware : BCM2708
Revision : 0015
Serial : 00000000d620ebdb
JSGF makes a lot more sense, and with some tweaking it should be exactly what I need. Thanks!
And this is the current output:
JSGF and LM conflict with each other; you need to use either JSGF or LM, not both.
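In other words, the command line should specify only one search. For example, assuming a grammar file named signals.gram (a hypothetical name):

```sh
# JSGF only -- do not pass -lm at the same time
pocketsphinx_continuous -infile systemtests.wav -jsgf signals.gram
```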
Hey, that's much better!