Menu

Small vocabulary, tuning and performance options?

Help
2016-06-16
2016-06-16
  • Remy Porter

    Remy Porter - 2016-06-16

    I'm trying to understand how to tune pocketsphinx to very quickly recognize commands from an extremely small vocabulary. I've built a custom LMM with the handful of phrases I want to recognize: SIGNAL LEFT, SIGNAL RIGHT, SIGNAL OFF.

    I'm running this on a Raspberry Pi, and the goal is to get recognition happening in about 1 second. Sometimes, it can hit that, but other times it takes as much as 4 seconds. Eventually, I'll want to trigger off a keyphrase, since it needs to be "always on", but first I want to tune recognition performance as much as possible.

     
    • Nickolay V. Shmyrev

      Hello Remi

      You need to provide exact command line or code you are using and the output log of pocketsphinx to get help on this issue.

      According to experience of our users recognition should be pretty fast:

      https://www.element14.com/community/roadTestReviews/2166/l/roadtest-review-a-raspberry-pi-3-model-b-review

      You can start experimenting with keyphrase spotting mode directly, there is no need to spend time on language models if you are not going to use them.

       
  • Remy Porter

    Remy Porter - 2016-06-16

    Right now, I'm mostly using default settings. I've tried turning on and off different kinds of search modes, like disabling fwdflat or bestpath, but didn't get any real improvements. This run is through a 19 second long audio file that contains several phrases, including a few that don't contain the keyphrase, which can be downloaded here: http://jetpackshark.com/systemtests.wav

    $ pocketsphinx_continuous -keyphrase "SIGNAL" -samprate 44100 -nfft 2048 -lm ./lmm/9014.lm -dict ./lmm/9014.dic -infile systemtests.wav
    INFO: pocketsphinx.c(152): Parsed model-specific feature parameters from /usr/local/share/pocketsphinx/model/en-us/en-us/feat.params
    Current configuration:
    [NAME]          [DEFLT]     [VALUE]
    -agc            none        none
    -agcthresh      2.0     2.000000e+00
    -allphone
    -allphone_ci        no      no
    -alpha          0.97        9.700000e-01
    -ascale         20.0        2.000000e+01
    -aw         1       1
    -backtrace      no      no
    -beam           1e-48       1.000000e-48
    -bestpath       yes     yes
    -bestpathlw     9.5     9.500000e+00
    -ceplen         13      13
    -cmn            current     current
    -cmninit        8.0     40,3,-1
    -compallsen     no      no
    -debug                  0
    -dict                   ./lmm/9014.dic
    -dictcase       no      no
    -dither         no      no
    -doublebw       no      no
    -ds         1       1
    -fdict
    -feat           1s_c_d_dd   1s_c_d_dd
    -featparams
    -fillprob       1e-8        1.000000e-08
    -frate          100     100
    -fsg
    -fsgusealtpron      yes     yes
    -fsgusefiller       yes     yes
    -fwdflat        yes     yes
    -fwdflatbeam        1e-64       1.000000e-64
    -fwdflatefwid       4       4
    -fwdflatlw      8.5     8.500000e+00
    -fwdflatsfwin       25      25
    -fwdflatwbeam       7e-29       7.000000e-29
    -fwdtree        yes     yes
    -hmm                    /usr/local/share/pocketsphinx/model/en-us/en-us
    -input_endian       little      little
    -jsgf
    -keyphrase              SIGNAL
    -kws
    -kws_delay      10      10
    -kws_plp        1e-1        1.000000e-01
    -kws_threshold      1       1.000000e+00
    -latsize        5000        5000
    -lda
    -ldadim         0       0
    -lifter         0       22
    -lm                 ./lmm/9014.lm
    -lmctl
    -lmname
    -logbase        1.0001      1.000100e+00
    -logfn
    -logspec        no      no
    -lowerf         133.33334   1.300000e+02
    -lpbeam         1e-40       1.000000e-40
    -lponlybeam     7e-29       7.000000e-29
    -lw         6.5     6.500000e+00
    -maxhmmpf       30000       30000
    -maxwpf         -1      -1
    -mdef
    -mean
    -mfclogdir
    -min_endfr      0       0
    -mixw
    -mixwfloor      0.0000001   1.000000e-07
    -mllr
    -mmap           yes     yes
    -ncep           13      13
    -nfft           512     2048
    -nfilt          40      25
    -nwpen          1.0     1.000000e+00
    -pbeam          1e-48       1.000000e-48
    -pip            1.0     1.000000e+00
    -pl_beam        1e-10       1.000000e-10
    -pl_pbeam       1e-10       1.000000e-10
    -pl_pip         1.0     1.000000e+00
    -pl_weight      3.0     3.000000e+00
    -pl_window      5       5
    -rawlogdir
    -remove_dc      no      no
    -remove_noise       yes     yes
    -remove_silence     yes     yes
    -round_filters      yes     yes
    -samprate       16000       4.410000e+04
    -seed           -1      -1
    -sendump
    -senlogdir
    -senmgau
    -silprob        0.005       5.000000e-03
    -smoothspec     no      no
    -svspec                 0-12/13-25/26-38
    -tmat
    -tmatfloor      0.0001      1.000000e-04
    -topn           4       4
    -topn_beam      0       0
    -toprule
    -transform      legacy      dct
    -unit_area      yes     yes
    -upperf         6855.4976   6.800000e+03
    -uw         1.0     1.000000e+00
    -vad_postspeech     50      50
    -vad_prespeech      20      20
    -vad_startspeech    10      10
    -vad_threshold      2.0     2.000000e+00
    -var
    -varfloor       0.0001      1.000000e-04
    -varnorm        no      no
    -verbose        no      no
    -warp_params
    -warp_type      inverse_linear  inverse_linear
    -wbeam          7e-29       7.000000e-29
    -wip            0.65        6.500000e-01
    -wlen           0.025625    2.562500e-02
    
    INFO: feat.c(715): Initializing feature stream to type: '1s_c_d_dd', ceplen=13, CMN='current', VARNORM='no', AGC='none'
    INFO: cmn.c(143): mean[0]= 12.00, mean[1..12]= 0.0
    INFO: acmod.c(164): Using subvector specification 0-12/13-25/26-38
    INFO: mdef.c(518): Reading model definition: /usr/local/share/pocketsphinx/model/en-us/en-us/mdef
    INFO: mdef.c(531): Found byte-order mark BMDF, assuming this is a binary mdef file
    INFO: bin_mdef.c(336): Reading binary model definition: /usr/local/share/pocketsphinx/model/en-us/en-us/mdef
    INFO: bin_mdef.c(516): 42 CI-phone, 137053 CD-phone, 3 emitstate/phone, 126 CI-sen, 5126 Sen, 29324 Sen-Seq
    INFO: tmat.c(206): Reading HMM transition probability matrices: /usr/local/share/pocketsphinx/model/en-us/en-us/transition_matrices
    INFO: acmod.c(117): Attempting to use PTM computation module
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /usr/local/share/pocketsphinx/model/en-us/en-us/means
    INFO: ms_gauden.c(292): 42 codebook, 3 feature, size:
    INFO: ms_gauden.c(294):  128x13
    INFO: ms_gauden.c(294):  128x13
    INFO: ms_gauden.c(294):  128x13
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /usr/local/share/pocketsphinx/model/en-us/en-us/variances
    INFO: ms_gauden.c(292): 42 codebook, 3 feature, size:
    INFO: ms_gauden.c(294):  128x13
    INFO: ms_gauden.c(294):  128x13
    INFO: ms_gauden.c(294):  128x13
    INFO: ms_gauden.c(354): 222 variance values floored
    INFO: ptm_mgau.c(476): Loading senones from dump file /usr/local/share/pocketsphinx/model/en-us/en-us/sendump
    INFO: ptm_mgau.c(500): BEGIN FILE FORMAT DESCRIPTION
    INFO: ptm_mgau.c(563): Rows: 128, Columns: 5126
    INFO: ptm_mgau.c(595): Using memory-mapped I/O for senones
    INFO: ptm_mgau.c(835): Maximum top-N: 4
    INFO: phone_loop_search.c(114): State beam -225 Phone exit beam -225 Insertion penalty 0
    INFO: dict.c(320): Allocating 4106 * 20 bytes (80 KiB) for word entries
    INFO: dict.c(333): Reading main dictionary: ./lmm/9014.dic
    INFO: dict.c(213): Allocated 0 KiB for strings, 0 KiB for phones
    INFO: dict.c(336): 5 words read
    INFO: dict.c(358): Reading filler dictionary: /usr/local/share/pocketsphinx/model/en-us/en-us/noisedict
    INFO: dict.c(213): Allocated 0 KiB for strings, 0 KiB for phones
    INFO: dict.c(361): 5 words read
    INFO: dict2pid.c(396): Building PID tables for dictionary
    INFO: dict2pid.c(406): Allocating 42^3 * 2 bytes (144 KiB) for word-initial triphones
    INFO: dict2pid.c(132): Allocated 21336 bytes (20 KiB) for word-final triphones
    INFO: dict2pid.c(196): Allocated 21336 bytes (20 KiB) for single-phone word triphones
    INFO: kws_search.c(420): KWS(beam: -1080, plp: -23, default threshold 0, delay 10)
    INFO: ngram_model_trie.c(347): Trying to read LM in trie binary format
    INFO: ngram_model_trie.c(358): Header doesn't match
    INFO: ngram_model_trie.c(176): Trying to read LM in arpa format
    INFO: ngram_model_trie.c(192): LM of order 3
    INFO: ngram_model_trie.c(194): #1-grams: 7
    INFO: ngram_model_trie.c(194): #2-grams: 10
    INFO: ngram_model_trie.c(194): #3-grams: 5
    INFO: lm_trie.c(473): Training quantizer
    INFO: lm_trie.c(481): Building LM trie
    INFO: ngram_search_fwdtree.c(99): 5 unique initial diphones
    INFO: ngram_search_fwdtree.c(148): 0 root, 0 non-root channels, 6 single-phone words
    INFO: ngram_search_fwdtree.c(186): Creating search tree
    INFO: ngram_search_fwdtree.c(192): before: 0 root, 0 non-root channels, 6 single-phone words
    INFO: ngram_search_fwdtree.c(326): after: max nonroot chan increased to 140
    INFO: ngram_search_fwdtree.c(339): after: 5 root, 12 non-root channels, 5 single-phone words
    INFO: ngram_search_fwdflat.c(157): fwdflat: min_ef_width = 4, max_sf_win = 25
    INFO: kws_search.c(467): TOTAL kws 0.00 CPU nan xRT
    INFO: kws_search.c(470): TOTAL kws 0.00 wall nan xRT
    INFO: continuous.c(307): pocketsphinx_continuous COMPILED ON: Jun  2 2016, AT: 23:42:13
    
    INFO: cmn_prior.c(131): cmn_prior_update: from < 40.00  3.00 -1.00  0.00  0.00  0.00  0.00  0.00  0.00  0.00  0.00  0.00  0.00 >
    INFO: cmn_prior.c(149): cmn_prior_update: to   < 40.56 -12.63  2.86  3.19 -10.88  8.73 -13.00  7.11 -4.79 -3.85  5.89 -2.73 -2.47 >
    INFO: ngram_search_fwdtree.c(1553):      937 words recognized (4/fr)
    INFO: ngram_search_fwdtree.c(1555):    14225 senones evaluated (57/fr)
    INFO: ngram_search_fwdtree.c(1559):     5761 channels searched (23/fr), 1051 1st, 3468 last
    INFO: ngram_search_fwdtree.c(1562):     1069 words for which last channels evaluated (4/fr)
    INFO: ngram_search_fwdtree.c(1564):      107 candidate words for entering last phone (0/fr)
    INFO: ngram_search_fwdtree.c(1567): fwdtree 3.91 CPU 1.577 xRT
    INFO: ngram_search_fwdtree.c(1570): fwdtree 4.10 wall 1.652 xRT
    INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 4 words
    INFO: ngram_search_fwdflat.c(948):      910 words recognized (4/fr)
    INFO: ngram_search_fwdflat.c(950):    11000 senones evaluated (44/fr)
    INFO: ngram_search_fwdflat.c(952):     5156 channels searched (20/fr)
    INFO: ngram_search_fwdflat.c(954):     1344 words searched (5/fr)
    INFO: ngram_search_fwdflat.c(957):      176 word transitions (0/fr)
    INFO: ngram_search_fwdflat.c(960): fwdflat 1.40 CPU 0.565 xRT
    INFO: ngram_search_fwdflat.c(963): fwdflat 1.47 wall 0.592 xRT
    INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.202
    INFO: ngram_search.c(1279): Eliminated 1 nodes before end node
    INFO: ngram_search.c(1384): Lattice has 255 nodes, 288 links
    INFO: ps_lattice.c(1380): Bestpath score: -3046
    INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:202:246) = -176836
    INFO: ps_lattice.c(1441): Joint P(O,S) = -197475 P(S|O) = -20639
    INFO: ngram_search.c(875): bestpath 0.01 CPU 0.004 xRT
    INFO: ngram_search.c(878): bestpath 0.02 wall 0.009 xRT
    SIGNAL LEFT
    INFO: cmn_prior.c(131): cmn_prior_update: from < 40.56 -12.63  2.86  3.19 -10.88  8.73 -13.00  7.11 -4.79 -3.85  5.89 -2.73 -2.47 >
    INFO: cmn_prior.c(149): cmn_prior_update: to   < 43.22 -11.16  0.69  6.22 -10.22  5.94 -14.83  7.24 -4.54 -7.18  6.21 -0.54 -2.58 >
    INFO: ngram_search_fwdtree.c(1553):      573 words recognized (3/fr)
    INFO: ngram_search_fwdtree.c(1555):    10698 senones evaluated (64/fr)
    INFO: ngram_search_fwdtree.c(1559):     4589 channels searched (27/fr), 659 1st, 3075 last
    INFO: ngram_search_fwdtree.c(1562):      688 words for which last channels evaluated (4/fr)
    INFO: ngram_search_fwdtree.c(1564):       75 candidate words for entering last phone (0/fr)
    INFO: ngram_search_fwdtree.c(1567): fwdtree 2.79 CPU 1.661 xRT
    INFO: ngram_search_fwdtree.c(1570): fwdtree 2.92 wall 1.741 xRT
    INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 5 words
    INFO: ngram_search_fwdflat.c(948):      513 words recognized (3/fr)
    INFO: ngram_search_fwdflat.c(950):    10228 senones evaluated (61/fr)
    INFO: ngram_search_fwdflat.c(952):     5051 channels searched (30/fr)
    INFO: ngram_search_fwdflat.c(954):      915 words searched (5/fr)
    INFO: ngram_search_fwdflat.c(957):      199 word transitions (1/fr)
    INFO: ngram_search_fwdflat.c(960): fwdflat 1.11 CPU 0.661 xRT
    INFO: ngram_search_fwdflat.c(963): fwdflat 1.17 wall 0.696 xRT
    INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.123
    INFO: ngram_search.c(1279): Eliminated 1 nodes before end node
    INFO: ngram_search.c(1384): Lattice has 149 nodes, 86 links
    INFO: ps_lattice.c(1380): Bestpath score: -2489
    INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:123:166) = -161315
    INFO: ps_lattice.c(1441): Joint P(O,S) = -174486 P(S|O) = -13171
    INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
    INFO: ngram_search.c(878): bestpath 0.01 wall 0.008 xRT
    SIGNAL RIGHT
    INFO: cmn_prior.c(131): cmn_prior_update: from < 43.22 -11.16  0.69  6.22 -10.22  5.94 -14.83  7.24 -4.54 -7.18  6.21 -0.54 -2.58 >
    INFO: cmn_prior.c(149): cmn_prior_update: to   < 43.85 -11.07  1.01  4.70 -13.55  4.51 -12.91  8.63 -4.98 -7.35  6.83 -0.93 -2.10 >
    INFO: ngram_search_fwdtree.c(1553):      806 words recognized (4/fr)
    INFO: ngram_search_fwdtree.c(1555):    11810 senones evaluated (61/fr)
    INFO: ngram_search_fwdtree.c(1559):     4829 channels searched (25/fr), 806 1st, 2893 last
    INFO: ngram_search_fwdtree.c(1562):      895 words for which last channels evaluated (4/fr)
    INFO: ngram_search_fwdtree.c(1564):      123 candidate words for entering last phone (0/fr)
    INFO: ngram_search_fwdtree.c(1567): fwdtree 3.19 CPU 1.653 xRT
    INFO: ngram_search_fwdtree.c(1570): fwdtree 3.35 wall 1.737 xRT
    INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 4 words
    INFO: ngram_search_fwdflat.c(948):      804 words recognized (4/fr)
    INFO: ngram_search_fwdflat.c(950):    11123 senones evaluated (58/fr)
    INFO: ngram_search_fwdflat.c(952):     5406 channels searched (28/fr)
    INFO: ngram_search_fwdflat.c(954):     1124 words searched (5/fr)
    INFO: ngram_search_fwdflat.c(957):      162 word transitions (0/fr)
    INFO: ngram_search_fwdflat.c(960): fwdflat 1.28 CPU 0.663 xRT
    INFO: ngram_search_fwdflat.c(963): fwdflat 1.34 wall 0.693 xRT
    INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.153
    INFO: ngram_search.c(1279): Eliminated 1 nodes before end node
    INFO: ngram_search.c(1384): Lattice has 198 nodes, 371 links
    INFO: ps_lattice.c(1380): Bestpath score: -3680
    INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:153:191) = -195067
    INFO: ps_lattice.c(1441): Joint P(O,S) = -228604 P(S|O) = -33537
    INFO: ngram_search.c(875): bestpath 0.01 CPU 0.005 xRT
    INFO: ngram_search.c(878): bestpath 0.02 wall 0.012 xRT
    SIGNAL STOP
    INFO: cmn_prior.c(99): cmn_prior_update: from < 43.85 -11.07  1.01  4.70 -13.55  4.51 -12.91  8.63 -4.98 -7.35  6.83 -0.93 -2.10 >
    INFO: cmn_prior.c(116): cmn_prior_update: to   < 44.69 -11.43  2.95  2.25 -14.15  6.53 -11.29  7.80 -4.75 -6.60  6.63 -0.94 -0.22 >
    INFO: cmn_prior.c(131): cmn_prior_update: from < 44.69 -11.43  2.95  2.25 -14.15  6.53 -11.29  7.80 -4.75 -6.60  6.63 -0.94 -0.22 >
    INFO: cmn_prior.c(149): cmn_prior_update: to   < 43.01 -10.79  3.14  1.15 -13.50  7.19 -11.42  7.55 -4.59 -6.25  5.88 -1.18  0.00 >
    INFO: ngram_search_fwdtree.c(1553):      822 words recognized (4/fr)
    INFO: ngram_search_fwdtree.c(1555):    13733 senones evaluated (64/fr)
    INFO: ngram_search_fwdtree.c(1559):     6001 channels searched (28/fr), 859 1st, 3862 last
    INFO: ngram_search_fwdtree.c(1562):      961 words for which last channels evaluated (4/fr)
    INFO: ngram_search_fwdtree.c(1564):      114 candidate words for entering last phone (0/fr)
    INFO: ngram_search_fwdtree.c(1567): fwdtree 3.42 CPU 1.598 xRT
    INFO: ngram_search_fwdtree.c(1570): fwdtree 3.60 wall 1.680 xRT
    INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 5 words
    INFO: ngram_search_fwdflat.c(948):      760 words recognized (4/fr)
    INFO: ngram_search_fwdflat.c(950):    13309 senones evaluated (62/fr)
    INFO: ngram_search_fwdflat.c(952):     6575 channels searched (30/fr)
    INFO: ngram_search_fwdflat.c(954):     1279 words searched (5/fr)
    INFO: ngram_search_fwdflat.c(957):      205 word transitions (0/fr)
    INFO: ngram_search_fwdflat.c(960): fwdflat 1.43 CPU 0.668 xRT
    INFO: ngram_search_fwdflat.c(963): fwdflat 1.50 wall 0.701 xRT
    INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.179
    INFO: ngram_search.c(1279): Eliminated 1 nodes before end node
    INFO: ngram_search.c(1384): Lattice has 163 nodes, 263 links
    INFO: ps_lattice.c(1380): Bestpath score: -5075
    INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:179:212) = -271550
    INFO: ps_lattice.c(1441): Joint P(O,S) = -309344 P(S|O) = -37794
    INFO: ngram_search.c(875): bestpath 0.01 CPU 0.005 xRT
    INFO: ngram_search.c(878): bestpath 0.01 wall 0.004 xRT
    SIGNAL STOP
    INFO: cmn_prior.c(131): cmn_prior_update: from < 43.01 -10.79  3.14  1.15 -13.50  7.19 -11.42  7.55 -4.59 -6.25  5.88 -1.18  0.00 >
    INFO: cmn_prior.c(149): cmn_prior_update: to   < 45.71 -9.70  2.46 -0.12 -16.31  5.28 -10.94  7.86 -4.96 -6.55  5.80 -1.64  0.16 >
    INFO: ngram_search_fwdtree.c(1553):      774 words recognized (5/fr)
    INFO: ngram_search_fwdtree.c(1555):    15391 senones evaluated (96/fr)
    INFO: ngram_search_fwdtree.c(1559):     7631 channels searched (47/fr), 715 1st, 5665 last
    INFO: ngram_search_fwdtree.c(1562):      874 words for which last channels evaluated (5/fr)
    INFO: ngram_search_fwdtree.c(1564):      231 candidate words for entering last phone (1/fr)
    INFO: ngram_search_fwdtree.c(1567): fwdtree 2.73 CPU 1.706 xRT
    INFO: ngram_search_fwdtree.c(1570): fwdtree 2.87 wall 1.792 xRT
    INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 6 words
    INFO: ngram_search_fwdflat.c(948):      774 words recognized (5/fr)
    INFO: ngram_search_fwdflat.c(950):    16958 senones evaluated (106/fr)
    INFO: ngram_search_fwdflat.c(952):     9729 channels searched (60/fr)
    INFO: ngram_search_fwdflat.c(954):     1191 words searched (7/fr)
    INFO: ngram_search_fwdflat.c(957):      305 word transitions (1/fr)
    INFO: ngram_search_fwdflat.c(960): fwdflat 1.13 CPU 0.706 xRT
    INFO: ngram_search_fwdflat.c(963): fwdflat 1.19 wall 0.746 xRT
    INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.134
    INFO: ngram_search.c(1279): Eliminated 2 nodes before end node
    INFO: ngram_search.c(1384): Lattice has 175 nodes, 227 links
    INFO: ps_lattice.c(1380): Bestpath score: -4824
    INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:134:158) = -263021
    INFO: ps_lattice.c(1441): Joint P(O,S) = -296546 P(S|O) = -33525
    INFO: ngram_search.c(875): bestpath 0.01 CPU 0.006 xRT
    INFO: ngram_search.c(878): bestpath 0.02 wall 0.014 xRT
    SIGNAL STOP
    INFO: cmn_prior.c(99): cmn_prior_update: from < 45.71 -9.70  2.46 -0.12 -16.31  5.28 -10.94  7.86 -4.96 -6.55  5.80 -1.64  0.16 >
    INFO: cmn_prior.c(116): cmn_prior_update: to   < 46.37 -10.45  3.40  0.33 -15.60  5.64 -11.92  7.43 -4.96 -6.99  5.90 -1.78 -0.07 >
    INFO: cmn_prior.c(131): cmn_prior_update: from < 46.37 -10.45  3.40  0.33 -15.60  5.64 -11.92  7.43 -4.96 -6.99  5.90 -1.78 -0.07 >
    INFO: cmn_prior.c(149): cmn_prior_update: to   < 43.67 -9.67  3.24 -0.68 -16.17  4.57 -10.66  9.09 -6.51 -6.81  5.74 -1.41  0.10 >
    INFO: ngram_search_fwdtree.c(1553):      896 words recognized (4/fr)
    INFO: ngram_search_fwdtree.c(1555):    16099 senones evaluated (78/fr)
    INFO: ngram_search_fwdtree.c(1559):     7374 channels searched (35/fr), 920 1st, 4902 last
    INFO: ngram_search_fwdtree.c(1562):     1019 words for which last channels evaluated (4/fr)
    INFO: ngram_search_fwdtree.c(1564):      209 candidate words for entering last phone (1/fr)
    INFO: ngram_search_fwdtree.c(1567): fwdtree 3.33 CPU 1.617 xRT
    INFO: ngram_search_fwdtree.c(1570): fwdtree 3.50 wall 1.701 xRT
    INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 6 words
    INFO: ngram_search_fwdflat.c(948):      915 words recognized (4/fr)
    INFO: ngram_search_fwdflat.c(950):    18979 senones evaluated (92/fr)
    INFO: ngram_search_fwdflat.c(952):    10767 channels searched (52/fr)
    INFO: ngram_search_fwdflat.c(954):     1434 words searched (6/fr)
    INFO: ngram_search_fwdflat.c(957):      336 word transitions (1/fr)
    INFO: ngram_search_fwdflat.c(960): fwdflat 1.38 CPU 0.670 xRT
    INFO: ngram_search_fwdflat.c(963): fwdflat 1.45 wall 0.703 xRT
    INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.145
    INFO: ngram_search.c(1279): Eliminated 1 nodes before end node
    INFO: ngram_search.c(1384): Lattice has 182 nodes, 185 links
    INFO: ps_lattice.c(1380): Bestpath score: -4983
    INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:145:204) = -279027
    INFO: ps_lattice.c(1441): Joint P(O,S) = -322759 P(S|O) = -43732
    INFO: ngram_search.c(875): bestpath 0.01 CPU 0.005 xRT
    INFO: ngram_search.c(878): bestpath 0.01 wall 0.007 xRT
    SIGNAL STOP
    INFO: cmn_prior.c(131): cmn_prior_update: from < 43.67 -9.67  3.24 -0.68 -16.17  4.57 -10.66  9.09 -6.51 -6.81  5.74 -1.41  0.10 >
    INFO: cmn_prior.c(149): cmn_prior_update: to   < 44.69 -10.15  3.38  0.56 -14.73  4.29 -13.18  8.64 -6.64 -6.86  6.11 -1.27 -0.19 >
    INFO: ngram_search_fwdtree.c(1553):      554 words recognized (3/fr)
    INFO: ngram_search_fwdtree.c(1555):    11263 senones evaluated (62/fr)
    INFO: ngram_search_fwdtree.c(1559):     4799 channels searched (26/fr), 687 1st, 3192 last
    INFO: ngram_search_fwdtree.c(1562):      701 words for which last channels evaluated (3/fr)
    INFO: ngram_search_fwdtree.c(1564):       93 candidate words for entering last phone (0/fr)
    INFO: ngram_search_fwdtree.c(1567): fwdtree 2.95 CPU 1.621 xRT
    INFO: ngram_search_fwdtree.c(1570): fwdtree 3.11 wall 1.707 xRT
    INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 4 words
    INFO: ngram_search_fwdflat.c(948):      500 words recognized (3/fr)
    INFO: ngram_search_fwdflat.c(950):    10245 senones evaluated (56/fr)
    INFO: ngram_search_fwdflat.c(952):     4888 channels searched (26/fr)
    INFO: ngram_search_fwdflat.c(954):      904 words searched (4/fr)
    INFO: ngram_search_fwdflat.c(957):      162 word transitions (0/fr)
    INFO: ngram_search_fwdflat.c(960): fwdflat 1.20 CPU 0.659 xRT
    INFO: ngram_search_fwdflat.c(963): fwdflat 1.26 wall 0.695 xRT
    INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.138
    INFO: ngram_search.c(1279): Eliminated 1 nodes before end node
    INFO: ngram_search.c(1384): Lattice has 143 nodes, 62 links
    INFO: ps_lattice.c(1380): Bestpath score: -2666
    INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:138:180) = -173267
    INFO: ps_lattice.c(1441): Joint P(O,S) = -181346 P(S|O) = -8079
    INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
    INFO: ngram_search.c(878): bestpath 0.01 wall 0.007 xRT
    SIGNAL RIGHT
    INFO: cmn_prior.c(131): cmn_prior_update: from < 44.69 -10.15  3.38  0.56 -14.73  4.29 -13.18  8.64 -6.64 -6.86  6.11 -1.27 -0.19 >
    INFO: cmn_prior.c(149): cmn_prior_update: to   < 44.69 -10.15  3.38  0.56 -14.73  4.29 -13.18  8.64 -6.64 -6.86  6.11 -1.27 -0.19 >
    INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 0 words
    INFO: ngram_search_fwdtree.c(432): TOTAL fwdtree 22.48 CPU 1.648 xRT
    INFO: ngram_search_fwdtree.c(435): TOTAL fwdtree 23.61 wall 1.731 xRT
    INFO: ngram_search_fwdflat.c(176): TOTAL fwdflat 8.93 CPU 0.655 xRT
    INFO: ngram_search_fwdflat.c(179): TOTAL fwdflat 9.38 wall 0.688 xRT
    INFO: ngram_search.c(303): TOTAL bestpath 0.05 CPU 0.004 xRT
    INFO: ngram_search.c(306): TOTAL bestpath 0.12 wall 0.009 xRT
    
     
    • Nickolay V. Shmyrev

      Ok, and what is your CPU exactly? What is the output of cat /proc/cpuinfo?

      You can try with jsgf grammar instead:

      #JSGF V1.0;
      
      grammar command;
      
      public <command> = (system | signal) (on | off | left | right);
      

      It should be faster than LM.

       
      • Nickolay V. Shmyrev

         
      • Remy Porter

        Remy Porter - 2016-06-16

        processor : 0
        model name : ARMv6-compatible processor rev 7 (v6l)
        BogoMIPS : 697.95
        Features : half thumb fastmult vfp edsp java tls
        CPU implementer : 0x41
        CPU architecture: 7
        CPU variant : 0x0
        CPU part : 0xb76
        CPU revision : 7

        Hardware : BCM2708
        Revision : 0015
        Serial : 00000000d620ebdb

         
      • Remy Porter

        Remy Porter - 2016-06-16

        And JSGF makes a lot more sense, and with some tweaking, should be exactly what I need. Thanks!

         
      • Remy Porter

        Remy Porter - 2016-06-16

        And this is the current output:

        pi@raspberrypi:~/voicecode $ pocketsphinx_continuous -keyphrase "SIGNAL" -samprate 44100 -nfft 2048 -jsgf commands.jsgf -lm ./lmm/9014.lm -dict ./lmm/9014.dic -infile systemtests.wav
        INFO: pocketsphinx.c(152): Parsed model-specific feature parameters from /usr/local/share/pocketsphinx/model/en-us/en-us/feat.params
        Current configuration:
        [NAME]          [DEFLT]     [VALUE]
        -agc            none        none
        -agcthresh      2.0     2.000000e+00
        -allphone
        -allphone_ci        no      no
        -alpha          0.97        9.700000e-01
        -ascale         20.0        2.000000e+01
        -aw         1       1
        -backtrace      no      no
        -beam           1e-48       1.000000e-48
        -bestpath       yes     yes
        -bestpathlw     9.5     9.500000e+00
        -ceplen         13      13
        -cmn            current     current
        -cmninit        8.0     40,3,-1
        -compallsen     no      no
        -debug                  0
        -dict                   ./lmm/9014.dic
        -dictcase       no      no
        -dither         no      no
        -doublebw       no      no
        -ds         1       1
        -fdict
        -feat           1s_c_d_dd   1s_c_d_dd
        -featparams
        -fillprob       1e-8        1.000000e-08
        -frate          100     100
        -fsg
        -fsgusealtpron      yes     yes
        -fsgusefiller       yes     yes
        -fwdflat        yes     yes
        -fwdflatbeam        1e-64       1.000000e-64
        -fwdflatefwid       4       4
        -fwdflatlw      8.5     8.500000e+00
        -fwdflatsfwin       25      25
        -fwdflatwbeam       7e-29       7.000000e-29
        -fwdtree        yes     yes
        -hmm                    /usr/local/share/pocketsphinx/model/en-us/en-us
        -input_endian       little      little
        -jsgf                   commands.jsgf
        -keyphrase              SIGNAL
        -kws
        -kws_delay      10      10
        -kws_plp        1e-1        1.000000e-01
        -kws_threshold      1       1.000000e+00
        -latsize        5000        5000
        -lda
        -ldadim         0       0
        -lifter         0       22
        -lm                 ./lmm/9014.lm
        -lmctl
        -lmname
        -logbase        1.0001      1.000100e+00
        -logfn
        -logspec        no      no
        -lowerf         133.33334   1.300000e+02
        -lpbeam         1e-40       1.000000e-40
        -lponlybeam     7e-29       7.000000e-29
        -lw         6.5     6.500000e+00
        -maxhmmpf       30000       30000
        -maxwpf         -1      -1
        -mdef
        -mean
        -mfclogdir
        -min_endfr      0       0
        -mixw
        -mixwfloor      0.0000001   1.000000e-07
        -mllr
        -mmap           yes     yes
        -ncep           13      13
        -nfft           512     2048
        -nfilt          40      25
        -nwpen          1.0     1.000000e+00
        -pbeam          1e-48       1.000000e-48
        -pip            1.0     1.000000e+00
        -pl_beam        1e-10       1.000000e-10
        -pl_pbeam       1e-10       1.000000e-10
        -pl_pip         1.0     1.000000e+00
        -pl_weight      3.0     3.000000e+00
        -pl_window      5       5
        -rawlogdir
        -remove_dc      no      no
        -remove_noise       yes     yes
        -remove_silence     yes     yes
        -round_filters      yes     yes
        -samprate       16000       4.410000e+04
        -seed           -1      -1
        -sendump
        -senlogdir
        -senmgau
        -silprob        0.005       5.000000e-03
        -smoothspec     no      no
        -svspec                 0-12/13-25/26-38
        -tmat
        -tmatfloor      0.0001      1.000000e-04
        -topn           4       4
        -topn_beam      0       0
        -toprule
        -transform      legacy      dct
        -unit_area      yes     yes
        -upperf         6855.4976   6.800000e+03
        -uw         1.0     1.000000e+00
        -vad_postspeech     50      50
        -vad_prespeech      20      20
        -vad_startspeech    10      10
        -vad_threshold      2.0     2.000000e+00
        -var
        -varfloor       0.0001      1.000000e-04
        -varnorm        no      no
        -verbose        no      no
        -warp_params
        -warp_type      inverse_linear  inverse_linear
        -wbeam          7e-29       7.000000e-29
        -wip            0.65        6.500000e-01
        -wlen           0.025625    2.562500e-02
        
        INFO: feat.c(715): Initializing feature stream to type: '1s_c_d_dd', ceplen=13, CMN='current', VARNORM='no', AGC='none'
        INFO: cmn.c(143): mean[0]= 12.00, mean[1..12]= 0.0
        INFO: acmod.c(164): Using subvector specification 0-12/13-25/26-38
        INFO: mdef.c(518): Reading model definition: /usr/local/share/pocketsphinx/model/en-us/en-us/mdef
        INFO: mdef.c(531): Found byte-order mark BMDF, assuming this is a binary mdef file
        INFO: bin_mdef.c(336): Reading binary model definition: /usr/local/share/pocketsphinx/model/en-us/en-us/mdef
        INFO: bin_mdef.c(516): 42 CI-phone, 137053 CD-phone, 3 emitstate/phone, 126 CI-sen, 5126 Sen, 29324 Sen-Seq
        INFO: tmat.c(206): Reading HMM transition probability matrices: /usr/local/share/pocketsphinx/model/en-us/en-us/transition_matrices
        INFO: acmod.c(117): Attempting to use PTM computation module
        INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /usr/local/share/pocketsphinx/model/en-us/en-us/means
        INFO: ms_gauden.c(292): 42 codebook, 3 feature, size:
        INFO: ms_gauden.c(294):  128x13
        INFO: ms_gauden.c(294):  128x13
        INFO: ms_gauden.c(294):  128x13
        INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /usr/local/share/pocketsphinx/model/en-us/en-us/variances
        INFO: ms_gauden.c(292): 42 codebook, 3 feature, size:
        INFO: ms_gauden.c(294):  128x13
        INFO: ms_gauden.c(294):  128x13
        INFO: ms_gauden.c(294):  128x13
        INFO: ms_gauden.c(354): 222 variance values floored
        INFO: ptm_mgau.c(476): Loading senones from dump file /usr/local/share/pocketsphinx/model/en-us/en-us/sendump
        INFO: ptm_mgau.c(500): BEGIN FILE FORMAT DESCRIPTION
        INFO: ptm_mgau.c(563): Rows: 128, Columns: 5126
        INFO: ptm_mgau.c(595): Using memory-mapped I/O for senones
        INFO: ptm_mgau.c(835): Maximum top-N: 4
        INFO: phone_loop_search.c(114): State beam -225 Phone exit beam -225 Insertion penalty 0
        INFO: dict.c(320): Allocating 4106 * 20 bytes (80 KiB) for word entries
        INFO: dict.c(333): Reading main dictionary: ./lmm/9014.dic
        INFO: dict.c(213): Allocated 0 KiB for strings, 0 KiB for phones
        INFO: dict.c(336): 5 words read
        INFO: dict.c(358): Reading filler dictionary: /usr/local/share/pocketsphinx/model/en-us/en-us/noisedict
        INFO: dict.c(213): Allocated 0 KiB for strings, 0 KiB for phones
        INFO: dict.c(361): 5 words read
        INFO: dict2pid.c(396): Building PID tables for dictionary
        INFO: dict2pid.c(406): Allocating 42^3 * 2 bytes (144 KiB) for word-initial triphones
        INFO: dict2pid.c(132): Allocated 21336 bytes (20 KiB) for word-final triphones
        INFO: dict2pid.c(196): Allocated 21336 bytes (20 KiB) for single-phone word triphones
        INFO: kws_search.c(420): KWS(beam: -1080, plp: -23, default threshold 0, delay 10)
        INFO: jsgf.c(706): Defined rule: <command.g00000>
        INFO: jsgf.c(706): Defined rule: PUBLIC <command.command>
        INFO: fsg_model.c(208): Computing transitive closure for null transitions
        INFO: fsg_model.c(270): 0 null transitions added
        INFO: fsg_search.c(227): FSG(beam: -1080, pbeam: -1080, wbeam: -634; wip: -26, pip: 0)
        INFO: fsg_model.c(423): Adding silence transitions for <sil> to FSG
        INFO: fsg_model.c(443): Added 2 silence word transitions
        INFO: fsg_model.c(423): Adding silence transitions for <sil> to FSG
        INFO: fsg_model.c(443): Added 2 silence word transitions
        INFO: fsg_model.c(423): Adding silence transitions for [NOISE] to FSG
        INFO: fsg_model.c(443): Added 2 silence word transitions
        INFO: fsg_search.c(173): Added 0 alternate word transitions
        INFO: fsg_lextree.c(110): Allocated 172 bytes (0 KiB) for left and right context phones
        INFO: fsg_lextree.c(256): 20 HMM nodes in lextree (8 leaves)
        INFO: fsg_lextree.c(259): Allocated 2400 bytes (2 KiB) for all lextree nodes
        INFO: fsg_lextree.c(262): Allocated 960 bytes (0 KiB) for lextree leafnodes
        INFO: kws_search.c(467): TOTAL kws 0.00 CPU nan xRT
        INFO: kws_search.c(470): TOTAL kws 0.00 wall nan xRT
        INFO: ngram_model_trie.c(347): Trying to read LM in trie binary format
        INFO: ngram_model_trie.c(358): Header doesn't match
        INFO: ngram_model_trie.c(176): Trying to read LM in arpa format
        INFO: ngram_model_trie.c(192): LM of order 3
        INFO: ngram_model_trie.c(194): #1-grams: 7
        INFO: ngram_model_trie.c(194): #2-grams: 10
        INFO: ngram_model_trie.c(194): #3-grams: 5
        INFO: lm_trie.c(473): Training quantizer
        INFO: lm_trie.c(481): Building LM trie
        INFO: ngram_search_fwdtree.c(99): 5 unique initial diphones
        INFO: ngram_search_fwdtree.c(148): 0 root, 0 non-root channels, 6 single-phone words
        INFO: ngram_search_fwdtree.c(186): Creating search tree
        INFO: ngram_search_fwdtree.c(192): before: 0 root, 0 non-root channels, 6 single-phone words
        INFO: ngram_search_fwdtree.c(326): after: max nonroot chan increased to 140
        INFO: ngram_search_fwdtree.c(339): after: 5 root, 12 non-root channels, 5 single-phone words
        INFO: ngram_search_fwdflat.c(157): fwdflat: min_ef_width = 4, max_sf_win = 25
        INFO: fsg_search.c(265): TOTAL fsg 0.00 CPU nan xRT
        INFO: fsg_search.c(268): TOTAL fsg 0.00 wall nan xRT
        INFO: continuous.c(307): pocketsphinx_continuous COMPILED ON: Jun  2 2016, AT: 23:42:13
        
        INFO: cmn_prior.c(131): cmn_prior_update: from < 40.00  3.00 -1.00  0.00  0.00  0.00  0.00  0.00  0.00  0.00  0.00  0.00  0.00 >
        INFO: cmn_prior.c(149): cmn_prior_update: to   < 40.56 -12.63  2.86  3.19 -10.88  8.73 -13.00  7.11 -4.79 -3.85  5.89 -2.73 -2.47 >
        INFO: ngram_search_fwdtree.c(1553):      937 words recognized (4/fr)
        INFO: ngram_search_fwdtree.c(1555):    14225 senones evaluated (57/fr)
        INFO: ngram_search_fwdtree.c(1559):     5761 channels searched (23/fr), 1051 1st, 3468 last
        INFO: ngram_search_fwdtree.c(1562):     1069 words for which last channels evaluated (4/fr)
        INFO: ngram_search_fwdtree.c(1564):      107 candidate words for entering last phone (0/fr)
        INFO: ngram_search_fwdtree.c(1567): fwdtree 3.94 CPU 1.589 xRT
        INFO: ngram_search_fwdtree.c(1570): fwdtree 4.17 wall 1.680 xRT
        INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 4 words
        INFO: ngram_search_fwdflat.c(948):      910 words recognized (4/fr)
        INFO: ngram_search_fwdflat.c(950):    11000 senones evaluated (44/fr)
        INFO: ngram_search_fwdflat.c(952):     5156 channels searched (20/fr)
        INFO: ngram_search_fwdflat.c(954):     1344 words searched (5/fr)
        INFO: ngram_search_fwdflat.c(957):      176 word transitions (0/fr)
        INFO: ngram_search_fwdflat.c(960): fwdflat 1.41 CPU 0.569 xRT
        INFO: ngram_search_fwdflat.c(963): fwdflat 1.49 wall 0.602 xRT
        INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.202
        INFO: ngram_search.c(1279): Eliminated 1 nodes before end node
        INFO: ngram_search.c(1384): Lattice has 255 nodes, 288 links
        INFO: ps_lattice.c(1380): Bestpath score: -3046
        INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:202:246) = -176836
        INFO: ps_lattice.c(1441): Joint P(O,S) = -197475 P(S|O) = -20639
        INFO: ngram_search.c(875): bestpath 0.01 CPU 0.004 xRT
        INFO: ngram_search.c(878): bestpath 0.02 wall 0.009 xRT
        SIGNAL LEFT
        INFO: cmn_prior.c(131): cmn_prior_update: from < 40.56 -12.63  2.86  3.19 -10.88  8.73 -13.00  7.11 -4.79 -3.85  5.89 -2.73 -2.47 >
        INFO: cmn_prior.c(149): cmn_prior_update: to   < 43.22 -11.16  0.69  6.22 -10.22  5.94 -14.83  7.24 -4.54 -7.18  6.21 -0.54 -2.58 >
        INFO: ngram_search_fwdtree.c(1553):      573 words recognized (3/fr)
        INFO: ngram_search_fwdtree.c(1555):    10698 senones evaluated (64/fr)
        INFO: ngram_search_fwdtree.c(1559):     4589 channels searched (27/fr), 659 1st, 3075 last
        INFO: ngram_search_fwdtree.c(1562):      688 words for which last channels evaluated (4/fr)
        INFO: ngram_search_fwdtree.c(1564):       75 candidate words for entering last phone (0/fr)
        INFO: ngram_search_fwdtree.c(1567): fwdtree 2.82 CPU 1.679 xRT
        INFO: ngram_search_fwdtree.c(1570): fwdtree 2.98 wall 1.776 xRT
        INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 5 words
        INFO: ngram_search_fwdflat.c(948):      513 words recognized (3/fr)
        INFO: ngram_search_fwdflat.c(950):    10228 senones evaluated (61/fr)
        INFO: ngram_search_fwdflat.c(952):     5051 channels searched (30/fr)
        INFO: ngram_search_fwdflat.c(954):      915 words searched (5/fr)
        INFO: ngram_search_fwdflat.c(957):      199 word transitions (1/fr)
        INFO: ngram_search_fwdflat.c(960): fwdflat 1.14 CPU 0.679 xRT
        INFO: ngram_search_fwdflat.c(963): fwdflat 1.19 wall 0.709 xRT
        INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.123
        INFO: ngram_search.c(1279): Eliminated 1 nodes before end node
        INFO: ngram_search.c(1384): Lattice has 149 nodes, 86 links
        INFO: ps_lattice.c(1380): Bestpath score: -2489
        INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:123:166) = -161315
        INFO: ps_lattice.c(1441): Joint P(O,S) = -174486 P(S|O) = -13171
        INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
        INFO: ngram_search.c(878): bestpath 0.01 wall 0.006 xRT
        SIGNAL RIGHT
        INFO: cmn_prior.c(131): cmn_prior_update: from < 43.22 -11.16  0.69  6.22 -10.22  5.94 -14.83  7.24 -4.54 -7.18  6.21 -0.54 -2.58 >
        INFO: cmn_prior.c(149): cmn_prior_update: to   < 43.85 -11.07  1.01  4.70 -13.55  4.51 -12.91  8.63 -4.98 -7.35  6.83 -0.93 -2.10 >
        INFO: ngram_search_fwdtree.c(1553):      806 words recognized (4/fr)
        INFO: ngram_search_fwdtree.c(1555):    11810 senones evaluated (61/fr)
        INFO: ngram_search_fwdtree.c(1559):     4829 channels searched (25/fr), 806 1st, 2893 last
        INFO: ngram_search_fwdtree.c(1562):      895 words for which last channels evaluated (4/fr)
        INFO: ngram_search_fwdtree.c(1564):      123 candidate words for entering last phone (0/fr)
        INFO: ngram_search_fwdtree.c(1567): fwdtree 3.21 CPU 1.663 xRT
        INFO: ngram_search_fwdtree.c(1570): fwdtree 3.39 wall 1.758 xRT
        INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 4 words
        INFO: ngram_search_fwdflat.c(948):      804 words recognized (4/fr)
        INFO: ngram_search_fwdflat.c(950):    11123 senones evaluated (58/fr)
        INFO: ngram_search_fwdflat.c(952):     5406 channels searched (28/fr)
        INFO: ngram_search_fwdflat.c(954):     1124 words searched (5/fr)
        INFO: ngram_search_fwdflat.c(957):      162 word transitions (0/fr)
        INFO: ngram_search_fwdflat.c(960): fwdflat 1.29 CPU 0.668 xRT
        INFO: ngram_search_fwdflat.c(963): fwdflat 1.37 wall 0.709 xRT
        INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.153
        INFO: ngram_search.c(1279): Eliminated 1 nodes before end node
        INFO: ngram_search.c(1384): Lattice has 198 nodes, 371 links
        INFO: ps_lattice.c(1380): Bestpath score: -3680
        INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:153:191) = -195067
        INFO: ps_lattice.c(1441): Joint P(O,S) = -228604 P(S|O) = -33537
        INFO: ngram_search.c(875): bestpath 0.01 CPU 0.005 xRT
        INFO: ngram_search.c(878): bestpath 0.02 wall 0.009 xRT
        SIGNAL STOP
        INFO: cmn_prior.c(99): cmn_prior_update: from < 43.85 -11.07  1.01  4.70 -13.55  4.51 -12.91  8.63 -4.98 -7.35  6.83 -0.93 -2.10 >
        INFO: cmn_prior.c(116): cmn_prior_update: to   < 44.69 -11.43  2.95  2.25 -14.15  6.53 -11.29  7.80 -4.75 -6.60  6.63 -0.94 -0.22 >
        INFO: cmn_prior.c(131): cmn_prior_update: from < 44.69 -11.43  2.95  2.25 -14.15  6.53 -11.29  7.80 -4.75 -6.60  6.63 -0.94 -0.22 >
        INFO: cmn_prior.c(149): cmn_prior_update: to   < 43.01 -10.79  3.14  1.15 -13.50  7.19 -11.42  7.55 -4.59 -6.25  5.88 -1.18  0.00 >
        INFO: ngram_search_fwdtree.c(1553):      822 words recognized (4/fr)
        INFO: ngram_search_fwdtree.c(1555):    13733 senones evaluated (64/fr)
        INFO: ngram_search_fwdtree.c(1559):     6001 channels searched (28/fr), 859 1st, 3862 last
        INFO: ngram_search_fwdtree.c(1562):      961 words for which last channels evaluated (4/fr)
        INFO: ngram_search_fwdtree.c(1564):      114 candidate words for entering last phone (0/fr)
        INFO: ngram_search_fwdtree.c(1567): fwdtree 3.46 CPU 1.617 xRT
        INFO: ngram_search_fwdtree.c(1570): fwdtree 3.66 wall 1.709 xRT
        INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 5 words
        INFO: ngram_search_fwdflat.c(948):      760 words recognized (4/fr)
        INFO: ngram_search_fwdflat.c(950):    13309 senones evaluated (62/fr)
        INFO: ngram_search_fwdflat.c(952):     6575 channels searched (30/fr)
        INFO: ngram_search_fwdflat.c(954):     1279 words searched (5/fr)
        INFO: ngram_search_fwdflat.c(957):      205 word transitions (0/fr)
        INFO: ngram_search_fwdflat.c(960): fwdflat 1.44 CPU 0.673 xRT
        INFO: ngram_search_fwdflat.c(963): fwdflat 1.52 wall 0.711 xRT
        INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.179
        INFO: ngram_search.c(1279): Eliminated 1 nodes before end node
        INFO: ngram_search.c(1384): Lattice has 163 nodes, 263 links
        INFO: ps_lattice.c(1380): Bestpath score: -5075
        INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:179:212) = -271550
        INFO: ps_lattice.c(1441): Joint P(O,S) = -309344 P(S|O) = -37794
        INFO: ngram_search.c(875): bestpath 0.01 CPU 0.005 xRT
        INFO: ngram_search.c(878): bestpath 0.02 wall 0.011 xRT
        SIGNAL STOP
        INFO: cmn_prior.c(131): cmn_prior_update: from < 43.01 -10.79  3.14  1.15 -13.50  7.19 -11.42  7.55 -4.59 -6.25  5.88 -1.18  0.00 >
        INFO: cmn_prior.c(149): cmn_prior_update: to   < 45.71 -9.70  2.46 -0.12 -16.31  5.28 -10.94  7.86 -4.96 -6.55  5.80 -1.64  0.16 >
        INFO: ngram_search_fwdtree.c(1553):      774 words recognized (5/fr)
        INFO: ngram_search_fwdtree.c(1555):    15391 senones evaluated (96/fr)
        INFO: ngram_search_fwdtree.c(1559):     7631 channels searched (47/fr), 715 1st, 5665 last
        INFO: ngram_search_fwdtree.c(1562):      874 words for which last channels evaluated (5/fr)
        INFO: ngram_search_fwdtree.c(1564):      231 candidate words for entering last phone (1/fr)
        INFO: ngram_search_fwdtree.c(1567): fwdtree 2.77 CPU 1.731 xRT
        INFO: ngram_search_fwdtree.c(1570): fwdtree 2.92 wall 1.824 xRT
        INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 6 words
        INFO: ngram_search_fwdflat.c(948):      774 words recognized (5/fr)
        INFO: ngram_search_fwdflat.c(950):    16958 senones evaluated (106/fr)
        INFO: ngram_search_fwdflat.c(952):     9729 channels searched (60/fr)
        INFO: ngram_search_fwdflat.c(954):     1191 words searched (7/fr)
        INFO: ngram_search_fwdflat.c(957):      305 word transitions (1/fr)
        INFO: ngram_search_fwdflat.c(960): fwdflat 1.14 CPU 0.713 xRT
        INFO: ngram_search_fwdflat.c(963): fwdflat 1.19 wall 0.746 xRT
        INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.134
        INFO: ngram_search.c(1279): Eliminated 2 nodes before end node
        INFO: ngram_search.c(1384): Lattice has 175 nodes, 227 links
        INFO: ps_lattice.c(1380): Bestpath score: -4824
        INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:134:158) = -263021
        INFO: ps_lattice.c(1441): Joint P(O,S) = -296546 P(S|O) = -33525
        INFO: ngram_search.c(875): bestpath 0.01 CPU 0.006 xRT
        INFO: ngram_search.c(878): bestpath 0.01 wall 0.008 xRT
        SIGNAL STOP
        INFO: cmn_prior.c(99): cmn_prior_update: from < 45.71 -9.70  2.46 -0.12 -16.31  5.28 -10.94  7.86 -4.96 -6.55  5.80 -1.64  0.16 >
        INFO: cmn_prior.c(116): cmn_prior_update: to   < 46.37 -10.45  3.40  0.33 -15.60  5.64 -11.92  7.43 -4.96 -6.99  5.90 -1.78 -0.07 >
        INFO: cmn_prior.c(131): cmn_prior_update: from < 46.37 -10.45  3.40  0.33 -15.60  5.64 -11.92  7.43 -4.96 -6.99  5.90 -1.78 -0.07 >
        INFO: cmn_prior.c(149): cmn_prior_update: to   < 43.67 -9.67  3.24 -0.68 -16.17  4.57 -10.66  9.09 -6.51 -6.81  5.74 -1.41  0.10 >
        INFO: ngram_search_fwdtree.c(1553):      896 words recognized (4/fr)
        INFO: ngram_search_fwdtree.c(1555):    16099 senones evaluated (78/fr)
        INFO: ngram_search_fwdtree.c(1559):     7374 channels searched (35/fr), 920 1st, 4902 last
        INFO: ngram_search_fwdtree.c(1562):     1019 words for which last channels evaluated (4/fr)
        INFO: ngram_search_fwdtree.c(1564):      209 candidate words for entering last phone (1/fr)
        INFO: ngram_search_fwdtree.c(1567): fwdtree 3.36 CPU 1.631 xRT
        INFO: ngram_search_fwdtree.c(1570): fwdtree 3.56 wall 1.730 xRT
        INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 6 words
        INFO: ngram_search_fwdflat.c(948):      915 words recognized (4/fr)
        INFO: ngram_search_fwdflat.c(950):    18979 senones evaluated (92/fr)
        INFO: ngram_search_fwdflat.c(952):    10767 channels searched (52/fr)
        INFO: ngram_search_fwdflat.c(954):     1434 words searched (6/fr)
        INFO: ngram_search_fwdflat.c(957):      336 word transitions (1/fr)
        INFO: ngram_search_fwdflat.c(960): fwdflat 1.39 CPU 0.675 xRT
        INFO: ngram_search_fwdflat.c(963): fwdflat 1.47 wall 0.712 xRT
        INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.145
        INFO: ngram_search.c(1279): Eliminated 1 nodes before end node
        INFO: ngram_search.c(1384): Lattice has 182 nodes, 185 links
        INFO: ps_lattice.c(1380): Bestpath score: -4983
        INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:145:204) = -279027
        INFO: ps_lattice.c(1441): Joint P(O,S) = -322759 P(S|O) = -43732
        INFO: ngram_search.c(875): bestpath 0.01 CPU 0.005 xRT
        INFO: ngram_search.c(878): bestpath 0.01 wall 0.005 xRT
        SIGNAL STOP
        INFO: cmn_prior.c(131): cmn_prior_update: from < 43.67 -9.67  3.24 -0.68 -16.17  4.57 -10.66  9.09 -6.51 -6.81  5.74 -1.41  0.10 >
        INFO: cmn_prior.c(149): cmn_prior_update: to   < 44.69 -10.15  3.38  0.56 -14.73  4.29 -13.18  8.64 -6.64 -6.86  6.11 -1.27 -0.19 >
        INFO: ngram_search_fwdtree.c(1553):      554 words recognized (3/fr)
        INFO: ngram_search_fwdtree.c(1555):    11263 senones evaluated (62/fr)
        INFO: ngram_search_fwdtree.c(1559):     4799 channels searched (26/fr), 687 1st, 3192 last
        INFO: ngram_search_fwdtree.c(1562):      701 words for which last channels evaluated (3/fr)
        INFO: ngram_search_fwdtree.c(1564):       93 candidate words for entering last phone (0/fr)
        INFO: ngram_search_fwdtree.c(1567): fwdtree 2.98 CPU 1.637 xRT
        INFO: ngram_search_fwdtree.c(1570): fwdtree 3.15 wall 1.733 xRT
        INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 4 words
        INFO: ngram_search_fwdflat.c(948):      500 words recognized (3/fr)
        INFO: ngram_search_fwdflat.c(950):    10245 senones evaluated (56/fr)
        INFO: ngram_search_fwdflat.c(952):     4888 channels searched (26/fr)
        INFO: ngram_search_fwdflat.c(954):      904 words searched (4/fr)
        INFO: ngram_search_fwdflat.c(957):      162 word transitions (0/fr)
        INFO: ngram_search_fwdflat.c(960): fwdflat 1.21 CPU 0.665 xRT
        INFO: ngram_search_fwdflat.c(963): fwdflat 1.27 wall 0.696 xRT
        INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.138
        INFO: ngram_search.c(1279): Eliminated 1 nodes before end node
        INFO: ngram_search.c(1384): Lattice has 143 nodes, 62 links
        INFO: ps_lattice.c(1380): Bestpath score: -2666
        INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:138:180) = -173267
        INFO: ps_lattice.c(1441): Joint P(O,S) = -181346 P(S|O) = -8079
        INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
        INFO: ngram_search.c(878): bestpath 0.01 wall 0.006 xRT
        SIGNAL RIGHT
        INFO: cmn_prior.c(131): cmn_prior_update: from < 44.69 -10.15  3.38  0.56 -14.73  4.29 -13.18  8.64 -6.64 -6.86  6.11 -1.27 -0.19 >
        INFO: cmn_prior.c(149): cmn_prior_update: to   < 44.69 -10.15  3.38  0.56 -14.73  4.29 -13.18  8.64 -6.64 -6.86  6.11 -1.27 -0.19 >
        INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 0 words
        INFO: ngram_search_fwdtree.c(432): TOTAL fwdtree 22.69 CPU 1.663 xRT
        INFO: ngram_search_fwdtree.c(435): TOTAL fwdtree 24.02 wall 1.761 xRT
        INFO: ngram_search_fwdflat.c(176): TOTAL fwdflat 9.02 CPU 0.661 xRT
        INFO: ngram_search_fwdflat.c(179): TOTAL fwdflat 9.50 wall 0.696 xRT
        INFO: ngram_search.c(303): TOTAL bestpath 0.05 CPU 0.004 xRT
        INFO: ngram_search.c(306): TOTAL bestpath 0.11 wall 0.008 xRT``
        
         
        • Nickolay V. Shmyrev

          jsgf and lm conflict with each other, you need to use either jsgf or lm.

           
          • Remy Porter

            Remy Porter - 2016-06-16

            Hey, that's much better!

             

Log in to post a comment.

Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.