I've been working with the stand-alone pre-compiled version of pocketsphinx for the last 48 hours. I found the technology really interesting and I'm just practicing a little bit. At this point, I've a reduced dictionary, as the only thing I have to detect are numbers.
However for some reason, it's actually properly detecting all the numbers, but not the "six".
zero Z IH R OW
zero(2) Z IY R OW
one W AH N
one(2) HH W AH N
two T UW
three TH R IY
four F AO R
five F AY V
six S IH T S
six(2) S IH K S IH Z
six(3) S IH K S T
seven S EH V AH N
eight EY T
nine N AY N
INFO: feat.c(715): Initializing feature stream to type: '1s_c_d_dd', ceplen=13, CMN='current', VARNORM='no', AGC='none'
INFO: cmn.c(143): mean[0]= 12.00, mean[1..12]= 0.0
INFO: acmod.c(154): Reading linear feature transformation from model/en-us/en-us/feature_transform
INFO: mdef.c(518): Reading model definition: model/en-us/en-us/mdef
INFO: bin_mdef.c(181): Allocating 142124 * 8 bytes (1110 KiB) for CD tree
INFO: tmat.c(206): Reading HMM transition probability matrices: model/en-us/en-us/transition_matrices
INFO: acmod.c(117): Attempting to use PTM computation module
INFO: ms_gauden.c(198): Reading mixture gaussian parameter: model/en-us/en-us/means
INFO: ms_gauden.c(292): 5138 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 32x36
INFO: ms_gauden.c(198): Reading mixture gaussian parameter: model/en-us/en-us/variances
INFO: ms_gauden.c(292): 5138 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 32x36
INFO: ms_gauden.c(354): 813 variance values floored
INFO: ptm_mgau.c(801): Number of codebooks exceeds 256: 5138
INFO: acmod.c(119): Attempting to use semi-continuous computation module
INFO: ms_gauden.c(198): Reading mixture gaussian parameter: model/en-us/en-us/means
INFO: ms_gauden.c(292): 5138 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 32x36
INFO: ms_gauden.c(198): Reading mixture gaussian parameter: model/en-us/en-us/variances
INFO: ms_gauden.c(292): 5138 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 32x36
INFO: ms_gauden.c(354): 813 variance values floored
INFO: acmod.c(121): Falling back to general multi-stream GMM computation
INFO: ms_gauden.c(198): Reading mixture gaussian parameter: model/en-us/en-us/means
INFO: ms_gauden.c(292): 5138 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 32x36
INFO: ms_gauden.c(198): Reading mixture gaussian parameter: model/en-us/en-us/variances
INFO: ms_gauden.c(292): 5138 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 32x36
INFO: ms_gauden.c(354): 813 variance values floored
INFO: ms_senone.c(149): Reading senone mixture weights: model/en-us/en-us/mixture_weights
INFO: ms_senone.c(200): Truncating senone logs3(pdf) values by 10 bits
INFO: ms_senone.c(207): Not transposing mixture weights in memory
INFO: ms_senone.c(268): Read mixture weights for 5138 senones: 1 features x 32 codewords
INFO: ms_senone.c(320): Mapping senones to individual codebooks
INFO: ms_mgau.c(141): The value of topn: 4
INFO: phone_loop_search.c(114): State beam -225 Phone exit beam -225 Insertion penalty 0
INFO: dict.c(320): Allocating 4119 * 32 bytes (128 KiB) for word entries
INFO: dict.c(333): Reading main dictionary: model/en-us/micro.dict
INFO: dict.c(213): Allocated 0 KiB for strings, 0 KiB for phones
INFO: dict.c(336): 14 words read
INFO: dict.c(358): Reading filler dictionary: model/en-us/en-us/noisedict
INFO: dict.c(213): Allocated 0 KiB for strings, 0 KiB for phones
INFO: dict.c(361): 9 words read
INFO: dict2pid.c(396): Building PID tables for dictionary
INFO: dict2pid.c(406): Allocating 46^3 * 2 bytes (190 KiB) for word-initial triphones
INFO: dict2pid.c(132): Allocated 51152 bytes (49 KiB) for word-final triphones
INFO: dict2pid.c(196): Allocated 51152 bytes (49 KiB) for single-phone word triphones
INFO: ngram_model_trie.c(347): Trying to read LM in trie binary format
INFO: ngram_search_fwdtree.c(99): 12 unique initial diphones
INFO: ngram_search_fwdtree.c(148): 0 root, 0 non-root channels, 10 single-phone words
INFO: ngram_search_fwdtree.c(186): Creating search tree
INFO: ngram_search_fwdtree.c(192): before: 0 root, 0 non-root channels, 10 single-phone words
INFO: ngram_search_fwdtree.c(326): after: max nonroot chan increased to 149
INFO: ngram_search_fwdtree.c(339): after: 12 root, 21 non-root channels, 9 single-phone words
INFO: ngram_search_fwdflat.c(157): fwdflat: min_ef_width = 4, max_sf_win = 25
INFO: continuous.c(307): C:\Users\admin\Desktop\modelo1\bin\Release\x64\pocketsphinx_continuous.exe COMPILED ON: Jan 24 2016, AT: 07:35:22
INFO: cmn_prior.c(131): cmn_prior_update: from < 40.00 3.00 -1.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 >
INFO: cmn_prior.c(149): cmn_prior_update: to < 59.77 25.73 -30.09 31.64 -17.87 -3.09 5.67 -16.70 17.23 -13.99 1.21 5.62 -9.56 >
INFO: ngram_search_fwdtree.c(1553): 1615 words recognized (6/fr)
INFO: ngram_search_fwdtree.c(1555): 45610 senones evaluated (170/fr)
INFO: ngram_search_fwdtree.c(1559): 27981 channels searched (104/fr), 2503 1st, 23331 last
INFO: ngram_search_fwdtree.c(1562): 2207 words for which last channels evaluated (8/fr)
INFO: ngram_search_fwdtree.c(1564): 926 candidate words for entering last phone (3/fr)
INFO: ngram_search_fwdtree.c(1567): fwdtree 0.16 CPU 0.058 xRT
INFO: ngram_search_fwdtree.c(1570): fwdtree 0.14 wall 0.054 xRT
INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 10 words
INFO: ngram_search_fwdflat.c(948): 859 words recognized (3/fr)
INFO: ngram_search_fwdflat.c(950): 35278 senones evaluated (131/fr)
INFO: ngram_search_fwdflat.c(952): 20950 channels searched (77/fr)
INFO: ngram_search_fwdflat.c(954): 2432 words searched (9/fr)
INFO: ngram_search_fwdflat.c(957): 1039 word transitions (3/fr)
INFO: ngram_search_fwdflat.c(960): fwdflat 0.05 CPU 0.017 xRT
INFO: ngram_search_fwdflat.c(963): fwdflat 0.06 wall 0.020 xRT
INFO: ngram_search.c(1253): lattice start node .0 end node .233
INFO: ngram_search.c(1279): Eliminated 0 nodes before end node
INFO: ngram_search.c(1384): Lattice has 98 nodes, 7 links
INFO: ps_lattice.c(1380): Besteight two
eight two two one
path score: -9352
INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(:233:267) = -612077
INFO: ps_lattice.c(1441): Joint P(O,S) = -619201 P(S|O) = -7124
INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
INFO: ngram_search.c(878): bestpath 0.00 wall 0.000 xRT
INFO: cmn_prior.c(131): cmn_prior_update: from < 59.77 25.73 -30.09 31.64 -17.87 -3.09 5.67 -16.70 17.23 -13.99 1.21 5.62 -9.56 >
INFO: cmn_prior.c(149): cmn_prior_update: to < 57.05 24.79 -29.11 34.55 -18.54 -2.06 6.49 -20.19 14.28 -14.35 -0.42 5.39 -8.80 >
INFO: ngram_search_fwdtree.c(1553): 3021 words recognized (9/fr)
INFO: ngram_search_fwdtree.c(1555): 83996 senones evaluated (246/fr)
INFO: ngram_search_fwdtree.c(1559): 52714 channels searched (154/fr), 3981 1st, 44159 last
INFO: ngram_search_fwdtree.c(1562): 4008 words for which last channels evaluated (11/fr)
INFO: ngram_search_fwdtree.c(1564): 1633 candidate words for entering last phone (4/fr)
INFO: ngram_search_fwdtree.c(1567): fwdtree 0.23 CPU 0.068 xRT
INFO: ngram_search_fwdtree.c(1570): fwdtree 0.24 wall 0.070 xRT
INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 15 words
INFO: ngram_search_fwdflat.c(948): 2900 words recognized (8/fr)
INFO: ngram_search_fwdflat.c(950): 98768 senones evaluated (289/fr)
INFO: ngram_search_fwdflat.c(952): 70226 channels searched (205/fr)
INFO: ngram_search_fwdflat.c(954): 5789 words searched (16/fr)
INFO: ngram_search_fwdflat.c(957): 2346 word transitions (6/fr)
INFO: ngram_search_fwdflat.c(960): fwdflat 0.19 CPU 0.055 xRT
INFO: ngram_search_fwdflat.c(963): fwdflat 0.18 wall 0.053 xRT
INFO: ngram_search.c(1253): lattice start node .0 end node .299
INFO: ngram_search.c(1279): Eliminated 0 nodes before end node
INFO: ngram_search.c(1384): Lattice has 578 nodes, 2651 links
INFO: ps_lattice.c(1380): Bestpath score: -15225
INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(:299:340) = -833498
INFO: ps_lattice.c(1441): Joint P(O,S) = -973462 P(S|O) = -139964
INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
INFO: ngram_search.c(878): bestpath 0.00 wall 0.000 xRT
INFO: cmn_prior.c(99): cmn_prior_update: from < 57.05 24.79 -29.11 34.55 -18.54 -2.06 6.49 -20.19 14.28 -14.35 -0.42 5.39 -8.80 >
INFO: cmn_prior.c(116): cmn_prior_update: to < 57.35 25.83 -31.31 37.35 -18.26 -1.94 5.26 -21.96 14.76 -14.39 0.19 5.88 -8.60 >
INFO: cmn_prior.c(99): cmn_prior_update: from < 57.35 25.83 -31.31 37.35 -18.26 -1.94 5.26 -21.96 14.76 -14.39 0.19 5.88 -8.60 >
INFO: cmn_prior.c(116): cmn_prior_update: to < 56.83 29.95 -32.22 36.91 -19.88 -0.93 6.56 -24.45 14.15 -15.08 -0.21 6.80 -8.44 >
INFO: ngram_search.c(459): Resized backpointer table to 10000 entries
INFO: cmn_prior.c(131): cmn_prior_update: from < 56.83 29.95 -32.22 36.91 -19.88 -0.93 6.56 -24.45 14.15 -15.08 -0.21 6.80 -8.44 >
INFO: cmn_prior.c(149): cmn_prior_update: to < 52.44 27.33 -34.54 34.08 -19.19 -1.27 4.84 -24.13 13.82 -14.10 -0.18 7.15 -8.00 >
INFO: ngram_search_fwdtree.c(1553): 5418 words recognized (8/fr)
INFO: ngram_search_fwdtree.c(1555): 161043 senones evaluated (229/fr)
INFO: ngram_search_fwdtree.c(1559): 100628 channels searched (142/fr), 8192 1st, 83757 last
INFO: ngram_search_fwdtree.c(1562): 7635 words for which last channels evaluated (10/fr)
INFO: ngram_search_fwdtree.c(1564): 3047 candidate words for entering last phone (4/fr)
INFO: ngram_search_fwdtree.c(1567): fwdtree 0.47 CPU 0.066 xRT
INFO: ngram_search_fwdtree.c(1570): fwdtree 0.47 wall 0.066 xRT
INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 14 words
INFO: ngram_search_fwdflat.c(948): 5481 words recognized (8/fr)
INFO: ngram_search_fwdflat.c(950): 203107 senones evaluated (289/fr)
INFO: ngram_search_fwdflat.c(952): 145265 channels searched (206/fr)
INFO: ngram_search_fwdflat.c(954): 12049 words searched (17/fr)
INFO: ngram_search_fwdflat.c(957): 5062 word transitions (7/fr)
INFO: ngram_search_fwdflat.c(960): fwdflat 0.37 CPU 0.053 xRT
INFO: ngram_search_fwdflat.c(963): fwdflat 0.36 wall 0.052 xRT
INFO: ngram_search.nine three one two one two one three
three
c(1253): lattice start node .0 end node .640
INFO: ngram_search.c(1279): Eliminated 0 nodes before end node
INFO: ngram_search.c(1384): Lattice has 1141 nodes, 4596 links
INFO: ps_lattice.c(1380): Bestpath score: -34289
INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(:640:702) = -1840914
INFO: ps_lattice.c(1441): Joint P(O,S) = -2111102 P(S|O) = -270188
INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
INFO: ngram_search.c(878): bestpath 0.01 wall 0.001 xRT
INFO: cmn_prior.c(131): cmn_prior_update: from < 52.44 27.33 -34.54 34.08 -19.19 -1.27 4.84 -24.13 13.82 -14.10 -0.18 7.15 -8.00 >
INFO: cmn_prior.c(149): cmn_prior_update: to < 52.28 28.18 -34.33 35.30 -19.53 -1.45 4.96 -25.44 13.61 -14.04 0.13 7.98 -7.48 >
INFO: ngram_search_fwdtree.c(1553): 1294 words recognized (9/fr)
INFO: ngram_search_fwdtree.c(1555): 32716 senones evaluated (237/fr)
INFO: ngram_search_fwdtree.c(1559): 20092 channels searched (145/fr), 1608 1st, 16562 last
INFO: ngram_search_fwdtree.c(1562): 1628 words for which last channels evaluated (11/fr)
INFO: ngram_search_fwdtree.c(1564): 626 candidate words for entering last phone (4/fr)
INFO: ngram_search_fwdtree.c(1567): fwdtree 0.09 CPU 0.068 xRT
INFO: ngram_search_fwdtree.c(1570): fwdtree 0.09 wall 0.069 xRT
INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 13 words
INFO: ngram_search_fwdflat.c(948): 1101 words recognized (8/fr)
INFO: ngram_search_fwdflat.c(950): 40711 senones evaluated (295/fr)
INFO: ngram_search_fwdflat.c(952): 29155 channels searched (211/fr)
INFO: ngram_search_fwdflat.c(954): 2406 words searched (17/fr)
INFO: ngram_search_fwdflat.c(957): 899 word transitions (6/fr)
INFO: ngram_search_fwdflat.c(960): fwdflat 0.08 CPU 0.057 xRT
INFO: ngram_search_fwdflat.c(963): fwdflat 0.08 wall 0.054 xRT
INFO: ngram_search.c(1253): lattice start node .0 end node .121
INFO: ngram_search.c(1279): Eliminated 0 nodes before end node
INFO: ngram_search.c(1384): Lattice has 242 nodes, 1043 links
INFO: ps_lattice.c(1380): Bestpath score: -6526
INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(:121:136) = -318823
INFO: ps_lattice.c(1441): Joint P(O,S) = -402781 P(S|O) = -83958
INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
INFO: ngram_search.c(878): bestpath 0.00 wall 0.000 xRT
INFO: cmn_prior.c(99): cmn_prior_update: from < 52.28 28.18 -34.33 35.30 -19.53 -1.45 4.96 -25.44 13.61 -14.04 0.13 7.98 -7.48 >
INFO: cmn_prior.c(116): cmn_prior_update: to < 52.24 28.18 -34.25 35.34 -19.45 -1.51 4.92 -25.40 13.60 -14.09 0.05 8.05 -7.44 >
INFO: cmn_prior.c(131): cmn_prior_update: from < 52.24 28.18 -34.25 35.34 -19.45 -1.51 4.92 -25.40 13.60 -14.09 0.05 8.05 -7.44 >
INFO: cmn_prior.c(149): cmn_prior_update: to < 49.34 25.26 -34.22 34.17 -19.44 -2.38 3.70 -23.77 12.56 -13.28 -0.27 7.69 -7.60 >
INFO: ngram_search_fwdtree.c(1553): 578 words recognized (5/fr)
INFO: ngram_search_fwdtree.c(1555): 16717 senones evaluated (148/fr)
INFO: ngram_search_fwdtree.c(1559): 8290 channels searched (73/fr), 1308 1st, 5881 last
INFO: ngram_search_fwdtree.c(1562): 935 words for which last channels evaluated (8/fr)
INFO: ngram_search_fwdtree.c(1564): 178 candidate words for entering last phone (1/fr)
INFO: ngram_search_fwdtree.c(1567): fwdtree 0.06 CPU 0.055 xRT
INFO: ngram_search_fwdtree.c(1570): fwdtree 0.05 wall 0.049 xRT
INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 6 words
INFO: ngram_search_fwdflat.c(948): 652 words recognized (6/fr)
INFO: ngram_search_fwdflat.c(950): 15326 senones evaluated (136/fr)
INFO: ngram_search_fwdflat.c(952): 7913 channels searched (70/fr)
INFO: ngram_search_fwdflat.c(954): 1313 words searched (11/fr)
INFO: ngram_search_fwdflat.c(957): 328 word transitions (2/fr)
INFO: ngram_search_fwdflat.c(960): fwdflat 0.02 CPU 0.014 xRT
INFO: ngram_search_fwdflat.c(963): fwdflat 0.03 wall 0.022 xRT
INFO: ngram_search.c(1253): lattice start node .0 end node .70
INFO: ngram_search.c(1279): Eliminated 0 nodes before end node
INFO: ngram_seaeight
three
eight
rch.c(1384): Lattice has 182 nodes, 688 links
INFO: ps_lattice.c(1380): Bestpath score: -2709
INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(:70:111) = -158830
INFO: ps_lattice.c(1441): Joint P(O,S) = -217931 P(S|O) = -59101
INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
INFO: ngram_search.c(878): bestpath 0.00 wall 0.000 xRT
INFO: cmn_prior.c(131): cmn_prior_update: from < 49.34 25.26 -34.22 34.17 -19.44 -2.38 3.70 -23.77 12.56 -13.28 -0.27 7.69 -7.60 >
INFO: cmn_prior.c(149): cmn_prior_update: to < 48.91 24.86 -33.06 36.03 -18.79 -2.21 3.76 -24.92 11.76 -13.39 -0.86 7.67 -7.61 >
INFO: ngram_search_fwdtree.c(1553): 369 words recognized (4/fr)
INFO: ngram_search_fwdtree.c(1555): 12360 senones evaluated (129/fr)
INFO: ngram_search_fwdtree.c(1559): 6138 channels searched (63/fr), 942 1st, 4592 last
INFO: ngram_search_fwdtree.c(1562): 723 words for which last channels evaluated (7/fr)
INFO: ngram_search_fwdtree.c(1564): 132 candidate words for entering last phone (1/fr)
INFO: ngram_search_fwdtree.c(1567): fwdtree 0.05 CPU 0.049 xRT
INFO: ngram_search_fwdtree.c(1570): fwdtree 0.04 wall 0.047 xRT
INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 5 words
INFO: ngram_search_fwdflat.c(948): 348 words recognized (4/fr)
INFO: ngram_search_fwdflat.c(950): 11123 senones evaluated (116/fr)
INFO: ngram_search_fwdflat.c(952): 5789 channels searched (60/fr)
INFO: ngram_search_fwdflat.c(954): 1051 words searched (10/fr)
INFO: ngram_search_fwdflat.c(957): 192 word transitions (2/fr)
INFO: ngram_search_fwdflat.c(960): fwdflat 0.02 CPU 0.016 xRT
INFO: ngram_search_fwdflat.c(963): fwdflat 0.01 wall 0.016 xRT
INFO: ngram_search.c(1253): lattice start node .0 end node .71
INFO: ngram_search.c(1279): Eliminated 1 nodes before end node
INFO: ngram_search.c(1384): Lattice has 73 nodes, 24 links
INFO: ps_lattice.c(1380): Bestpath score: -2982
INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(:71:94) = -216221
INFO: ps_lattice.c(1441): Joint P(O,S) = -222560 P(S|O) = -6339
INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
INFO: ngram_search.c(878): bestpath 0.00 wall 0.000 xRT
INFO: cmn_prior.c(131): cmn_prior_update: from < 48.91 24.86 -33.06 36.03 -18.79 -2.21 3.76 -24.92 11.76 -13.39 -0.86 7.67 -7.61 >
INFO: cmn_prior.c(149): cmn_prior_update: to < 46.48 22.58 -31.30 35.09 -18.85 -2.14 3.31 -23.77 10.67 -12.62 -1.45 7.33 -7.58 >
INFO: ngram_search_fwdtree.c(1553): 340 words recognized (3/fr)
INFO: ngram_search_fwdtree.c(1555): 11470 senones evaluated (117/fr)
INFO: ngram_search_fwdtree.c(1559): 5543 channels searched (56/fr), 1128 1st, 3736 last
INFO: ngram_search_fwdtree.c(1562): 745 words for which last channels evaluated (7/fr)
INFO: ngram_search_fwdtree.c(1564): 139 candidate words for entering last phone (1/fr)
INFO: ngram_search_fwdtree.c(1567): fwdtree 0.05 CPU 0.048 xRT
INFO: ngram_search_fwdtree.c(1570): fwdtree 0.04 wall 0.041 xRT
INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 4 words
INFO: ngram_search_fwdflat.c(948): 403 words recognized (4/fr)
INFO: ngram_search_fwdflat.c(950): 8249 senones evaluated (84/fr)
INFO: ngram_search_fwdflat.c(952): 3816 channels searched (38/fr)
INFO: ngram_search_fwdflat.c(954): 991 words searched (10/fr)
INFO: ngram_search_fwdflat.c(957): 196 word transitions (2/fr)
INFO: ngram_search_fwdflat.c(960): fwdflat 0.02 CPU 0.016 xRT
INFO: ngram_search_fwdflat.c(963): fwdflat 0.02 wall 0.015 xRT
INFO: ngram_search.c(1253): lattice start node .0 end node .41
INFO: ngram_search.c(1279): Eliminated 0 nodes before end node
INFO: ngram_search.c(1384): Lattice has 59 nodes, 8 links
INFO: ps_lattice.c(1380): Bestpath score: -1566
INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(:41:96) = -177960
INFO: ps_lattice.c(1441): Joint P(O,S) = -183524 P(S|O) = -5564
INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
INFO: ngram_search.c(878): bestpath 0.00 wall 0.000 xRT
INFO: cmn_prior.c(99): cmn_prior_update: from < 46.48 22.58 -31.30 35.09 -18.8three
four
5 -2.14 3.31 -23.77 10.67 -12.62 -1.45 7.33 -7.58 >
INFO: cmn_prior.c(116): cmn_prior_update: to < 46.94 23.01 -30.93 35.85 -18.36 -2.42 3.12 -24.38 10.31 -12.80 -1.62 7.28 -7.63 >
INFO: cmn_prior.c(131): cmn_prior_update: from < 46.94 23.01 -30.93 35.85 -18.36 -2.42 3.12 -24.38 10.31 -12.80 -1.62 7.28 -7.63 >
INFO: cmn_prior.c(149): cmn_prior_update: to < 45.85 22.21 -30.51 36.61 -18.34 -1.92 3.50 -24.75 10.08 -12.76 -2.12 7.26 -7.53 >
INFO: ngram_search_fwdtree.c(1553): 443 words recognized (5/fr)
INFO: ngram_search_fwdtree.c(1555): 14118 senones evaluated (147/fr)
INFO: ngram_search_fwdtree.c(1559): 7329 channels searched (76/fr), 1033 1st, 5480 last
INFO: ngram_search_fwdtree.c(1562): 803 words for which last channels evaluated (8/fr)
INFO: ngram_search_fwdtree.c(1564): 207 candidate words for entering last phone (2/fr)
INFO: ngram_search_fwdtree.c(1567): fwdtree 0.05 CPU 0.049 xRT
INFO: ngram_search_fwdtree.c(1570): fwdtree 0.05 wall 0.052 xRT
INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 7 words
INFO: ngram_search_fwdflat.c(948): 448 words recognized (5/fr)
INFO: ngram_search_fwdflat.c(950): 14415 senones evaluated (150/fr)
INFO: ngram_search_fwdflat.c(952): 7999 channels searched (83/fr)
INFO: ngram_search_fwdflat.c(954): 1162 words searched (12/fr)
INFO: ngram_search_fwdflat.c(957): 276 word transitions (2/fr)
INFO: ngram_search_fwdflat.c(960): fwdflat 0.02 CPU 0.016 xRT
INFO: ngram_search_fwdflat.c(963): fwdflat 0.03 wall 0.026 xRT
INFO: ngram_search.c(1253): lattice start node .0 end node .70
INFO: ngram_search.c(1279): Eliminated 0 nodes before end node
INFO: ngram_search.c(1384): Lattice has 97 nodes, 85 links
INFO: ps_lattice.c(1380): Bestpath score: -3291
INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(:70:94) = -234978
INFO: ps_lattice.c(1441): Joint P(O,S) = -246879 P(S|O) = -11901
INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
INFO: ngram_search.c(878): bestpath 0.00 wall 0.000 xRT
INFO: cmn_prior.c(131): cmn_prior_update: from < 45.85 22.21 -30.51 36.61 -18.34 -1.92 3.50 -24.75 10.08 -12.76 -2.12 7.26 -7.53 >
INFO: cmn_prior.c(149): cmn_prior_update: to < 44.59 22.35 -29.73 33.42 -18.83 -1.85 3.43 -23.40 10.23 -12.11 -1.93 6.88 -7.91 >
INFO: ngram_search_fwdtree.c(1553): 384 words recognized (4/fr)
INFO: ngram_search_fwdtree.c(1555): 13300 senones evaluated (130/fr)
INFO: ngram_search_fwdtree.c(1559): 6294 channels searched (61/fr), 1075 1st, 4590 last
INFO: ngram_search_fwdtree.c(1562): 741 words for which last channels evaluated (7/fr)
INFO: ngram_search_fwdtree.c(1564): 161 candidate words for entering last phone (1/fr)
INFO: ngram_search_fwdtree.c(1567): fwdtree 0.05 CPU 0.046 xRT
INFO: ngram_search_fwdtree.c(1570): fwdtree 0.04 wall 0.044 xRT
INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 6 words
INFO: ngram_search_fwdflat.c(948): 418 words recognized (4/fr)
INFO: ngram_search_fwdflat.c(950): 12487 senones evaluated (122/fr)
INFO: ngram_search_fwdflat.c(952): 5910 channels searched (57/fr)
INFO: ngram_search_fwdflat.c(954): 1100 words searched (10/fr)
INFO: ngram_search_fwdflat.c(957): 246 word transitions (2/fr)
INFO: ngram_search_fwdflat.c(960): fwdflat 0.03 CPU 0.031 xRT
INFO: ngram_search_fwdflat.c(963): fwdflat 0.02 wall 0.020 xRT
INFO: ngram_search.c(1253): lattice start node .0 end node .63
INFO: ngram_search.c(1279): Eliminated 0 nodes before end node
INFO: ngram_search.c(1384): Lattice has 115 nodes, 264 links
INFO: ps_lattice.c(1380): Bestpath score: -2326
INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(:63:100) = -155842
INFO: ps_lattice.c(1441): Joint P(O,S) = -176621 P(S|O) = -20779
INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
INFO: ngram_search.c(878): bestpath 0.00 wall 0.000 xRT
INFO: cmn_prior.c(131): cmn_prior_update: from < 44.59 22.35 -29.73 33.42 -18.83 -1.85 3.43 -23.40 10.23 -12.11 -1.93 6.88 -7.91 >
INFO: cmn_prior.c(149): cmn_prior_update: to < 42.50 20.30 -28.31 32.41 -18.73 -2.11 3.00 eight
eight
-22.53 9.16 -11.35 -2.47 6.63 -7.97 >
INFO: ngram_search_fwdtree.c(1553): 342 words recognized (3/fr)
INFO: ngram_search_fwdtree.c(1555): 10871 senones evaluated (111/fr)
INFO: ngram_search_fwdtree.c(1559): 5153 channels searched (52/fr), 1120 1st, 3290 last
INFO: ngram_search_fwdtree.c(1562): 731 words for which last channels evaluated (7/fr)
INFO: ngram_search_fwdtree.c(1564): 121 candidate words for entering last phone (1/fr)
INFO: ngram_search_fwdtree.c(1567): fwdtree 0.03 CPU 0.032 xRT
INFO: ngram_search_fwdtree.c(1570): fwdtree 0.04 wall 0.041 xRT
INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 4 words
INFO: ngram_search_fwdflat.c(948): 407 words recognized (4/fr)
INFO: ngram_search_fwdflat.c(950): 8674 senones evaluated (89/fr)
INFO: ngram_search_fwdflat.c(952): 4033 channels searched (41/fr)
INFO: ngram_search_fwdflat.c(954): 994 words searched (10/fr)
INFO: ngram_search_fwdflat.c(957): 197 word transitions (2/fr)
INFO: ngram_search_fwdflat.c(960): fwdflat 0.02 CPU 0.016 xRT
INFO: ngram_search_fwdflat.c(963): fwdflat 0.01 wall 0.015 xRT
INFO: ngram_search.c(1253): lattice start node .0 end node .41
INFO: ngram_search.c(1279): Eliminated 0 nodes before end node
INFO: ngram_search.c(1384): Lattice has 58 nodes, 10 links
INFO: ps_lattice.c(1380): Bestpath score: -1667
INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(:41:96) = -178631
INFO: ps_lattice.c(1441): Joint P(O,S) = -186341 P(S|O) = -7710
INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
INFO: ngram_search.c(878): bestpath 0.00 wall 0.000 xRT
INFO: cmn_prior.c(99): cmn_prior_update: from < 42.50 20.30 -28.31 32.41 -18.73 -2.11 3.00 -22.53 9.16 -11.35 -2.47 6.63 -7.97 >
INFO: cmn_prior.c(116): cmn_prior_update: to < 43.99 21.56 -29.15 33.13 -18.80 -2.60 3.12 -22.33 10.44 -11.57 -2.49 6.94 -8.30 >
INFO: cmn_prior.c(131): cmn_prior_update: from < 43.99 21.56 -29.15 33.13 -18.80 -2.60 3.12 -22.33 10.44 -11.57 -2.49 6.94 -8.30 >
INFO: cmn_prior.c(149): cmn_prior_update: to < 44.32 22.38 -28.86 34.02 -18.96 -2.53 3.59 -23.32 9.57 -11.50 -2.32 7.94 -7.24 >
INFO: ngram_search_fwdtree.c(1553): 1814 words recognized (8/fr)
INFO: ngram_search_fwdtree.c(1555): 50657 senones evaluated (237/fr)
INFO: ngram_search_fwdtree.c(1559): 33753 channels searched (157/fr), 2508 1st, 28418 last
INFO: ngram_search_fwdtree.c(1562): 2472 words for which last channels evaluated (11/fr)
INFO: ngram_search_fwdtree.c(1564): 1040 candidate words for entering last phone (4/fr)
INFO: ngram_search_fwdtree.c(1567): fwdtree 0.14 CPU 0.066 xRT
INFO: ngram_search_fwdtree.c(1570): fwdtree 0.15 wall 0.068 xRT
INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 13 words
INFO: ngram_search_fwdflat.c(948): 1716 words recognized (8/fr)
INFO: ngram_search_fwdflat.c(950): 53713 senones evaluated (251/fr)
INFO: ngram_search_fwdflat.c(952): 40212 channels searched (187/fr)
INFO: ngram_search_fwdflat.c(954): 3563 words searched (16/fr)
INFO: ngram_search_fwdflat.c(957): 1289 word transitions (6/fr)
INFO: ngram_search_fwdflat.c(960): fwdflat 0.09 CPU 0.044 xRT
INFO: ngram_search_fwdflat.c(963): fwdflat 0.10 wall 0.044 xRT
INFO: ngram_search.c(1253): lattice start node .0 end node .180
INFO: ngram_search.c(1279): Eliminated 1 nodes before end node
INFO: ngram_search.c(1384): Lattice has 325 nodes, 945 links
INFO: ps_lattice.c(1380): Bestpath score: -10136
INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(:180:212) = -541776
INFO: ps_lattice.c(1441): Joint P(O,S) = -597424 P(S|O) = -55648
INFO: ngram_search.c(875): bestpath 0.02 CPU 0.007 xRT
INFO: ngram_search.c(878): bestpath 0.00 wall 0.002 xRT
INFO: cmn_prior.c(131): cmn_prior_update: from < 44.32 22.38 -28.86 34.02 -18.96 -2.53 3.59 -23.32 9.57 -11.50 -2.32 7.94 -7.24 >
INFO: cmn_prior.c(149): cmn_prior_update: to < 42.66 20.57 -29.23 32.94 -18.92 -3.13 2.74 -22.33 9.12 -11.14 -2.20 7.91 -7.24 >
INFO: ngram_search_fwdtree.c(1553): 709 words recognized (6/fr)
INFO: ngram_search_fwdtree.c(1555eight
three
): 18161 senones evaluated (161/fr)
INFO: ngram_search_fwdtree.c(1559): 9526 channels searched (84/fr), 1308 1st, 6976 last
INFO: ngram_search_fwdtree.c(1562): 1051 words for which last channels evaluated (9/fr)
INFO: ngram_search_fwdtree.c(1564): 238 candidate words for entering last phone (2/fr)
INFO: ngram_search_fwdtree.c(1567): fwdtree 0.05 CPU 0.041 xRT
INFO: ngram_search_fwdtree.c(1570): fwdtree 0.06 wall 0.049 xRT
INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 9 words
INFO: ngram_search_fwdflat.c(948): 699 words recognized (6/fr)
INFO: ngram_search_fwdflat.c(950): 19802 senones evaluated (175/fr)
INFO: ngram_search_fwdflat.c(952): 11755 channels searched (104/fr)
INFO: ngram_search_fwdflat.c(954): 1563 words searched (13/fr)
INFO: ngram_search_fwdflat.c(957): 461 word transitions (4/fr)
INFO: ngram_search_fwdflat.c(960): fwdflat 0.05 CPU 0.041 xRT
INFO: ngram_search_fwdflat.c(963): fwdflat 0.03 wall 0.031 xRT
INFO: ngram_search.c(1253): lattice start node .0 end node .71
INFO: ngram_search.c(1279): Eliminated 1 nodes before end node
INFO: ngram_search.c(1384): Lattice has 215 nodes, 653 links
INFO: ps_lattice.c(1380): Bestpath score: -3081
INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(:71:111) = -147851
INFO: ps_lattice.c(1441): Joint P(O,S) = -219876 P(S|O) = -72025
INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
INFO: ngram_search.c(878): bestpath 0.00 wall 0.000 xRT
INFO: cmn_prior.c(131): cmn_prior_update: from < 42.66 20.57 -29.23 32.94 -18.92 -3.13 2.74 -22.33 9.12 -11.14 -2.20 7.91 -7.24 >
INFO: cmn_prior.c(149): cmn_prior_update: to < 43.14 20.86 -28.69 34.88 -18.52 -2.94 2.83 -23.46 8.86 -11.52 -2.61 7.87 -7.28 >
INFO: ngram_search_fwdtree.c(1553): 540 words recognized (6/fr)
INFO: ngram_search_fwdtree.c(1555): 16354 senones evaluated (172/fr)
INFO: ngram_search_fwdtree.c(1559): 9447 channels searched (99/fr), 991 1st, 7517 last
INFO: ngram_search_fwdtree.c(1562): 841 words for which last channels evaluated (8/fr)
INFO: ngram_search_fwdtree.c(1564): 258 candidate words for entering last phone (2/fr)
INFO: ngram_search_fwdtree.c(1567): fwdtree 0.05 CPU 0.049 xRT
INFO: ngram_search_fwdtree.c(1570): fwdtree 0.06 wall 0.058 xRT
INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 11 words
INFO: ngram_search_fwdflat.c(948): 503 words recognized (5/fr)
INFO: ngram_search_fwdflat.c(950): 18180 senones evaluated (191/fr)
INFO: ngram_search_fwdflat.c(952): 10849 channels searched (114/fr)
INFO: ngram_search_fwdflat.c(954): 1314 words searched (13/fr)
INFO: ngram_search_fwdflat.c(957): 419 word transitions (4/fr)
INFO: ngram_search_fwdflat.c(960): fwdflat 0.03 CPU 0.033 xRT
INFO: ngram_search_fwdflat.c(963): fwdflat 0.03 wall 0.032 xRT
INFO: ngram_search.c(1253): lattice start node .0 end node .72
INFO: ngram_search.c(1279): Eliminated 1 nodes before end node
INFO: ngram_search.c(1384): Lattice has 127 nodes, 240 links
INFO: ps_lattice.c(1380): Bestpath score: -3670
INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(:72:93) = -232142
INFO: ps_lattice.c(1441): Joint P(O,S) = -259833 P(S|O) = -27691
INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
INFO: ngram_search.c(878): bestpath 0.00 wall 0.000 xRT
INFO: cmn_prior.c(99): cmn_prior_update: from < 43.14 20.86 -28.69 34.88 -18.52 -2.94 2.83 -23.46 8.86 -11.52 -2.61 7.87 -7.28 >
INFO: cmn_prior.c(116): cmn_prior_update: to < 42.94 20.59 -28.46 34.80 -18.73 -3.08 2.81 -23.33 8.81 -11.40 -2.58 7.76 -7.23 >
INFO: cmn_prior.c(131): cmn_prior_update: from < 42.94 20.59 -28.46 34.80 -18.73 -3.08 2.81 -23.33 8.81 -11.40 -2.58 7.76 -7.23 >
INFO: cmn_prior.c(149): cmn_prior_update: to < 40.78 18.59 -27.18 33.79 -18.68 -2.83 2.36 -22.40 7.99 -10.87 -3.14 7.41 -7.32 >
INFO: ngram_search_fwdtree.c(1553): 352 words recognized (4/fr)
INFO: ngram_search_fwdtree.c(1555): 11636 senones evaluated (119/fr)
INFO: ngram_search_fwdtree.c(1559): 5600 channels searched (57/fr), 1128 1st, 3717 last
INFO: ngram_seight
three
earch_fwdtree.c(1562): 753 words for which last channels evaluated (7/fr)
INFO: ngram_search_fwdtree.c(1564): 145 candidate words for entering last phone (1/fr)
INFO: ngram_search_fwdtree.c(1567): fwdtree 0.05 CPU 0.048 xRT
INFO: ngram_search_fwdtree.c(1570): fwdtree 0.04 wall 0.046 xRT
INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 5 words
INFO: ngram_search_fwdflat.c(948): 446 words recognized (5/fr)
INFO: ngram_search_fwdflat.c(950): 10728 senones evaluated (109/fr)
INFO: ngram_search_fwdflat.c(952): 5171 channels searched (52/fr)
INFO: ngram_search_fwdflat.c(954): 1080 words searched (11/fr)
INFO: ngram_search_fwdflat.c(957): 263 word transitions (2/fr)
INFO: ngram_search_fwdflat.c(960): fwdflat 0.02 CPU 0.016 xRT
INFO: ngram_search_fwdflat.c(963): fwdflat 0.01 wall 0.015 xRT
INFO: ngram_search.c(1253): lattice start node .0 end node .54
INFO: ngram_search.c(1279): Eliminated 1 nodes before end node
INFO: ngram_search.c(1384): Lattice has 85 nodes, 66 links
INFO: ps_lattice.c(1380): Bestpath score: -2391
INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(:54:96) = -174789
INFO: ps_lattice.c(1441): Joint P(O,S) = -194174 P(S|O) = -19385
INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
INFO: ngram_search.c(878): bestpath 0.00 wall 0.000 xRT
INFO: cmn_prior.c(131): cmn_prior_update: from < 40.78 18.59 -27.18 33.79 -18.68 -2.83 2.36 -22.40 7.99 -10.87 -3.14 7.41 -7.32 >
INFO: cmn_prior.c(149): cmn_prior_update: to < 41.35 18.96 -26.87 35.72 -18.15 -2.71 2.54 -23.72 7.68 -11.25 -3.48 7.39 -7.41 >
INFO: ngram_search_fwdtree.c(1553): 563 words recognized (6/fr)
INFO: ngram_search_fwdtree.c(1555): 16206 senones evaluated (169/fr)
INFO: ngram_search_fwdtree.c(1559): 8464 channels searched (88/fr), 1086 1st, 6375 last
INFO: ngram_search_fwdtree.c(1562): 880 words for which last channels evaluated (9/fr)
INFO: ngram_search_fwdtree.c(1564): 228 candidate words for entering last phone (2/fr)
INFO: ngram_search_fwdtree.c(1567): fwdtree 0.05 CPU 0.049 xRT
INFO: ngram_search_fwdtree.c(1570): fwdtree 0.06 wall 0.057 xRT
INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 5 words
INFO: ngram_search_fwdflat.c(948): 622 words recognized (6/fr)
INFO: ngram_search_fwdflat.c(950): 14505 senones evaluated (151/fr)
INFO: ngram_search_fwdflat.c(952): 7629 channels searched (79/fr)
INFO: ngram_search_fwdflat.c(954): 1101 words searched (11/fr)
INFO: ngram_search_fwdflat.c(957): 271 word transitions (2/fr)
INFO: ngram_search_fwdflat.c(960): fwdflat 0.03 CPU 0.033 xRT
INFO: ngram_search_fwdflat.c(963): fwdflat 0.02 wall 0.021 xRT
INFO: ngram_search.c(1253): lattice start node .0 end node .70
INFO: ngram_search.c(1279): Eliminated 1 nodes before end node
INFO: ngram_search.c(1384): Lattice has 162 nodes, 295 links
INFO: ps_lattice.c(1380): Bestpath score: -3788
INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(:70:94) = -256022
INFO: ps_lattice.c(1441): Joint P(O,S) = -269561 P(S|O) = -13539
INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
INFO: ngram_search.c(878): bestpath 0.00 wall 0.000 xRT
INFO: cmn_prior.c(131): cmn_prior_update: from < 41.35 18.96 -26.87 35.72 -18.15 -2.71 2.54 -23.72 7.68 -11.25 -3.48 7.39 -7.41 >
INFO: cmn_prior.c(149): cmn_prior_update: to < 40.55 19.13 -26.39 32.57 -18.31 -2.65 2.48 -22.39 7.99 -10.80 -3.10 7.16 -7.81 >
INFO: ngram_search_fwdtree.c(1553): 387 words recognized (4/fr)
INFO: ngram_search_fwdtree.c(1555): 13699 senones evaluated (134/fr)
INFO: ngram_search_fwdtree.c(1559): 6805 channels searched (66/fr), 1055 1st, 5047 last
INFO: ngram_search_fwdtree.c(1562): 756 words for which last channels evaluated (7/fr)
INFO: ngram_search_fwdtree.c(1564): 189 candidate words for entering last phone (1/fr)
INFO: ngram_search_fwdtree.c(1567): fwdtree 0.05 CPU 0.046 xRT
INFO: ngram_search_fwdtree.c(1570): fwdtree 0.05 wall 0.049 xRT
INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 8 words
INFO: ngram_search_fwdflat.c(948): 447four
eight
words recognized (4/fr)
INFO: ngram_search_fwdflat.c(950): 14119 senones evaluated (138/fr)
INFO: ngram_search_fwdflat.c(952): 8974 channels searched (87/fr)
INFO: ngram_search_fwdflat.c(954): 1266 words searched (12/fr)
INFO: ngram_search_fwdflat.c(957): 353 word transitions (3/fr)
INFO: ngram_search_fwdflat.c(960): fwdflat 0.02 CPU 0.015 xRT
INFO: ngram_search_fwdflat.c(963): fwdflat 0.02 wall 0.020 xRT
INFO: ngram_search.c(1253): lattice start node .0 end node .63
INFO: ngram_search.c(1279): Eliminated 0 nodes before end node
INFO: ngram_search.c(1384): Lattice has 127 nodes, 151 links
INFO: ps_lattice.c(1380): Bestpath score: -2487
INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(:63:100) = -185586
INFO: ps_lattice.c(1441): Joint P(O,S) = -201402 P(S|O) = -15816
INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
INFO: ngram_search.c(878): bestpath 0.00 wall 0.000 xRT
INFO: cmn_prior.c(99): cmn_prior_update: from < 40.55 19.13 -26.39 32.57 -18.31 -2.65 2.48 -22.39 7.99 -10.80 -3.10 7.16 -7.81 >
INFO: cmn_prior.c(116): cmn_prior_update: to < 40.62 19.30 -26.14 33.21 -18.70 -2.81 2.68 -22.94 8.20 -10.90 -3.15 6.94 -7.94 >
INFO: cmn_prior.c(131): cmn_prior_update: from < 40.62 19.30 -26.14 33.21 -18.70 -2.81 2.68 -22.94 8.20 -10.90 -3.15 6.94 -7.94 >
INFO: cmn_prior.c(149): cmn_prior_update: to < 38.34 16.92 -25.36 30.96 -18.14 -2.58 1.91 -21.13 7.08 -9.87 -3.41 6.60 -7.72 >
INFO: ngram_search_fwdtree.c(1553): 359 words recognized (4/fr)
INFO: ngram_search_fwdtree.c(1555): 11463 senones evaluated (117/fr)
INFO: ngram_search_fwdtree.c(1559): 5454 channels searched (55/fr), 1118 1st, 3560 last
INFO: ngram_search_fwdtree.c(1562): 736 words for which last channels evaluated (7/fr)
INFO: ngram_search_fwdtree.c(1564): 141 candidate words for entering last phone (1/fr)
INFO: ngram_search_fwdtree.c(1567): fwdtree 0.05 CPU 0.048 xRT
INFO: ngram_search_fwdtree.c(1570): fwdtree 0.05 wall 0.046 xRT
INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 4 words
INFO: ngram_search_fwdflat.c(948): 448 words recognized (5/fr)
INFO: ngram_search_fwdflat.c(950): 8433 senones evaluated (86/fr)
INFO: ngram_search_fwdflat.c(952): 3914 channels searched (39/fr)
INFO: ngram_search_fwdflat.c(954): 990 words searched (10/fr)
INFO: ngram_search_fwdflat.c(957): 196 word transitions (2/fr)
INFO: ngram_search_fwdflat.c(960): fwdflat 0.02 CPU 0.016 xRT
INFO: ngram_search_fwdflat.c(963): fwdflat 0.01 wall 0.010 xRT
INFO: ngram_search.c(1253): lattice start node .0 end node .41
INFO: ngram_search.c(1279): Eliminated 1 nodes before end node
INFO: ngram_search.c(1384): Lattice has 85 nodes, 64 links
INFO: ps_lattice.c(1380): Bestpath score: -1706
INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(:41:96) = -167245
INFO: ps_lattice.c(1441): Joint P(O,S) = -200318 P(S|O) = -33073
INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
INFO: ngram_search.c(878): bestpath 0.00 wall 0.000 xRT
INFO: cmn_prior.c(131): cmn_prior_update: from < 38.34 16.92 -25.36 30.96 -18.14 -2.58 1.91 -21.13 7.08 -9.87 -3.41 6.60 -7.72 >
INFO: cmn_prior.c(149): cmn_prior_update: to < 39.15 17.59 -27.05 29.90 -18.46 -2.14 2.83 -21.15 8.23 -9.13 -3.93 6.50 -7.13 >
INFO: ngram_search_fwdtree.c(1553): 556 words recognized (7/fr)
INFO: ngram_search_fwdtree.c(1555): 15221 senones evaluated (179/fr)
INFO: ngram_search_fwdtree.c(1559): 8953 channels searched (105/fr), 972 1st, 7144 last
INFO: ngram_search_fwdtree.c(1562): 844 words for which last channels evaluated (9/fr)
INFO: ngram_search_fwdtree.c(1564): 250 candidate words for entering last phone (2/fr)
INFO: ngram_search_fwdtree.c(1567): fwdtree 0.05 CPU 0.055 xRT
INFO: ngram_search_fwdtree.c(1570): fwdtree 0.05 wall 0.059 xRT
INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 8 words
INFO: ngram_search_fwdflat.c(948): 523 words recognized (6/fr)
INFO: ngram_search_fwdflat.c(950): 15344 senones evaluated (181/fr)
INFO: ngram_search_fwdflat.c(952): 10863 channels seanine
rched (127/fr)
INFO: ngram_search_fwdflat.c(954): 1177 words searched (13/fr)
INFO: ngram_search_fwdflat.c(957): 346 word transitions (4/fr)
INFO: ngram_search_fwdflat.c(960): fwdflat 0.03 CPU 0.037 xRT
INFO: ngram_search_fwdflat.c(963): fwdflat 0.03 wall 0.035 xRT
INFO: ngram_search.c(1253): lattice start node .0 end node .60
INFO: ngram_search.c(1279): Eliminated 1 nodes before end node
INFO: ngram_search.c(1384): Lattice has 134 nodes, 336 links
INFO: ps_lattice.c(1380): Bestpath score: -2644
INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(:60:83) = -151454
INFO: ps_lattice.c(1441): Joint P(O,S) = -210101 P(S|O) = -58647
INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
INFO: ngram_search.c(878): bestpath 0.00 wall 0.000 xRT
INFO: ngram_search_fwdtree.c(432): TOTAL fwdtree 1.70 CPU 0.058 xRT
INFO: ngram_search_fwdtree.c(435): TOTAL fwdtree 1.72 wall 0.059 xRT
INFO: ngram_search_fwdflat.c(176): TOTAL fwdflat 1.08 CPU 0.037 xRT
INFO: ngram_search_fwdflat.c(179): TOTAL fwdflat 1.05 wall 0.036 xRT
INFO: ngram_search.c(303): TOTAL bestpath 0.02 CPU 0.001 xRT
INFO: ngram_search.c(306): TOTAL bestpath 0.02 wall 0.001 xRT
The output at this point:
eight two
eight two two one
nine three one two one two one three
three
eight
three
eight
three
four
eight
eight
eight
three
eight
three
four
eight
nine
When the output should actually be:
eight two
eight two two one
nine three one two one two one three
three six
three
eight
three
four
eight
eight six
three
eight
three
four
eight
nine
I have no idea, how or why, it's confusing the "eight" with the "six", I should mention that as you could see on the dictionary I tried to add more references into the six, just in case that pronounciation wasn't correct, including similar sounds, however, I'm still getting the same result.
I understand that the basic for a good accuracy are basically 3 parameters:
As my knowledge on this field is quite limited, I was only able to remove a lot of words from the original dictionary, creating a smaller one. Doing this I was able to recognize most of the numbers, all of them actually, with exeption of the six. I also downloaded and tested different hmm (models?) and lm files (.bin), but didn't change too much...
Any help would be really appreciated!
Kind regards,
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
The feat.params file that I'm using has the -nfilt value originally defined in 25. Now, I realized, that if I decrease the value, to 5 for example...
Almost every single number becomes "eight":
one two
one one one one one one
eight
eight
eight
eight
eight
eight
eight
eight
eight
eight
I imagine that this is somehow related with the problem, so I'm trying to increase the value and see if I can fix this. The manual states that nfilt is "The number of filter banks", I'm trying to comprehend what exactly does a filter bank...
Help would be more than welcome! I hope this post helps someone else as well!
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
1) Don't touch paramters you do not understand. You should never change nfilt
2) Don't modify the dictionary, modify language model instead as described in our tutorial
3) If you need help on the accuracy provide data files (recordings) to reproduce your problems.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
In point 3) I never wrote you need many samples. Actually langauge model training does not require you to provide samples, you can just list the words you expect to see in a grammar. For more details please read the tutorial
Hello Nickolay, thanks a lot for your help so far.
I've been working on this like a maniac, but I'm pretty sure that I'm missing something because I'm not being able to recognize a few digits.
I tried the grammar file as you suggested:
JSGF V1.0;
grammar numbers;
public <number> = zero | one | two | three | four | five | six | seven | eight | nine ;</number>
I tried also reducing the dictionary as I mentioned, as the only thing I need are numbers:
zero Z IH R OW
zero(2) Z IY R OW
one W AH N
one(2) HH W AH N
two T UW
three TH R IY
four F AO R
five F AY V
six S F IH K S
seven S EH V AH N
eight EY T
nine N AY N
And finally I also downloaded an uncompressed (non-bin) .lm file and modified it to fit:
This is an ARPA-format language model file, generated by CMU Sphinx
\data\
ngram 1=12
\1-grams:
-1.4016 -0.4051
-99.0000 1.4124
-3.8842 zero -0.4376
-2.4976 one -0.4239
-2.7508 two -0.2802
-2.9571 three -0.3222
-3.0738 four -0.3780
-2.9962 five -0.4140
-3.1903 six -0.3997
-3.2768 seven -0.3833
-3.3046 eight -0.3988
-3.4077 nine -0.3758
\end\
Before modifying anything at all, I tried the different components directly out of the box, with very similar accuracy as well.
**But the problem persists! **
4 different .wav files pronouncing clearly "475125" (...four... ...seven... ...five... ...one... ...two... ...five), for some reason, it heards "175125" (the first 4 is proccessed as 1).
A second group of audio files, where the number dictated is "542757", is recognized as "512757", once again, everything looks normal but the 4 is proccessed as 1.
I really have no idea where else I could check..
Models:
I'm using "cmusphinx-en-us-5.2".
I'm seriously without a clue, walking on air at this point...
Hello CMU Sphinx users!
I've been working with the stand-alone pre-compiled version of pocketsphinx for the last 48 hours. I found the technology really interesting and I'm just practicing a little bit. At this point, I've a reduced dictionary, as the only thing I have to detect are numbers.
However for some reason, it's actually properly detecting all the numbers, but not the "six".
Command I'm running:
./pocketsphinx_continuous.exe -infile files/01_16.wav -hmm model/en-us/en-us -lm model/en-us/en-us.lm.bin -dict model/en-us/micro.dict -logfn record.txt -lw 1
The micro.dict file
Verbose / logs:
$ ./pocketsphinx_continuous.exe -infile files/01_16.wav -hmm model/en-us/en-us -lm model/en-us/en-us.lm.bin -dict model/en-us/micro.dict -lw 1 -debug 2
The output at this point:
When the output should actually be:
I have no idea, how or why, it's confusing the "eight" with the "six", I should mention that as you could see on the dictionary I tried to add more references into the six, just in case that pronounciation wasn't correct, including similar sounds, however, I'm still getting the same result.
I understand that the basic for a good accuracy are basically 3 parameters:
-hmm model/en-us/en-us
-lm model/en-us/en-us.lm.bin
-dict model/en-us/micro.dict
As my knowledge on this field is quite limited, I was only able to remove a lot of words from the original dictionary, creating a smaller one. Doing this I was able to recognize most of the numbers, all of them actually, with exeption of the six. I also downloaded and tested different hmm (models?) and lm files (.bin), but didn't change too much...
Any help would be really appreciated!
Kind regards,
03:13 AM:
The feat.params file that I'm using has the -nfilt value originally defined in 25. Now, I realized, that if I decrease the value, to 5 for example...
Almost every single number becomes "eight":
I imagine that this is somehow related with the problem, so I'm trying to increase the value and see if I can fix this. The manual states that nfilt is "The number of filter banks", I'm trying to comprehend what exactly does a filter bank...
Help would be more than welcome! I hope this post helps someone else as well!
Hello Chris
1) Don't touch paramters you do not understand. You should never change nfilt
2) Don't modify the dictionary, modify language model instead as described in our tutorial
3) If you need help on the accuracy provide data files (recordings) to reproduce your problems.
Hello Nickolay, Thanks a lot for the quick feedback!
Regarding the point number (3) described in your approach, is there any way to prepare the language model without many samples?
Hello Chris
In point 3) I never wrote you need many samples. Actually langauge model training does not require you to provide samples, you can just list the words you expect to see in a grammar. For more details please read the tutorial
http://cmusphinx.sourceforge.net/wiki/tutoriallm
Hello Nickolay, thanks a lot for your help so far.
I've been working on this like a maniac, but I'm pretty sure that I'm missing something because I'm not being able to recognize a few digits.
I tried the grammar file as you suggested:
I tried also reducing the dictionary as I mentioned, as the only thing I need are numbers:
And finally I also downloaded an uncompressed (non-bin) .lm file and modified it to fit:
Before modifying anything at all, I tried the different components directly out of the box, with very similar accuracy as well.
**But the problem persists! **
4 different .wav files pronouncing clearly "475125" (...four... ...seven... ...five... ...one... ...two... ...five), for some reason, it heards "175125" (the first 4 is proccessed as 1).
A second group of audio files, where the number dictated is "542757", is recognized as "512757", once again, everything looks normal but the 4 is proccessed as 1.
I really have no idea where else I could check..
Models:
I'm using "cmusphinx-en-us-5.2".
I'm seriously without a clue, walking on air at this point...
Sox conversion proccess:
Sphinx exec:
I'm adding some sample files... hope someone has some ideas!
1,000 Thanks!Chris C.
Last edit: Chris C. Russo 2016-03-25
Ok, I found a really nice one:
If I run from mic:
sphinxc.exe -inmic yes -hmm cmusphinx-en-us-5.2 -lm en-us.lm -dict micro.dict -jsgf grammar.txt -logfn record.txt -lw 1 -remove_dc yes -remove_noise yes -vad_threshold 2.4 -vad_prespeech 19 -vad_postspeech 37 -silprob 3.5
Place mic near speakers and play wav on any audio player. Works flawlessly. WER 0%
If I run from file:
sphinxc.exe -infile conversion_2033910124_1458812700.wav -hmm cmusphinx-en-us-5.2 -lm en-us.lm -dict micro.dict -jsgf grammar.txt -logfn record.txt -lw 1 -remove_dc yes -remove_noise yes -vad_threshold 2.4 -vad_prespeech 19 -vad_postspeech 37 -silprob 3.5
4 becomes 1.
Same command, same file, from mic works (playback from VLC / Winamp), using sphinx, fails.