Menu

decoding model

Help
2015-11-20
2015-11-22
  • Lorena Dutra

    Lorena Dutra - 2015-11-20

    Hello, I'm decoding my model trained with SphinxTrain and got these results:

    INFO: batch.c (777): TOTAL 4648.31 seconds speech, 83.07 seconds CPU, 83.12 seconds wall
    INFO: batch.c (779): AVERAGE 0:02 XRT (CPU), 0:02 XRT (elapsed)
    INFO: ngram_search_fwdtree.c (432): TOTAL 76.58 fwdtree CPU 0016 XRT
    INFO: ngram_search_fwdtree.c (435): TOTAL 76.55 fwdtree wall XRT 0016
    INFO: ngram_search_fwdflat.c (176): fwdflat TOTAL 6:37 CPU 0001 XRT
    INFO: ngram_search_fwdflat.c (179): fwdflat TOTAL 6:43 wall XRT 0001
    INFO: ngram_search.c (303): TOTAL bestpath 0:08 CPU 0000 XRT
    INFO: ngram_search.c (306): TOTAL bestpath 00:10 wall XRT 0000
    Thu Nov 19 14:59:26 2015

    How could I calculate the Real Time Factor (RTF)?

     
    • Nickolay V. Shmyrev

      If you correctly copy-paste this line

         INFO: batch.c (779): AVERAGE 0.02 xRT (CPU), 0.02 xRT (elapsed)
      

      That 0.02xRT is the real-time factor.

       
  • Lorena Dutra

    Lorena Dutra - 2015-11-20

    What is the xRT?

    RFT is measured where? Seconds?

     
  • Lorena Dutra

    Lorena Dutra - 2015-11-21

    Nikolay, I did several treatments, changing the parameters and always gave RFT = 0.02 you tell me why?

     
    • Nickolay V. Shmyrev

      you tell me why?

      Sure, as soon as you provide details on what exactly did you try, share the command line options, data files you are using and so on.

       
  • Lorena Dutra

    Lorena Dutra - 2015-11-22

    Well..

    Here is my file db-1-1.log

            INFO: pocketsphinx.c(145): Parsed model-specific feature parameters from /home/lorena/astout/model_parameters/db.cd_cont_200/feat.params
    Current configuration:
    [NAME]          [DEFLT]     [VALUE]
    -agc            none        none
    -agcthresh      2.0     2.000000e+00
    -allphone               
    -allphone_ci        no      no
    -alpha          0.97        9.700000e-01
    -ascale         20.0        2.000000e+01
    -aw         1       1
    -backtrace      no      no
    -beam           1e-48       1.000000e-80
    -bestpath       yes     yes
    -bestpathlw     9.5     1.000000e+01
    -ceplen         13      13
    -cmn            current     current
    -cmninit        8.0     8.0
    -compallsen     no      no
    -debug                  0
    -dict                   /home/lorena/astout/etc/db.dic
    -dictcase       no      no
    -dither         no      no
    -doublebw       no      no
    -ds         1       1
    -fdict                  /home/lorena/astout/model_parameters/db.cd_cont_200/noisedict
    -feat           1s_c_d_dd   1s_c_d_dd
    -featparams             /home/lorena/astout/model_parameters/db.cd_cont_200/feat.params
    -fillprob       1e-8        1.000000e-08
    -frate          100     100
    -fsg                    
    -fsgusealtpron      yes     yes
    -fsgusefiller       yes     yes
    -fwdflat        yes     yes
    -fwdflatbeam        1e-64       1.000000e-80
    -fwdflatefwid       4       4
    -fwdflatlw      8.5     1.000000e+01
    -fwdflatsfwin       25      25
    -fwdflatwbeam       7e-29       1.000000e-40
    -fwdtree        yes     yes
    -hmm                    /home/lorena/astout/model_parameters/db.cd_cont_200
    -input_endian       little      little
    -jsgf                   
    -keyphrase              
    -kws                    
    -kws_delay      10      10
    -kws_plp        1e-1        1.000000e-01
    -kws_threshold      1       1.000000e+00
    -latsize        5000        5000
    -lda                    
    -ldadim         0       0
    -lifter         0       22
    -lm                 /home/lorena/astout/etc/db.lm.DMP
    -lmctl                  
    -lmname                 
    -logbase        1.0001      1.000100e+00
    -logfn                  
    -logspec        no      no
    -lowerf         133.33334   1.300000e+02
    -lpbeam         1e-40       1.000000e-80
    -lponlybeam     7e-29       1.000000e-80
    -lw         6.5     1.000000e+01
    -maxhmmpf       30000       30000
    -maxwpf         -1      -1
    -mdef                   /home/lorena/astout/model_parameters/db.cd_cont_200/mdef
    -mean                   /home/lorena/astout/model_parameters/db.cd_cont_200/means
    -mfclogdir              
    -min_endfr      0       0
    -mixw                   /home/lorena/astout/model_parameters/db.cd_cont_200/mixture_weights
    -mixwfloor      0.0000001   1.000000e-07
    -mllr                   
    -mmap           yes     yes
    -ncep           13      13
    -nfft           512     512
    -nfilt          40      25
    -nwpen          1.0     1.000000e+00
    -pbeam          1e-48       1.000000e-80
    -pip            1.0     1.000000e+00
    -pl_beam        1e-10       1.000000e-10
    -pl_pbeam       1e-10       1.000000e-10
    -pl_pip         1.0     1.000000e+00
    -pl_weight      3.0     3.000000e+00
    -pl_window      5       5
    -rawlogdir              
    -remove_dc      no      no
    -remove_noise       yes     yes
    -remove_silence     yes     yes
    -round_filters      yes     yes
    -samprate       16000       1.600000e+04
    -seed           -1      -1
    -sendump                
    -senlogdir              
    -senmgau                
    -silprob        0.005       5.000000e-03
    -smoothspec     no      no
    -svspec                 
    -tmat                   /home/lorena/astout/model_parameters/db.cd_cont_200/transition_matrices
    -tmatfloor      0.0001      1.000000e-04
    -topn           4       4
    -topn_beam      0       0
    -toprule                
    -transform      legacy      dct
    -unit_area      yes     yes
    -upperf         6855.4976   6.800000e+03
    -uw         1.0     1.000000e+00
    -vad_postspeech     50      50
    -vad_prespeech      20      20
    -vad_startspeech    10      10
    -vad_threshold      2.0     2.000000e+00
    -var                    /home/lorena/astout/model_parameters/db.cd_cont_200/variances
    -varfloor       0.0001      1.000000e-04
    -varnorm        no      no
    -verbose        no      no
    -warp_params                
    -warp_type      inverse_linear  inverse_linear
    -wbeam          7e-29       1.000000e-40
    -wip            0.65        2.000000e-01
    -wlen           0.025625    2.562500e-02
    
    INFO: feat.c(715): Initializing feature stream to type: '1s_c_d_dd', ceplen=13, CMN='current', VARNORM='no', AGC='none'
    INFO: cmn.c(143): mean[0]= 12.00, mean[1..12]= 0.0
    INFO: mdef.c(518): Reading model definition: /home/lorena/astout/model_parameters/db.cd_cont_200/mdef
    INFO: bin_mdef.c(181): Allocating 41005 * 8 bytes (320 KiB) for CD tree
    INFO: tmat.c(206): Reading HMM transition probability matrices: /home/lorena/astout/model_parameters/db.cd_cont_200/transition_matrices
    INFO: acmod.c(117): Attempting to use PTM computation module
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /home/lorena/astout/model_parameters/db.cd_cont_200/means
    INFO: ms_gauden.c(292): 320 codebook, 1 feature, size: 
    INFO: ms_gauden.c(294):  8x39
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /home/lorena/astout/model_parameters/db.cd_cont_200/variances
    INFO: ms_gauden.c(292): 320 codebook, 1 feature, size: 
    INFO: ms_gauden.c(294):  8x39
    INFO: ms_gauden.c(354): 1248 variance values floored
    INFO: ptm_mgau.c(801): Number of codebooks exceeds 256: 320
    INFO: acmod.c(119): Attempting to use semi-continuous computation module
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /home/lorena/astout/model_parameters/db.cd_cont_200/means
    INFO: ms_gauden.c(292): 320 codebook, 1 feature, size: 
    INFO: ms_gauden.c(294):  8x39
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /home/lorena/astout/model_parameters/db.cd_cont_200/variances
    INFO: ms_gauden.c(292): 320 codebook, 1 feature, size: 
    INFO: ms_gauden.c(294):  8x39
    INFO: ms_gauden.c(354): 1248 variance values floored
    INFO: acmod.c(121): Falling back to general multi-stream GMM computation
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /home/lorena/astout/model_parameters/db.cd_cont_200/means
    INFO: ms_gauden.c(292): 320 codebook, 1 feature, size: 
    INFO: ms_gauden.c(294):  8x39
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /home/lorena/astout/model_parameters/db.cd_cont_200/variances
    INFO: ms_gauden.c(292): 320 codebook, 1 feature, size: 
    INFO: ms_gauden.c(294):  8x39
    INFO: ms_gauden.c(354): 1248 variance values floored
    INFO: ms_senone.c(149): Reading senone mixture weights: /home/lorena/astout/model_parameters/db.cd_cont_200/mixture_weights
    INFO: ms_senone.c(200): Truncating senone logs3(pdf) values by 10 bits
    INFO: ms_senone.c(207): Not transposing mixture weights in memory
    INFO: ms_senone.c(268): Read mixture weights for 320 senones: 1 features x 8 codewords
    INFO: ms_senone.c(320): Mapping senones to individual codebooks
    INFO: ms_mgau.c(141): The value of topn: 4
    INFO: phone_loop_search.c(114): State beam -225 Phone exit beam -225 Insertion penalty 0
    INFO: dict.c(320): Allocating 4950 * 32 bytes (154 KiB) for word entries
    INFO: dict.c(333): Reading main dictionary: /home/lorena/astout/etc/db.dic
    INFO: dict.c(213): Allocated 6 KiB for strings, 12 KiB for phones
    INFO: dict.c(336): 851 words read
    INFO: dict.c(358): Reading filler dictionary: /home/lorena/astout/model_parameters/db.cd_cont_200/noisedict
    INFO: dict.c(213): Allocated 0 KiB for strings, 0 KiB for phones
    INFO: dict.c(361): 3 words read
    INFO: dict2pid.c(396): Building PID tables for dictionary
    INFO: dict2pid.c(406): Allocating 40^3 * 2 bytes (125 KiB) for word-initial triphones
    INFO: dict2pid.c(132): Allocated 38720 bytes (37 KiB) for word-final triphones
    INFO: dict2pid.c(196): Allocated 38720 bytes (37 KiB) for single-phone word triphones
    INFO: ngram_model_trie.c(456): Trying to read LM in trie binary format
    INFO: ngram_search_fwdtree.c(99): 189 unique initial diphones
    INFO: ngram_search_fwdtree.c(148): 0 root, 0 non-root channels, 37 single-phone words
    INFO: ngram_search_fwdtree.c(186): Creating search tree
    INFO: ngram_search_fwdtree.c(192): before: 0 root, 0 non-root channels, 37 single-phone words
    INFO: ngram_search_fwdtree.c(326): after: max nonroot chan increased to 3381
    INFO: ngram_search_fwdtree.c(339): after: 188 root, 3253 non-root channels, 10 single-phone words
    INFO: ngram_search_fwdflat.c(157): fwdflat: min_ef_width = 4, max_sf_win = 25
    INFO: batch.c(728): Decoding '4E65879E2F4C276870C233D02C5A2F50'
    INFO: cmn.c(183): CMN: 31.24 -10.95 16.35 -10.84 -2.55 -3.02  8.62 -4.84  6.83  3.39  1.67  2.73 -1.71 
    INFO: ngram_search_fwdtree.c(1553):     1244 words recognized (4/fr)
    INFO: ngram_search_fwdtree.c(1555):    53752 senones evaluated (154/fr)
    INFO: ngram_search_fwdtree.c(1559):   231357 channels searched (661/fr), 55340 1st, 17848 last
    INFO: ngram_search_fwdtree.c(1562):     7916 words for which last channels evaluated (22/fr)
    INFO: ngram_search_fwdtree.c(1564):     6442 candidate words for entering last phone (18/fr)
    INFO: ngram_search_fwdtree.c(1567): fwdtree 0.09 CPU 0.026 xRT
    INFO: ngram_search_fwdtree.c(1570): fwdtree 0.09 wall 0.027 xRT
    INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 35 words
    INFO: ngram_search_fwdflat.c(948):      723 words recognized (2/fr)
    INFO: ngram_search_fwdflat.c(950):     9679 senones evaluated (28/fr)
    INFO: ngram_search_fwdflat.c(952):     6148 channels searched (17/fr)
    INFO: ngram_search_fwdflat.c(954):     2843 words searched (8/fr)
    INFO: ngram_search_fwdflat.c(957):     1983 word transitions (5/fr)
    INFO: ngram_search_fwdflat.c(960): fwdflat 0.01 CPU 0.002 xRT
    INFO: ngram_search_fwdflat.c(963): fwdflat 0.01 wall 0.002 xRT
    INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.315
    INFO: ngram_search.c(1279): Eliminated 0 nodes before end node
    INFO: ngram_search.c(1384): Lattice has 70 nodes, 9 links
    INFO: ps_lattice.c(1380): Bestpath score: -6012
    INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:315:348) = -363633
    INFO: ps_lattice.c(1441): Joint P(O,S) = -363716 P(S|O) = -83
    INFO: ngram_search.c(875): bestpath 0.00 CPU 0.001 xRT
    INFO: ngram_search.c(878): bestpath 0.00 wall 0.000 xRT
    INFO: batch.c(760): 4E65879E2F4C276870C233D02C5A2F50: 3.49 seconds speech, 0.10 seconds CPU, 0.10 seconds wall
    INFO: batch.c(762): 4E65879E2F4C276870C233D02C5A2F50: 0.03 xRT (CPU), 0.03 xRT (elapsed)
    ossos temporais (4E65879E2F4C276870C233D02C5A2F50 -6309)
    4E65879E2F4C276870C233D02C5A2F50 done --------------------------------------
    INFO: batch.c(728): Decoding '8F4BE1EF2373B846B8803DCD0BEFA542'
    INFO: cmn.c(183): CMN: 34.49 -22.29 14.97  6.46 -5.15 -6.22  2.00  2.56 -5.63 11.06 -6.62 12.27  0.10 
    INFO: ngram_search_fwdtree.c(1553):     1602 words recognized (2/fr)
    INFO: ngram_search_fwdtree.c(1555):    76086 senones evaluated (104/fr)
    INFO: ngram_search_fwdtree.c(1559):   227253 channels searched (310/fr), 70741 1st, 17426 last
    INFO: ngram_search_fwdtree.c(1562):     8347 words for which last channels evaluated (11/fr)
    INFO: ngram_search_fwdtree.c(1564):     5172 candidate words for entering last phone (7/fr)
    INFO: ngram_search_fwdtree.c(1567): fwdtree 0.12 CPU 0.017 xRT
    INFO: ngram_search_fwdtree.c(1570): fwdtree 0.12 wall 0.017 xRT
    INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 26 words
    INFO: ngram_search_fwdflat.c(948):      730 words recognized (1/fr)
    INFO: ngram_search_fwdflat.c(950):    14323 senones evaluated (20/fr)
    INFO: ngram_search_fwdflat.c(952):     7612 channels searched (10/fr)
    INFO: ngram_search_fwdflat.c(954):     3214 words searched (4/fr)
    INFO: ngram_search_fwdflat.c(957):     2146 word transitions (2/fr)
    INFO: ngram_search_fwdflat.c(960): fwdflat 0.01 CPU 0.001 xRT
    INFO: ngram_search_fwdflat.c(963): fwdflat 0.01 wall 0.001 xRT
    INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.729
    INFO: ngram_search.c(1279): Eliminated 0 nodes before end node
    INFO: ngram_search.c(1384): Lattice has 96 nodes, 50 links
    INFO: ps_lattice.c(1380): Bestpath score: -8189
    INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:729:731) = -538456
    INFO: ps_lattice.c(1441): Joint P(O,S) = -538494 P(S|O) = -38
    INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
    INFO: ngram_search.c(878): bestpath 0.00 wall 0.000 xRT
    INFO: batch.c(760): 8F4BE1EF2373B846B8803DCD0BEFA542: 7.32 seconds speech, 0.13 seconds CPU, 0.13 seconds wall
    INFO: batch.c(762): 8F4BE1EF2373B846B8803DCD0BEFA542: 0.02 xRT (CPU), 0.02 xRT (elapsed)
    vias biliares sem sinais evidentes de dilatação (8F4BE1EF2373B846B8803DCD0BEFA542 -8438)
    8F4BE1EF2373B846B8803DCD0BEFA542 done --------------------------------------
    INFO: batch.c(728): Decoding 'D0A20AFD9EAFFBEBDB5DC1B20CE1D0F9'
    INFO: cmn.c(183): CMN: 34.28 -11.55  9.98  5.85 -4.00 -0.73 -4.40  7.20 -5.18 10.99 -5.04  8.42 -1.75 
    INFO: ngram_search_fwdtree.c(1553):     2056 words recognized (2/fr)
    INFO: ngram_search_fwdtree.c(1555):   103847 senones evaluated (117/fr)
    INFO: ngram_search_fwdtree.c(1559):   356596 channels searched (403/fr), 101601 1st, 27887 last
    INFO: ngram_search_fwdtree.c(1562):    12915 words for which last channels evaluated (14/fr)
    INFO: ngram_search_fwdtree.c(1564):     8838 candidate words for entering last phone (9/fr)
    INFO: ngram_search_fwdtree.c(1567): fwdtree 0.16 CPU 0.018 xRT
    INFO: ngram_search_fwdtree.c(1570): fwdtree 0.15 wall 0.017 xRT
    INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 39 words
    INFO: ngram_search_fwdflat.c(948):     1011 words recognized (1/fr)
    INFO: ngram_search_fwdflat.c(950):    17850 senones evaluated (20/fr)
    INFO: ngram_search_fwdflat.c(952):     9788 channels searched (11/fr)
    INFO: ngram_search_fwdflat.c(954):     4486 words searched (5/fr)
    INFO: ngram_search_fwdflat.c(957):     2690 word transitions (3/fr)
    INFO: ngram_search_fwdflat.c(960): fwdflat 0.01 CPU 0.001 xRT
    INFO: ngram_search_fwdflat.c(963): fwdflat 0.01 wall 0.001 xRT
    INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.880
    INFO: ngram_search.c(1279): Eliminated 0 nodes before end node
    INFO: ngram_search.c(1384): Lattice has 89 nodes, 43 links
    INFO: ps_lattice.c(1380): Bestpath score: -9897
    INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:880:882) = -645607
    INFO: ps_lattice.c(1441): Joint P(O,S) = -645629 P(S|O) = -22
    INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
    INFO: ngram_search.c(878): bestpath 0.00 wall 0.000 xRT
    INFO: batch.c(760): D0A20AFD9EAFFBEBDB5DC1B20CE1D0F9: 8.83 seconds speech, 0.17 seconds CPU, 0.17 seconds wall
    INFO: batch.c(762): D0A20AFD9EAFFBEBDB5DC1B20CE1D0F9: 0.02 xRT (CPU), 0.02 xRT (elapsed)
    vias biliares sem sinais evidentes de dilatação (D0A20AFD9EAFFBEBDB5DC1B20CE1D0F9 -10231)
    D0A20AFD9EAFFBEBDB5DC1B20CE1D0F9 done --------------------------------------
    
    INFO: batch.c(728): Decoding 'CB79BAE39EFBCD81C72959EE2A17E5E8'
    INFO: cmn.c(183): CMN: 27.43 -6.25 13.96 -1.29 -6.93 -0.34 -0.26  5.28 -1.74  0.98  1.18  3.19  7.87 
    INFO: ngram_search_fwdtree.c(1553):     2394 words recognized (2/fr)
    INFO: ngram_search_fwdtree.c(1555):   113072 senones evaluated (108/fr)
    INFO: ngram_search_fwdtree.c(1559):   357452 channels searched (341/fr), 111365 1st, 26624 last
    INFO: ngram_search_fwdtree.c(1562):    12562 words for which last channels evaluated (11/fr)
    INFO: ngram_search_fwdtree.c(1564):     7943 candidate words for entering last phone (7/fr)
    INFO: ngram_search_fwdtree.c(1567): fwdtree 0.18 CPU 0.017 xRT
    INFO: ngram_search_fwdtree.c(1570): fwdtree 0.18 wall 0.017 xRT
    INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 37 words
    INFO: ngram_search_fwdflat.c(948):     1306 words recognized (1/fr)
    INFO: ngram_search_fwdflat.c(950):    19726 senones evaluated (19/fr)
    INFO: ngram_search_fwdflat.c(952):    10474 channels searched (10/fr)
    INFO: ngram_search_fwdflat.c(954):     4680 words searched (4/fr)
    INFO: ngram_search_fwdflat.c(957):     2587 word transitions (2/fr)
    INFO: ngram_search_fwdflat.c(960): fwdflat 0.01 CPU 0.001 xRT
    INFO: ngram_search_fwdflat.c(963): fwdflat 0.01 wall 0.001 xRT
    INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.1005
    INFO: ngram_search.c(1279): Eliminated 0 nodes before end node
    INFO: ngram_search.c(1384): Lattice has 139 nodes, 51 links
    INFO: ps_lattice.c(1380): Bestpath score: -12292
    INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:1005:1045) = -794759
    INFO: ps_lattice.c(1441): Joint P(O,S) = -796484 P(S|O) = -1725
    INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
    INFO: ngram_search.c(878): bestpath 0.00 wall 0.000 xRT
    INFO: batch.c(760): CB79BAE39EFBCD81C72959EE2A17E5E8: 10.46 seconds speech, 0.19 seconds CPU, 0.19 seconds wall
    INFO: batch.c(762): CB79BAE39EFBCD81C72959EE2A17E5E8: 0.02 xRT (CPU), 0.02 xRT (elapsed)
    globos oculares de morfologia dimensões e densidade normais (CB79BAE39EFBCD81C72959EE2A17E5E8 -12584)
    CB79BAE39EFBCD81C72959EE2A17E5E8 done --------------------------------------
    INFO: batch.c(728): Decoding '701FC2B8C8A868242284DDE4A559CF9A'
    INFO: cmn.c(183): CMN: 30.28 -11.82  6.52 -2.64 -13.28  3.86 -6.42  7.77 -3.46  6.47  0.80  3.88  4.71 
    INFO: ngram_search_fwdtree.c(1553):     4126 words recognized (2/fr)
    INFO: ngram_search_fwdtree.c(1555):   194639 senones evaluated (99/fr)
    INFO: ngram_search_fwdtree.c(1559):   558892 channels searched (282/fr), 195409 1st, 39260 last
    INFO: ngram_search_fwdtree.c(1562):    19785 words for which last channels evaluated (10/fr)
    INFO: ngram_search_fwdtree.c(1564):    11513 candidate words for entering last phone (5/fr)
    INFO: ngram_search_fwdtree.c(1567): fwdtree 0.31 CPU 0.016 xRT
    INFO: ngram_search_fwdtree.c(1570): fwdtree 0.32 wall 0.016 xRT
    INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 45 words
    INFO: ngram_search_fwdflat.c(948):     2397 words recognized (1/fr)
    INFO: ngram_search_fwdflat.c(950):    29433 senones evaluated (15/fr)
    INFO: ngram_search_fwdflat.c(952):    15859 channels searched (8/fr)
    INFO: ngram_search_fwdflat.c(954):     8030 words searched (4/fr)
    INFO: ngram_search_fwdflat.c(957):     3866 word transitions (1/fr)
    INFO: ngram_search_fwdflat.c(960): fwdflat 0.02 CPU 0.001 xRT
    INFO: ngram_search_fwdflat.c(963): fwdflat 0.02 wall 0.001 xRT
    INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.1930
    INFO: ngram_search.c(1279): Eliminated 0 nodes before end node
    INFO: ngram_search.c(1384): Lattice has 187 nodes, 120 links
    INFO: ps_lattice.c(1380): Bestpath score: -17560
    INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:1930:1973) = -1169670
    INFO: ps_lattice.c(1441): Joint P(O,S) = -1169785 P(S|O) = -115
    INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
    INFO: ngram_search.c(878): bestpath 0.00 wall 0.000 xRT
    INFO: batch.c(760): 701FC2B8C8A868242284DDE4A559CF9A: 19.74 seconds speech, 0.34 seconds CPU, 0.34 seconds wall
    INFO: batch.c(762): 701FC2B8C8A868242284DDE4A559CF9A: 0.02 xRT (CPU), 0.02 xRT (elapsed)
    hipófise de volume e densidade normais com impregnação habitual após a administração do meio de contraste (701FC2B8C8A868242284DDE4A559CF9A -18657)
    701FC2B8C8A868242284DDE4A559CF9A done --------------------------------------
    INFO: batch.c(728): Decoding '47B548B08FBA2A723AFA560BAFAF9523'
    INFO: cmn.c(183): CMN: 35.88 -14.70  9.17  0.40 -10.71  4.32 -7.68  8.27 -8.67 10.75 -6.49  0.13  3.80 
    INFO: ngram_search_fwdtree.c(1553):     2054 words recognized (2/fr)
    INFO: ngram_search_fwdtree.c(1555):    83678 senones evaluated (73/fr)
    INFO: ngram_search_fwdtree.c(1559):   210449 channels searched (184/fr), 80155 1st, 17584 last
    INFO: ngram_search_fwdtree.c(1562):     8980 words for which last channels evaluated (7/fr)
    INFO: ngram_search_fwdtree.c(1564):     4382 candidate words for entering last phone (3/fr)
    INFO: ngram_search_fwdtree.c(1567): fwdtree 0.15 CPU 0.013 xRT
    INFO: ngram_search_fwdtree.c(1570): fwdtree 0.15 wall 0.013 xRT
    INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 28 words
    INFO: ngram_search_fwdflat.c(948):     1382 words recognized (1/fr)
    INFO: ngram_search_fwdflat.c(950):    16471 senones evaluated (14/fr)
    INFO: ngram_search_fwdflat.c(952):    10357 channels searched (9/fr)
    INFO: ngram_search_fwdflat.c(954):     4821 words searched (4/fr)
    INFO: ngram_search_fwdflat.c(957):     2217 word transitions (1/fr)
    INFO: ngram_search_fwdflat.c(960): fwdflat 0.01 CPU 0.001 xRT
    INFO: ngram_search_fwdflat.c(963): fwdflat 0.01 wall 0.001 xRT
    INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.1113
    INFO: ngram_search.c(1279): Eliminated 0 nodes before end node
    INFO: ngram_search.c(1384): Lattice has 71 nodes, 44 links
    INFO: ps_lattice.c(1380): Bestpath score: -17268
    INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:1113:1141) = -1197381
    INFO: ps_lattice.c(1441): Joint P(O,S) = -1200230 P(S|O) = -2849
    INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
    INFO: ngram_search.c(878): bestpath 0.00 wall 0.000 xRT
    INFO: batch.c(760): 47B548B08FBA2A723AFA560BAFAF9523: 11.42 seconds speech, 0.16 seconds CPU, 0.16 seconds wall
    INFO: batch.c(762): 47B548B08FBA2A723AFA560BAFAF9523: 0.01 xRT (CPU), 0.01 xRT (elapsed)
    cavidades timpânicas e recessos os e que de a do saco (47B548B08FBA2A723AFA560BAFAF9523 -17344)
    47B548B08FBA2A723AFA560BAFAF9523 done --------------------------------------
    INFO: batch.c(728): Decoding 'B4B49ADC6579F00594C2EBECA5D0990E'
    INFO: cmn.c(183): CMN: 42.23 -2.64 -2.10 -1.85 -21.07  2.78 -10.47  7.96 -13.23  6.88 -4.26 -1.96  6.10 
    INFO: ngram_search_fwdtree.c(1553):     1268 words recognized (2/fr)
    INFO: ngram_search_fwdtree.c(1555):    62877 senones evaluated (104/fr)
    INFO: ngram_search_fwdtree.c(1559):   191887 channels searched (316/fr), 63266 1st, 13367 last
    INFO: ngram_search_fwdtree.c(1562):     6669 words for which last channels evaluated (11/fr)
    INFO: ngram_search_fwdtree.c(1564):     4143 candidate words for entering last phone (6/fr)
    INFO: ngram_search_fwdtree.c(1567): fwdtree 0.09 CPU 0.015 xRT
    INFO: ngram_search_fwdtree.c(1570): fwdtree 0.09 wall 0.016 xRT
    INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 23 words
    INFO: ngram_search_fwdflat.c(948):      675 words recognized (1/fr)
    INFO: ngram_search_fwdflat.c(950):    10071 senones evaluated (17/fr)
    INFO: ngram_search_fwdflat.c(952):     5672 channels searched (9/fr)
    INFO: ngram_search_fwdflat.c(954):     2557 words searched (4/fr)
    INFO: ngram_search_fwdflat.c(957):     1445 word transitions (2/fr)
    INFO: ngram_search_fwdflat.c(960): fwdflat 0.01 CPU 0.001 xRT
    INFO: ngram_search_fwdflat.c(963): fwdflat 0.01 wall 0.001 xRT
    INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.578
    INFO: ngram_search.c(1279): Eliminated 0 nodes before end node
    INFO: ngram_search.c(1384): Lattice has 56 nodes, 45 links
    INFO: ps_lattice.c(1380): Bestpath score: -8147
    INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:578:604) = -472139
    INFO: ps_lattice.c(1441): Joint P(O,S) = -472149 P(S|O) = -10
    INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
    INFO: ngram_search.c(878): bestpath 0.00 wall 0.000 xRT
    INFO: batch.c(760): B4B49ADC6579F00594C2EBECA5D0990E: 6.05 seconds speech, 0.10 seconds CPU, 0.10 seconds wall
    INFO: batch.c(762): B4B49ADC6579F00594C2EBECA5D0990E: 0.02 xRT (CPU), 0.02 xRT (elapsed)
    ângulo condilomaleolar direito esquerdo (B4B49ADC6579F00594C2EBECA5D0990E -8350)
    B4B49ADC6579F00594C2EBECA5D0990E done --------------------------------------
    INFO: batch.c(728): Decoding '164A64F44EADCE4266549BE8EBE17CA6'
    INFO: cmn.c(183): CMN: 31.43 -20.51  5.64 -2.63  0.14 -5.23 -1.45  4.25 -3.17  6.80 -0.82  3.68  3.09 
    INFO: ngram_search_fwdtree.c(1553):     1697 words recognized (2/fr)
    INFO: ngram_search_fwdtree.c(1555):    86267 senones evaluated (115/fr)
    INFO: ngram_search_fwdtree.c(1559):   329879 channels searched (438/fr), 85038 1st, 26034 last
    INFO: ngram_search_fwdtree.c(1562):    11045 words for which last channels evaluated (14/fr)
    INFO: ngram_search_fwdtree.c(1564):     9582 candidate words for entering last phone (12/fr)
    INFO: ngram_search_fwdtree.c(1567): fwdtree 0.13 CPU 0.018 xRT
    INFO: ngram_search_fwdtree.c(1570): fwdtree 0.13 wall 0.018 xRT
    INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 30 words
    INFO: ngram_search_fwdflat.c(948):      927 words recognized (1/fr)
    INFO: ngram_search_fwdflat.c(950):    14248 senones evaluated (19/fr)
    INFO: ngram_search_fwdflat.c(952):     9287 channels searched (12/fr)
    INFO: ngram_search_fwdflat.c(954):     3661 words searched (4/fr)
    INFO: ngram_search_fwdflat.c(957):     1822 word transitions (2/fr)
    INFO: ngram_search_fwdflat.c(960): fwdflat 0.01 CPU 0.002 xRT
    INFO: ngram_search_fwdflat.c(963): fwdflat 0.01 wall 0.001 xRT
    INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.707
    INFO: ngram_search.c(1279): Eliminated 0 nodes before end node
    INFO: ngram_search.c(1384): Lattice has 61 nodes, 29 links
    INFO: ps_lattice.c(1380): Bestpath score: -6988
    INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:707:750) = -422751
    INFO: ps_lattice.c(1441): Joint P(O,S) = -422756 P(S|O) = -5
    INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
    INFO: ngram_search.c(878): bestpath 0.00 wall 0.000 xRT
    INFO: batch.c(760): 164A64F44EADCE4266549BE8EBE17CA6: 7.51 seconds speech, 0.14 seconds CPU, 0.14 seconds wall
    INFO: batch.c(762): 164A64F44EADCE4266549BE8EBE17CA6: 0.02 xRT (CPU), 0.02 xRT (elapsed)
    cisternas basais sem alterações (164A64F44EADCE4266549BE8EBE17CA6 -7434)
    164A64F44EADCE4266549BE8EBE17CA6 done --------------------------------------
    INFO: batch.c(728): Decoding 'C898CAAA5023235A38402575731EBC9E'
    INFO: cmn.c(183): CMN: 30.72 -7.20  3.75 -3.60 -1.30 -5.82  1.23 -1.36  4.59  5.16  0.39  2.48  5.09 
    INFO: ngram_search_fwdtree.c(1553):     1774 words recognized (2/fr)
    INFO: ngram_search_fwdtree.c(1555):    92002 senones evaluated (116/fr)
    INFO: ngram_search_fwdtree.c(1559):   328000 channels searched (412/fr), 90480 1st, 24539 last
    INFO: ngram_search_fwdtree.c(1562):    10853 words for which last channels evaluated (13/fr)
    INFO: ngram_search_fwdtree.c(1564):     8418 candidate words for entering last phone (10/fr)
    INFO: ngram_search_fwdtree.c(1567): fwdtree 0.14 CPU 0.018 xRT
    INFO: ngram_search_fwdtree.c(1570): fwdtree 0.14 wall 0.018 xRT
    INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 32 words
    INFO: ngram_search_fwdflat.c(948):      979 words recognized (1/fr)
    INFO: ngram_search_fwdflat.c(950):    14842 senones evaluated (19/fr)
    INFO: ngram_search_fwdflat.c(952):     9721 channels searched (12/fr)
    INFO: ngram_search_fwdflat.c(954):     4100 words searched (5/fr)
    INFO: ngram_search_fwdflat.c(957):     2085 word transitions (2/fr)
    INFO: ngram_search_fwdflat.c(960): fwdflat 0.01 CPU 0.001 xRT
    INFO: ngram_search_fwdflat.c(963): fwdflat 0.01 wall 0.001 xRT
    INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.755
    INFO: ngram_search.c(1279): Eliminated 0 nodes before end node
    INFO: ngram_search.c(1384): Lattice has 99 nodes, 31 links
    INFO: ps_lattice.c(1380): Bestpath score: -6020
    INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:755:794) = -360803
    INFO: ps_lattice.c(1441): Joint P(O,S) = -360803 P(S|O) = 0
    INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
    INFO: ngram_search.c(878): bestpath 0.00 wall 0.000 xRT
    INFO: batch.c(760): C898CAAA5023235A38402575731EBC9E: 7.95 seconds speech, 0.15 seconds CPU, 0.15 seconds wall
    INFO: batch.c(762): C898CAAA5023235A38402575731EBC9E: 0.02 xRT (CPU), 0.02 xRT (elapsed)
    cisternas basais sem alterações (C898CAAA5023235A38402575731EBC9E -6224)
    C898CAAA5023235A38402575731EBC9E done --------------------------------------
    INFO: batch.c(728): Decoding 'AF4AB6D0A048CD6AC01D87E12126BE2A'
    INFO: cmn.c(183): CMN: 28.55 -13.00 10.47 -2.68 -10.79  0.65 -1.13  1.44 -0.45  5.75 -1.99  4.14  0.83 
    INFO: ngram_search_fwdtree.c(1553):     4430 words recognized (2/fr)
    INFO: ngram_search_fwdtree.c(1555):   234632 senones evaluated (113/fr)
    INFO: ngram_search_fwdtree.c(1559):   699995 channels searched (336/fr), 225902 1st, 52672 last
    INFO: ngram_search_fwdtree.c(1562):    24652 words for which last channels evaluated (11/fr)
    INFO: ngram_search_fwdtree.c(1564):    15925 candidate words for entering last phone (7/fr)
    INFO: ngram_search_fwdtree.c(1567): fwdtree 0.36 CPU 0.017 xRT
    INFO: ngram_search_fwdtree.c(1570): fwdtree 0.35 wall 0.017 xRT
    INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 48 words
    INFO: ngram_search_fwdflat.c(948):     2613 words recognized (1/fr)
    INFO: ngram_search_fwdflat.c(950):    39977 senones evaluated (19/fr)
    INFO: ngram_search_fwdflat.c(952):    23016 channels searched (11/fr)
    INFO: ngram_search_fwdflat.c(954):    10117 words searched (4/fr)
    INFO: ngram_search_fwdflat.c(957):     5616 word transitions (2/fr)
    INFO: ngram_search_fwdflat.c(960): fwdflat 0.03 CPU 0.001 xRT
    INFO: ngram_search_fwdflat.c(963): fwdflat 0.03 wall 0.001 xRT
    INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.2045
    INFO: ngram_search.c(1279): Eliminated 0 nodes before end node
    INFO: ngram_search.c(1384): Lattice has 296 nodes, 182 links
    INFO: ps_lattice.c(1380): Bestpath score: -25429
    INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:2045:2076) = -1600166
    INFO: ps_lattice.c(1441): Joint P(O,S) = -1617870 P(S|O) = -17704
    INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
    INFO: ngram_search.c(878): bestpath 0.00 wall 0.000 xRT
    INFO: batch.c(760): AF4AB6D0A048CD6AC01D87E12126BE2A: 20.77 seconds speech, 0.38 seconds CPU, 0.38 seconds wall
    INFO: batch.c(762): AF4AB6D0A048CD6AC01D87E12126BE2A: 0.02 xRT (CPU), 0.02 xRT (elapsed)
    o estudo com a boca aberta demonstra excursão ampla e simétrica dos côndilos mandibulares sem evidências de subluxações (AF4AB6D0A048CD6AC01D87E12126BE2A -25582)
    AF4AB6D0A048CD6AC01D87E12126BE2A done --------------------------------------
    INFO: batch.c(728): Decoding '481531FBC5006DDAAD1FA00BB8233185'
    INFO: cmn.c(183): CMN: 36.27 -5.43  4.44 -4.16 -11.98  9.04 -6.60 -0.17  3.05  7.31  3.31 -1.22  6.48 
    INFO: ngram_search_fwdtree.c(1553):     2169 words recognized (2/fr)
    INFO: ngram_search_fwdtree.c(1555):    90997 senones evaluated (86/fr)
    INFO: ngram_search_fwdtree.c(1559):   260937 channels searched (246/fr), 88056 1st, 21422 last
    INFO: ngram_search_fwdtree.c(1562):    10113 words for which last channels evaluated (9/fr)
    INFO: ngram_search_fwdtree.c(1564):     5709 candidate words for entering last phone (5/fr)
    INFO: ngram_search_fwdtree.c(1567): fwdtree 0.15 CPU 0.014 xRT
    INFO: ngram_search_fwdtree.c(1570): fwdtree 0.15 wall 0.014 xRT
    INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 35 words
    INFO: ngram_search_fwdflat.c(948):     1098 words recognized (1/fr)
    INFO: ngram_search_fwdflat.c(950):    19244 senones evaluated (18/fr)
    INFO: ngram_search_fwdflat.c(952):    11891 channels searched (11/fr)
    INFO: ngram_search_fwdflat.c(954):     4941 words searched (4/fr)
    INFO: ngram_search_fwdflat.c(957):     2647 word transitions (2/fr)
    INFO: ngram_search_fwdflat.c(960): fwdflat 0.02 CPU 0.002 xRT
    INFO: ngram_search_fwdflat.c(963): fwdflat 0.01 wall 0.001 xRT
    INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.1019
    INFO: ngram_search.c(1279): Eliminated 0 nodes before end node
    INFO: ngram_search.c(1384): Lattice has 89 nodes, 70 links
    INFO: ps_lattice.c(1380): Bestpath score: -12185
    INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:1019:1058) = -748374
    INFO: ps_lattice.c(1441): Joint P(O,S) = -748845 P(S|O) = -471
    INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
    INFO: ngram_search.c(878): bestpath 0.00 wall 0.000 xRT
    INFO: batch.c(760): 481531FBC5006DDAAD1FA00BB8233185: 10.59 seconds speech, 0.17 seconds CPU, 0.17 seconds wall
    INFO: batch.c(762): 481531FBC5006DDAAD1FA00BB8233185: 0.02 xRT (CPU), 0.02 xRT (elapsed)
    angiotomografia computadorizada dos vasos pulmonares (481531FBC5006DDAAD1FA00BB8233185 -12379)
    481531FBC5006DDAAD1FA00BB8233185 done --------------------------------------
    INFO: batch.c(728): Decoding 'E68D10FE03F5F20262E48A1275B87B64'
    INFO: cmn.c(183): CMN: 35.23 -12.58 -2.01 -7.39 -7.09  1.09 -10.99  9.58 -9.66  5.26 -4.68  0.30  2.76 
    INFO: ngram_search_fwdtree.c(1553):      693 words recognized (3/fr)
    INFO: ngram_search_fwdtree.c(1555):    33470 senones evaluated (124/fr)
    INFO: ngram_search_fwdtree.c(1559):   118255 channels searched (436/fr), 36800 1st, 8304 last
    INFO: ngram_search_fwdtree.c(1562):     3894 words for which last channels evaluated (14/fr)
    INFO: ngram_search_fwdtree.c(1564):     2571 candidate words for entering last phone (9/fr)
    INFO: ngram_search_fwdtree.c(1567): fwdtree 0.05 CPU 0.018 xRT
    INFO: ngram_search_fwdtree.c(1570): fwdtree 0.05 wall 0.018 xRT
    INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 12 words
    INFO: ngram_search_fwdflat.c(948):      492 words recognized (2/fr)
    INFO: ngram_search_fwdflat.c(950):     4058 senones evaluated (15/fr)
    INFO: ngram_search_fwdflat.c(952):     2014 channels searched (7/fr)
    INFO: ngram_search_fwdflat.c(954):     1181 words searched (4/fr)
    INFO: ngram_search_fwdflat.c(957):      626 word transitions (2/fr)
    INFO: ngram_search_fwdflat.c(960): fwdflat 0.00 CPU 0.001 xRT
    INFO: ngram_search_fwdflat.c(963): fwdflat 0.00 wall 0.001 xRT
    INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.232
    INFO: ngram_search.c(1279): Eliminated 0 nodes before end node
    INFO: ngram_search.c(1384): Lattice has 41 nodes, 23 links
    INFO: ps_lattice.c(1380): Bestpath score: -3279
    INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:232:269) = -218109
    INFO: ps_lattice.c(1441): Joint P(O,S) = -218113 P(S|O) = -4
    INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
    INFO: ngram_search.c(878): bestpath 0.00 wall 0.000 xRT
    INFO: batch.c(760): E68D10FE03F5F20262E48A1275B87B64: 2.70 seconds speech, 0.05 seconds CPU, 0.05 seconds wall
    INFO: batch.c(762): E68D10FE03F5F20262E48A1275B87B64: 0.02 xRT (CPU), 0.02 xRT (elapsed)
    comentários (E68D10FE03F5F20262E48A1275B87B64 -3611)
    E68D10FE03F5F20262E48A1275B87B64 done --------------------------------------
    INFO: batch.c(728): Decoding '18F5C148A5B9F644BB33D9567DEAE7A5'
    INFO: cmn.c(183): CMN: 41.04 -13.45 13.54  1.81 -2.22 -6.72  0.16 -1.82 -6.19  9.47  2.28  1.53  1.58 
    INFO: ngram_search_fwdtree.c(1553):     1146 words recognized (2/fr)
    INFO: ngram_search_fwdtree.c(1555):    62545 senones evaluated (117/fr)
    INFO: ngram_search_fwdtree.c(1559):   185267 channels searched (346/fr), 61553 1st, 13796 last
    INFO: ngram_search_fwdtree.c(1562):     6675 words for which last channels evaluated (12/fr)
    INFO: ngram_search_fwdtree.c(1564):     4008 candidate words for entering last phone (7/fr)
    INFO: ngram_search_fwdtree.c(1567): fwdtree 0.09 CPU 0.017 xRT
    INFO: ngram_search_fwdtree.c(1570): fwdtree 0.09 wall 0.017 xRT
    INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 29 words
    INFO: ngram_search_fwdflat.c(948):      546 words recognized (1/fr)
    INFO: ngram_search_fwdflat.c(950):     9640 senones evaluated (18/fr)
    INFO: ngram_search_fwdflat.c(952):     6103 channels searched (11/fr)
    INFO: ngram_search_fwdflat.c(954):     2514 words searched (4/fr)
    INFO: ngram_search_fwdflat.c(957):     1757 word transitions (3/fr)
    INFO: ngram_search_fwdflat.c(960): fwdflat 0.00 CPU 0.001 xRT
    INFO: ngram_search_fwdflat.c(963): fwdflat 0.01 wall 0.001 xRT
    INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.494
    INFO: ngram_search.c(1279): Eliminated 0 nodes before end node
    INFO: ngram_search.c(1384): Lattice has 37 nodes, 21 links
    INFO: ps_lattice.c(1380): Bestpath score: -5219
    INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:494:533) = -349032
    INFO: ps_lattice.c(1441): Joint P(O,S) = -349092 P(S|O) = -60
    INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
    INFO: ngram_search.c(878): bestpath 0.00 wall 0.000 xRT
    INFO: batch.c(760): 18F5C148A5B9F644BB33D9567DEAE7A5: 5.34 seconds speech, 0.10 seconds CPU, 0.10 seconds wall
    CB82728D27F01A41B0EBC9557B27E4C7 done --------------------------------------
    INFO: batch.c(728): Decoding 'E01359329B41907BFFBAC052629A8096'
    INFO: cmn.c(183): CMN: 31.07 -10.35  7.52 -5.81 -3.02 -1.41 -2.23 -2.03 -2.86  6.33 -0.80  4.49  2.33 
    INFO: ngram_search_fwdtree.c(1553):     3627 words recognized (2/fr)
    INFO: ngram_search_fwdtree.c(1555):   144897 senones evaluated (96/fr)
    INFO: ngram_search_fwdtree.c(1559):   434503 channels searched (287/fr), 138403 1st, 34376 last
    INFO: ngram_search_fwdtree.c(1562):    16094 words for which last channels evaluated (10/fr)
    INFO: ngram_search_fwdtree.c(1564):     9893 candidate words for entering last phone (6/fr)
    INFO: ngram_search_fwdtree.c(1567): fwdtree 0.24 CPU 0.016 xRT
    INFO: ngram_search_fwdtree.c(1570): fwdtree 0.24 wall 0.016 xRT
    INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 54 words
    INFO: ngram_search_fwdflat.c(948):     1673 words recognized (1/fr)
    INFO: ngram_search_fwdflat.c(950):    27900 senones evaluated (18/fr)
    INFO: ngram_search_fwdflat.c(952):    17848 channels searched (11/fr)
    INFO: ngram_search_fwdflat.c(954):     7546 words searched (4/fr)
    INFO: ngram_search_fwdflat.c(957):     4182 word transitions (2/fr)
    INFO: ngram_search_fwdflat.c(960): fwdflat 0.02 CPU 0.001 xRT
    INFO: ngram_search_fwdflat.c(963): fwdflat 0.02 wall 0.001 xRT
    INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.1483
    INFO: ngram_search.c(1279): Eliminated 0 nodes before end node
    INFO: ngram_search.c(1384): Lattice has 148 nodes, 76 links
    INFO: ps_lattice.c(1380): Bestpath score: -15491
    INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:1483:1510) = -921350
    INFO: ps_lattice.c(1441): Joint P(O,S) = -921400 P(S|O) = -50
    INFO: ngram_search.c(875): bestpath 0.00 CPU 0.000 xRT
    INFO: ngram_search.c(878): bestpath 0.00 wall 0.000 xRT
    INFO: batch.c(760): E01359329B41907BFFBAC052629A8096: 15.11 seconds speech, 0.26 seconds CPU, 0.26 seconds wall
    INFO: batch.c(762): E01359329B41907BFFBAC052629A8096: 0.02 xRT (CPU), 0.02 xRT (elapsed)
    espaços paravertebrais cervicais e planos musculogordurosos adjacentes íntegros (E01359329B41907BFFBAC052629A8096 -15770)
    E01359329B41907BFFBAC052629A8096 done --------------------------------------
    INFO: batch.c(777): TOTAL 4648.31 seconds speech, 83.07 seconds CPU, 83.12 seconds wall
    INFO: batch.c(779): AVERAGE 0.02 xRT (CPU), 0.02 xRT (elapsed)
    INFO: ngram_search_fwdtree.c(432): TOTAL fwdtree 76.58 CPU 0.016 xRT
    INFO: ngram_search_fwdtree.c(435): TOTAL fwdtree 76.55 wall 0.016 xRT
    INFO: ngram_search_fwdflat.c(176): TOTAL fwdflat 6.37 CPU 0.001 xRT
    INFO: ngram_search_fwdflat.c(179): TOTAL fwdflat 6.43 wall 0.001 xRT
    INFO: ngram_search.c(303): TOTAL bestpath 0.08 CPU 0.000 xRT
    INFO: ngram_search.c(306): TOTAL bestpath 0.10 wall 0.000 xRT
    Thu Nov 19 14:59:26 2015
    ~~~~~~~~~~~~~~~~~~
    
    Here is my file db.align
    
    ~~~~~~~~~~~~~~~~~~~
    Use of the encoding pragma is deprecated at /usr/local/lib/sphinxtrain/scripts/decode/word_align.pl line 14.
    ossos temporais  (user-4E65879E2F4C276870C233D02C5A2F50)
    ossos temporais  (user-4E65879E2F4C276870C233D02C5A2F50)
    Words: 2 Correct: 2 Errors: 0 Percent correct = 100.00% Error = 0.00% Accuracy = 100.00%
    Insertions: 0 Deletions: 0 Substitutions: 0
    vias biliares sem sinais evidentes de  dilataã§ã£o  (user-8F4BE1EF2373B846B8803DCD0BEFA542)
    vias biliares sem sinais evidentes de  dilataã§ã£o  (user-8F4BE1EF2373B846B8803DCD0BEFA542)
    Words: 7 Correct: 7 Errors: 0 Percent correct = 100.00% Error = 0.00% Accuracy = 100.00%
    Insertions: 0 Deletions: 0 Substitutions: 0
    vias biliares sem sinais evidentes de  dilataã§ã£o  (user-D0A20AFD9EAFFBEBDB5DC1B20CE1D0F9)
    vias biliares sem sinais evidentes de  dilataã§ã£o  (user-D0A20AFD9EAFFBEBDB5DC1B20CE1D0F9)
    Words: 7 Correct: 7 Errors: 0 Percent correct = 100.00% Error = 0.00% Accuracy = 100.00%
    Insertions: 0 Deletions: 0 Substitutions: 0
    artã©ria poplã­tea ©rvia SIL sem evidãªncias de  dilataã§ãµes ou  estenoses significativas  (user-1A03EA5E603235CDC97BD17E44E9A203)
    artã©ria poplã­tea ©rvia *** sem evidãªncias de  dilataã§ãµes ou  estenoses significativas  (user-1A03EA5E603235CDC97BD17E44E9A203)
    Words: 11 Correct: 10 Errors: 1 Percent correct = 90.91% Error = 9.09% Accuracy = 90.91%
    Insertions: 0 Deletions: 1 Substitutions: 0
    extensã£o com contraã§ã£o muscular direita esquerda  (user-343061727FCC66751CE5E7E3FC65AE4E)
    extensã£o com contraã§ã£o muscular direita esquerda  (user-343061727FCC66751CE5E7E3FC65AE4E)
    Words: 6 Correct: 6 Errors: 0 Percent correct = 100.00% Error = 0.00% Accuracy = 100.00%
    Insertions: 0 Deletions: 0 Substitutions: 0
    hilos de  aspecto normal  (user-D3469B1327001158ECEC971796F7939D)
    hilos de  aspecto normal  (user-D3469B1327001158ECEC971796F7939D)
    Words: 4 Correct: 4 Errors: 0 Percent correct = 100.00% Error = 0.00% Accuracy = 100.00%
    Insertions: 0 Deletions: 0 Substitutions: 0
    cisternas SIL sulcos E   fissuras sem alteraã§ãµes  (user-38AAC9DB5AC48DFB043A2F6DF326F7E1)
    cisternas *** sulcos DE  fissuras sem alteraã§ãµes  (user-38AAC9DB5AC48DFB043A2F6DF326F7E1)
    Words: 7 Correct: 5 Errors: 2 Percent correct = 71.43% Error = 28.57% Accuracy = 71.43%
    Insertions: 0 Deletions: 1 Substitutions: 1
    parãªnquima pulmonar com volume e   coeficientes de  atenuaã§ã£o preservados  (user-7C4EA21E08889927D924206B44A6872A)
    parãªnquima pulmonar com volume e   coeficientes de  atenuaã§ã£o preservados  (user-7C4EA21E08889927D924206B44A6872A)
    Words: 9 Correct: 9 Errors: 0 Percent correct = 100.00% Error = 0.00% Accuracy = 100.00%
    Insertions: 0 Deletions: 0 Substitutions: 0
    ã¢ngulo de  rotaã§ã£o tibial EXTERNA DIREITO         esquerdo  (user-86FC3ED89C9B45E194179A4E32FE9954)
    ã¢ngulo de  rotaã§ã£o tibial ***     OSTEOBLáSTICAS esquerdo  (user-86FC3ED89C9B45E194179A4E32FE9954)
    Words: 7 Correct: 5 Errors: 2 Percent correct = 71.43% Error = 28.57% Accuracy = 71.43%
    Insertions: 0 Deletions: 1 Substitutions: 1
    condutos auditivos externos de  dimensãµes e   morfologia normais  (user-5BBB3A748811FC327F33493C1584E3E8)
    condutos auditivos externos de  dimensãµes e   morfologia normais  (user-5BBB3A748811FC327F33493C1584E3E8)
    Words: 8 Correct: 8 Errors: 0 Percent correct = 100.00% Error = 0.00% Accuracy = 100.00%
    Insertions: 0 Deletions: 0 Substitutions: 0
    veias porta e   supra hepã¡ticas e   de  calibre preservado  (user-F6C34FC476CB972049F5CB63B2732E5F)
    veias porta e   supra hepã¡ticas e   de  calibre preservado  (user-F6C34FC476CB972049F5CB63B2732E5F)
    Words: 9 Correct: 9 Errors: 0 Percent correct = 100.00% Error = 0.00% Accuracy = 100.00%
    Insertions: 0 Deletions: 0 Substitutions: 0
    o   tronco da  artã©ria pulmonar MEDE       cerca de  centã­metros e   A   artã©ria interlobar direita SIL cerca de  centã­metros  (user-36956B212ABE4CD9F4BD84B9A8CDCB01)
    o   tronco da  artã©ria pulmonar SIMéTRICA cerca de  centã­metros e   *** artã©ria interlobar direita *** cerca de  centã­metros  (user-36956B212ABE4CD9F4BD84B9A8CDCB01)
    Words: 18 Correct: 15 Errors: 3 Percent correct = 83.33% Error = 16.67% Accuracy = 83.33%
    Insertions: 0 Deletions: 2 Substitutions: 1
    £o SIL antebraã§o SIL braã§o SIL coxa SIL perna e   ©  (user-091679A00202C27DCA38F80BE7A61C52)
    £o *** antebraã§o *** braã§o *** coxa *** perna e   ©  (user-091679A00202C27DCA38F80BE7A61C52)
    Words: 11 Correct: 7 Errors: 4 Percent correct = 63.64% Error = 36.36% Accuracy = 63.64%
    Insertions: 0 Deletions: 4 Substitutions: 0
    cavidade oral SIL base da  ­ngua SIL espaã§os sublingual e   submandibular de  aspecto normal  (user-B0CC045FB19ACDFDFFC9B6429A82F65A)
    cavidade oral *** base da  ­ngua *** espaã§os sublingual e   submandibular de  aspecto normal  (user-B0CC045FB19ACDFDFFC9B6429A82F65A)
    Words: 14 Correct: 12 Errors: 2 Percent correct = 85.71% Error = 14.29% Accuracy = 85.71%
    Insertions: 0 Deletions: 2 Substitutions: 0
    o   tronco tibiofibular e   seus principais ramos artã©rias fibular SIL tibial anterior e   tibial posterior estã£o ©rvios e   £o apresentam alteraã§ãµes parietais significativas  (user-E8558C1731B506592BA6D3504D7C3975)
    o   tronco tibiofibular e   seus principais ramos artã©rias fibular *** tibial anterior e   tibial posterior estã£o ©rvios e   £o apresentam alteraã§ãµes parietais significativas  (user-E8558C1731B506592BA6D3504D7C3975)
    Words: 23 Correct: 22 Errors: 1 Percent correct = 95.65% Error = 4.35% Accuracy = 95.65%
    Insertions: 0 Deletions: 1 Substitutions: 0
    caso tenha ocorrido realce significativo  (user-91975F9536F2AE812610253FF620F496)
    caso tenha ocorrido realce significativo  (user-91975F9536F2AE812610253FF620F496)
    Words: 5 Correct: 5 Errors: 0 Percent correct = 100.00% Error = 0.00% Accuracy = 100.00%
    Insertions: 0 Deletions: 0 Substitutions: 0
    abdome superior e   pelve com contraste venoso  (user-B6A51731A74542905203D8B137A5B248)
    abdome superior e   pelve com contraste venoso  (user-B6A51731A74542905203D8B137A5B248)
    Words: 7 Correct: 7 Errors: 0 Percent correct = 100.00% Error = 0.00% Accuracy = 100.00%
    Insertions: 0 Deletions: 0 Substitutions: 0
    coluna ©rea da  rinofaringe de  calibre normal  (user-FE09ED50FD76C1FA5642EDBDCE396399)
    coluna ©rea da  rinofaringe de  calibre normal  (user-FE09ED50FD76C1FA5642EDBDCE396399)
    Words: 7 Correct: 7 Errors: 0 Percent correct = 100.00% Error = 0.00% Accuracy = 100.00%
    Insertions: 0 Deletions: 0 Substitutions: 0
    parãªnquima cerebral com coeficientes de  atenuaã§ã£o preservados  (user-7EEC374A784B6B92741A17E9E339FA6C)
    parãªnquima cerebral com coeficientes de  atenuaã§ã£o preservados  (user-7EEC374A784B6B92741A17E9E339FA6C)
    Words: 7 Correct: 7 Errors: 0 Percent correct = 100.00% Error = 0.00% Accuracy = 100.00%
    Insertions: 0 Deletions: 0 Substitutions: 0
    parãªnquima cerebral com coeficientes de  atenuaã§ã£o preservados  (user-CB82728D27F01A41B0EBC9557B27E4C7)
    parãªnquima cerebral com coeficientes de  atenuaã§ã£o preservados  (user-CB82728D27F01A41B0EBC9557B27E4C7)
    Words: 7 Correct: 7 Errors: 0 Percent correct = 100.00% Error = 0.00% Accuracy = 100.00%
    Insertions: 0 Deletions: 0 Substitutions: 0
    espaã§os paravertebrais cervicais e   planos musculogordurosos adjacentes ã­ntegros  (user-E01359329B41907BFFBAC052629A8096)
    espaã§os paravertebrais cervicais e   planos musculogordurosos adjacentes ã­ntegros  (user-E01359329B41907BFFBAC052629A8096)
    Words: 8 Correct: 8 Errors: 0 Percent correct = 100.00% Error = 0.00% Accuracy = 100.00%
    Insertions: 0 Deletions: 0 Substitutions: 0
    TOTAL Words: 4057 Correct: 3687 Errors: 442
    TOTAL Percent correct = 90.88% Error = 10.89% Accuracy = 89.11%
    TOTAL Insertions: 72 Deletions: 243 Substitutions: 127
    ~~~~~~~~~
    
    Here is my file config
    

    Configuration script for sphinx trainer --mode:Perl--

    $CFG_VERBOSE = 1; # Determines how much goes to the screen.

    These are filled in at configuration time

    $CFG_DB_NAME = "db";

    Experiment name, will be used to name model files and log files

    $CFG_EXPTNAME = "$CFG_DB_NAME";

    Directory containing SphinxTrain binaries

    $CFG_BASE_DIR = "/home/lorena/astout";
    $CFG_SPHINXTRAIN_DIR = "/usr/local/lib/sphinxtrain";
    $CFG_BIN_DIR = "/usr/local/libexec/sphinxtrain";
    $CFG_SCRIPT_DIR = "/usr/local/lib/sphinxtrain/scripts";

    Audio waveform and feature file information

    $CFG_WAVFILES_DIR = "$CFG_BASE_DIR/wav";
    $CFG_WAVFILE_EXTENSION = 'wav';
    $CFG_WAVFILE_TYPE = 'mswav'; # one of nist, mswav, raw
    $CFG_FEATFILES_DIR = "$CFG_BASE_DIR/feat";
    $CFG_FEATFILE_EXTENSION = 'mfc';

    Feature extraction parameters

    $CFG_WAVFILE_SRATE = 16000.0;
    $CFG_NUM_FILT = 25; # For wideband speech it's 25, for telephone 8khz reasonable value is 15
    $CFG_LO_FILT = 130; # For telephone 8kHz speech value is 200
    $CFG_HI_FILT = 6800; # For telephone 8kHz speech value is 3500
    $CFG_TRANSFORM = "dct"; # Previously legacy transform is used, but dct is more accurate
    $CFG_LIFTER = "22"; # Cepstrum lifter is smoothing to improve recognition
    $CFG_VECTOR_LENGTH = 13; # 13 is usually enough

    $CFG_MIN_ITERATIONS = 1; # BW Iterate at least this many times
    $CFG_MAX_ITERATIONS = 10; # BW Don't iterate more than this, somethings likely wrong.

    (none/max) Type of AGC to apply to input files

    $CFG_AGC = 'none';

    (current/none) Type of cepstral mean subtraction/normalization

    to apply to input files

    $CFG_CMN = 'current';

    (yes/no) Normalize variance of input files to 1.0

    $CFG_VARNORM = 'no';

    (yes/no) Train full covariance matrices

    $CFG_FULLVAR = 'no';

    (yes/no) Use diagonals only of full covariance matrices for

    Forward-Backward evaluation (recommended if CFG_FULLVAR is yes)

    $CFG_DIAGFULL = 'no';

    (yes/no) Perform vocal tract length normalization in training. This

    will result in a "normalized" model which requires VTLN to be done

    during decoding as well.

    $CFG_VTLN = 'no';

    Starting warp factor for VTLN

    $CFG_VTLN_START = 0.80;

    Ending warp factor for VTLN

    $CFG_VTLN_END = 1.40;

    Step size of warping factors

    $CFG_VTLN_STEP = 0.05;

    Directory to write queue manager logs to

    $CFG_QMGR_DIR = "$CFG_BASE_DIR/qmanager";

    Directory to write training logs to

    $CFG_LOG_DIR = "$CFG_BASE_DIR/logdir";

    Directory for re-estimation counts

    $CFG_BWACCUM_DIR = "$CFG_BASE_DIR/bwaccumdir";

    Directory to write model parameter files to

    $CFG_MODEL_DIR = "$CFG_BASE_DIR/model_parameters";

    Directory containing transcripts and control files for

    speaker-adaptive training

    $CFG_LIST_DIR = "$CFG_BASE_DIR/etc";

    Decoding variables for MMIE training

    $CFG_LANGUAGEWEIGHT = "11.5";
    $CFG_BEAMWIDTH = "1e-100";
    $CFG_WORDBEAM = "1e-80";
    $CFG_LANGUAGEMODEL = "$CFG_LIST_DIR/$CFG_DB_NAME.lm.DMP";
    $CFG_WORDPENALTY = "0.2";

    Lattice pruning variables

    $CFG_ABEAM = "1e-50";
    $CFG_NBEAM = "1e-10";
    $CFG_PRUNED_DENLAT_DIR = "$CFG_BASE_DIR/pruned_denlat";

    MMIE training related variables

    $CFG_MMIE = "no";
    $CFG_MMIE_MAX_ITERATIONS = 5;
    $CFG_LATTICE_DIR = "$CFG_BASE_DIR/lattice";
    $CFG_MMIE_TYPE = "rand"; # Valid values are "rand", "best" or "ci"
    $CFG_MMIE_CONSTE = "3.0";
    $CFG_NUMLAT_DIR = "$CFG_BASE_DIR/numlat";
    $CFG_DENLAT_DIR = "$CFG_BASE_DIR/denlat";

    Variables used in main training of models

    $CFG_DICTIONARY = "$CFG_LIST_DIR/$CFG_DB_NAME.dic";
    $CFG_RAWPHONEFILE = "$CFG_LIST_DIR/$CFG_DB_NAME.phone";
    $CFG_FILLERDICT = "$CFG_LIST_DIR/$CFG_DB_NAME.filler";
    $CFG_LISTOFFILES = "$CFG_LIST_DIR/${CFG_DB_NAME}_train.fileids";
    $CFG_TRANSCRIPTFILE = "$CFG_LIST_DIR/${CFG_DB_NAME}_train.transcription";
    $CFG_FEATPARAMS = "$CFG_LIST_DIR/feat.params";

    Variables used in characterizing models

    $CFG_HMM_TYPE = '.cont.'; # Sphinx 4, PocketSphinx

    $CFG_HMM_TYPE = '.semi.'; # PocketSphinx

    $CFG_HMM_TYPE = '.ptm.'; # PocketSphinx (larger data sets)

    if (($CFG_HMM_TYPE ne ".semi.")
    and ($CFG_HMM_TYPE ne ".ptm.")
    and ($CFG_HMM_TYPE ne ".cont.")) {
    die "Please choose one CFG_HMM_TYPE out of '.cont.', '.ptm.', or '.semi.', " .
    "currently $CFG_HMM_TYPE\n";
    }

    This configuration is fastest and best for most acoustic models in

    PocketSphinx and Sphinx-III. See below for Sphinx-II.

    $CFG_STATESPERHMM = 3;
    $CFG_SKIPSTATE = 'no';

    if ($CFG_HMM_TYPE eq '.semi.') {
    $CFG_DIRLABEL = 'semi';

    Four stream features for PocketSphinx

    $CFG_FEATURE = "s2_4x";
    $CFG_NUM_STREAMS = 4;
    $CFG_INITIAL_NUM_DENSITIES = 256;
    $CFG_FINAL_NUM_DENSITIES = 256;
    die "For semi continuous models, the initial and final models have the same density"
    if ($CFG_INITIAL_NUM_DENSITIES != $CFG_FINAL_NUM_DENSITIES);
    } elsif ($CFG_HMM_TYPE eq '.ptm.') {
    $CFG_DIRLABEL = 'ptm';

    Four stream features for PocketSphinx

    $CFG_FEATURE = "s2_4x";
    $CFG_NUM_STREAMS = 4;
    $CFG_INITIAL_NUM_DENSITIES = 64;
    $CFG_FINAL_NUM_DENSITIES = 64;
    die "For phonetically tied models, the initial and final models have the same density"
    if ($CFG_INITIAL_NUM_DENSITIES != $CFG_FINAL_NUM_DENSITIES);
    } elsif ($CFG_HMM_TYPE eq '.cont.') {
    $CFG_DIRLABEL = 'cont';

    Single stream features - Sphinx 3

    $CFG_FEATURE = "1s_c_d_dd";
    $CFG_NUM_STREAMS = 1;
    $CFG_INITIAL_NUM_DENSITIES = 1;
    $CFG_FINAL_NUM_DENSITIES = 8;
    die "The initial has to be less than the final number of densities"
    if ($CFG_INITIAL_NUM_DENSITIES > $CFG_FINAL_NUM_DENSITIES);
    }

    Number of top gaussians to score a frame. A little bit less accurate computations

    make training significantly faster. Uncomment to apply this during the training

    For good accuracy make sure you are using the same setting in decoder

    In theory this can be different for various training stages. For example 4 for

    CI stage and 16 for CD stage

    $CFG_CI_TOPN = 4;

    $CFG_CD_TOPN = 16;

    (yes/no) Train multiple-gaussian context-independent models (useful

    for alignment, use 'no' otherwise) in the models created

    specifically for forced alignment

    $CFG_FALIGN_CI_MGAU = 'no';

    (yes/no) Train multiple-gaussian context-independent models (useful

    for alignment, use 'no' otherwise)

    $CFG_CI_MGAU = 'no';

    (yes/no) Train context-dependent models

    $CFG_CD_TRAIN = 'yes';

    Number of tied states (senones) to create in decision-tree clustering

    $CFG_N_TIED_STATES = 200;

    How many parts to run Forward-Backward estimatinon in

    $CFG_NPART = 1;

    (yes/no) Train a single decision tree for all phones (actually one

    per state) (useful for grapheme-based models, use 'no' otherwise)

    $CFG_CROSS_PHONE_TREES = 'no';

    Use force-aligned transcripts (if available) as input to training

    $CFG_FORCEDALIGN = 'no';

    Use a specific set of models for force alignment. If not defined,

    context-independent models for the current experiment will be used.

    $CFG_FORCE_ALIGN_MODELDIR = "$CFG_MODEL_DIR/$CFG_EXPTNAME.falign_ci_$CFG_DIRLABEL";

    Use a specific dictionary and filler dictionary for force alignment.

    If these are not defined, a dictionary and filler dictionary will be

    created from $CFG_DICTIONARY and $CFG_FILLERDICT, with noise words

    removed from the filler dictionary and added to the dictionary (this

    is because the force alignment is not very good at inserting them)

    $CFG_FORCE_ALIGN_DICTIONARY = "$ST::CFG_BASE_DIR/falignout$ST::CFG_EXPTNAME.falign.dict";;

    $CFG_FORCE_ALIGN_FILLERDICT = "$ST::CFG_BASE_DIR/falignout/$ST::CFG_EXPTNAME.falign.fdict";;

    Use a particular beam width for force alignment. The wider

    (i.e. smaller numerically) the beam, the fewer sentences will be

    rejected for bad alignment.

    $CFG_FORCE_ALIGN_BEAM = 1e-60;

    Calculate an LDA/MLLT transform?

    $CFG_LDA_MLLT = 'no';

    Dimensionality of LDA/MLLT output

    $CFG_LDA_DIMENSION = 29;

    This is actually just a difference in log space (it doesn't make

    sense otherwise, because different feature parameters have very

    different likelihoods)

    $CFG_CONVERGENCE_RATIO = 0.1;

    Queue::POSIX for multiple CPUs on a local machine

    Queue::PBS to use a PBS/TORQUE queue

    $CFG_QUEUE_TYPE = "Queue";

    Name of queue to use for PBS/TORQUE

    $CFG_QUEUE_NAME = "workq";

    (yes/no) Build questions for decision tree clustering automatically

    $CFG_MAKE_QUESTS = "yes";

    If CFG_MAKE_QUESTS is yes, questions are written to this file.

    If CFG_MAKE_QUESTS is no, questions are read from this file.

    $CFG_QUESTION_SET = "${CFG_BASE_DIR}/model_architecture/${CFG_EXPTNAME}.tree_questions";

    $CFG_QUESTION_SET = "${CFG_BASE_DIR}/linguistic_questions";

    $CFG_CP_OPERATION = "${CFG_BASE_DIR}/model_architecture/${CFG_EXPTNAME}.cpmeanvar";

    Configuration for grapheme-to-phoneme model

    $CFG_G2P_MODEL= 'no';

    Configuration script for sphinx decoder

    Variables starting with $DEC_CFG_ refer to decoder specific

    arguments, those starting with $CFG_ refer to trainer arguments,

    some of them also used by the decoder.

    $DEC_CFG_VERBOSE = 1; # Determines how much goes to the screen.

    These are filled in at configuration time

    Name of the decoding script to use (psdecode.pl or s3decode.pl, probably)

    $DEC_CFG_SCRIPT = 'psdecode.pl';

    $DEC_CFG_EXPTNAME = "$CFG_EXPTNAME";
    $DEC_CFG_JOBNAME = "$CFG_EXPTNAME"."_job";

    Models to use.

    $DEC_CFG_MODEL_NAME = "$CFG_EXPTNAME.cd_${CFG_DIRLABEL}_${CFG_N_TIED_STATES}";

    $DEC_CFG_FEATFILES_DIR = "$CFG_BASE_DIR/feat";
    $DEC_CFG_FEATFILE_EXTENSION = '.mfc';
    $DEC_CFG_AGC = $CFG_AGC;
    $DEC_CFG_CMN = $CFG_CMN;
    $DEC_CFG_VARNORM = $CFG_VARNORM;

    $DEC_CFG_QMGR_DIR = "$CFG_BASE_DIR/qmanager";
    $DEC_CFG_LOG_DIR = "$CFG_BASE_DIR/logdir";
    $DEC_CFG_MODEL_DIR = "$CFG_MODEL_DIR";

    $DEC_CFG_DICTIONARY = "$CFG_BASE_DIR/etc/$CFG_DB_NAME.dic";
    $DEC_CFG_FILLERDICT = "$CFG_BASE_DIR/etc/$CFG_DB_NAME.filler";
    $DEC_CFG_LISTOFFILES = "$CFG_BASE_DIR/etc/${CFG_DB_NAME}_test.fileids";
    $DEC_CFG_TRANSCRIPTFILE = "$CFG_BASE_DIR/etc/${CFG_DB_NAME}_test.transcription";
    $DEC_CFG_RESULT_DIR = "$CFG_BASE_DIR/result";
    $DEC_CFG_PRESULT_DIR = "$CFG_BASE_DIR/presult";

    This variables, used by the decoder, have to be user defined, and

    may affect the decoder output

    $DEC_CFG_LANGUAGEMODEL = "$CFG_BASE_DIR/etc/${CFG_DB_NAME}.lm.DMP";

    Or can be JSGF or FSG too, used if uncommented

    $DEC_CFG_GRAMMAR = "$CFG_BASE_DIR/etc/${CFG_DB_NAME}.jsgf";

    $DEC_CFG_FSG = "$CFG_BASE_DIR/etc/${CFG_DB_NAME}.fsg";

    $DEC_CFG_LANGUAGEWEIGHT = "10";
    $DEC_CFG_BEAMWIDTH = "1e-80";
    $DEC_CFG_WORDBEAM = "1e-40";

    $DEC_CFG_ALIGN = "builtin";

    $DEC_CFG_NPART = 1; # Define how many pieces to split decode in

    This variable has to be defined, otherwise utils.pl will not load.

    $CFG_DONE = 1;

    return 1;
    ~~~~~~~~~~~~~~~

    I would like to understand why the RFT value is always 0.02 and wherein it is measured RFT. It is in seconds?

     

    Last edit: Nickolay V. Shmyrev 2015-11-22
  • Nickolay V. Shmyrev

    Well.. Here is my file db-1-1.log

    It is polite to add files as an archive in attachment.

    I would like to understand why the RFT value is always 0.02

    Your model is very small so decoding is quite fast. "Always" is not applicable here since you didn't change any decoding parameters. If you increase the BEAMWIDTH in config file decoding will be slower.

    and wherein it is measured RFT. It is in seconds?

    real-time factor is a ratio between lenght of audio (in seconds) and decoding time (in seconds also) so it is unitless (dimensionless).

     

    Last edit: Nickolay V. Shmyrev 2015-11-22

Log in to post a comment.