
Can not listening

Help
Ludo
2015-09-27
2015-09-29
  • Ludo

    Ludo - 2015-09-27

    Hello,

    I am trying to set up speech recognition on a Raspberry Pi running Raspbian with pocketsphinx-5prealpha, but it gets stuck in the "READY" state. The output of the following command is shown below:

    sudo ./pocketsphinx_continuous -inmic yes
    

    NB: I think the microphone is OK, because I can hear everything I say while it is stuck on READY, and I have already tested the USB sound card (Daffodil C-Media USB) on Windows.
    I hope someone has already had the same problem and solved it!
    Thank you in advance.

    pi@raspberrypi ~ $ cd /home/pi/pocketsphinx-5prealpha/src/programs
    pi@raspberrypi ~/pocketsphinx-5prealpha/src/programs $ sudo ./pocketsphinx_continuous -inmic yes
    INFO: pocketsphinx.c(145): Parsed model-specific feature parameters from /usr/local/share/pocketsphinx/model/en-us/en-us/feat.params
    Current configuration:
    [NAME]          [DEFLT]     [VALUE]
    -agc            none        none
    -agcthresh      2.0     2.000000e+00
    -allphone               
    -allphone_ci        no      no
    -alpha          0.97        9.700000e-01
    -ascale         20.0        2.000000e+01
    -aw         1       1
    -backtrace      no      no
    -beam           1e-48       1.000000e-48
    -bestpath       yes     yes
    -bestpathlw     9.5     9.500000e+00
    -ceplen         13      13
    -cmn            current     current
    -cmninit        8.0     40,3,-1
    -compallsen     no      no
    -debug                  0
    -dict                   /usr/local/share/pocketsphinx/model/en-us/cmudict-en-us.dict
    -dictcase       no      no
    -dither         no      no
    -doublebw       no      no
    -ds         1       1
    -fdict                  /usr/local/share/pocketsphinx/model/en-us/en-us/noisedict
    -feat           1s_c_d_dd   1s_c_d_dd
    -featparams             /usr/local/share/pocketsphinx/model/en-us/en-us/feat.params
    -fillprob       1e-8        1.000000e-08
    -frate          100     100
    -fsg                    
    -fsgusealtpron      yes     yes
    -fsgusefiller       yes     yes
    -fwdflat        yes     yes
    -fwdflatbeam        1e-64       1.000000e-64
    -fwdflatefwid       4       4
    -fwdflatlw      8.5     8.500000e+00
    -fwdflatsfwin       25      25
    -fwdflatwbeam       7e-29       7.000000e-29
    -fwdtree        yes     yes
    -hmm                    /usr/local/share/pocketsphinx/model/en-us/en-us
    -input_endian       little      little
    -jsgf                   
    -keyphrase              
    -kws                    
    -kws_delay      10      10
    -kws_plp        1e-1        1.000000e-01
    -kws_threshold      1       1.000000e+00
    -latsize        5000        5000
    -lda                    
    -ldadim         0       0
    -lifter         0       22
    -lm                 /usr/local/share/pocketsphinx/model/en-us/en-us.lm.bin
    -lmctl                  
    -lmname                 
    -logbase        1.0001      1.000100e+00
    -logfn                  
    -logspec        no      no
    -lowerf         133.33334   1.300000e+02
    -lpbeam         1e-40       1.000000e-40
    -lponlybeam     7e-29       7.000000e-29
    -lw         6.5     6.500000e+00
    -maxhmmpf       30000       30000
    -maxwpf         -1      -1
    -mdef                   /usr/local/share/pocketsphinx/model/en-us/en-us/mdef
    -mean                   /usr/local/share/pocketsphinx/model/en-us/en-us/means
    -mfclogdir              
    -min_endfr      0       0
    -mixw                   
    -mixwfloor      0.0000001   1.000000e-07
    -mllr                   
    -mmap           yes     yes
    -ncep           13      13
    -nfft           512     512
    -nfilt          40      25
    -nwpen          1.0     1.000000e+00
    -pbeam          1e-48       1.000000e-48
    -pip            1.0     1.000000e+00
    -pl_beam        1e-10       1.000000e-10
    -pl_pbeam       1e-10       1.000000e-10
    -pl_pip         1.0     1.000000e+00
    -pl_weight      3.0     3.000000e+00
    -pl_window      5       5
    -rawlogdir              
    -remove_dc      no      no
    -remove_noise       yes     yes
    -remove_silence     yes     yes
    -round_filters      yes     yes
    -samprate       16000       1.600000e+04
    -seed           -1      -1
    -sendump                /usr/local/share/pocketsphinx/model/en-us/en-us/sendump
    -senlogdir              
    -senmgau                
    -silprob        0.005       5.000000e-03
    -smoothspec     no      no
    -svspec                 0-12/13-25/26-38
    -tmat                   /usr/local/share/pocketsphinx/model/en-us/en-us/transition_matrices
    -tmatfloor      0.0001      1.000000e-04
    -topn           4       4
    -topn_beam      0       0
    -toprule                
    -transform      legacy      dct
    -unit_area      yes     yes
    -upperf         6855.4976   6.800000e+03
    -uw         1.0     1.000000e+00
    -vad_postspeech     50      50
    -vad_prespeech      20      20
    -vad_startspeech    10      10
    -vad_threshold      2.0     2.000000e+00
    -var                    /usr/local/share/pocketsphinx/model/en-us/en-us/variances
    -varfloor       0.0001      1.000000e-04
    -varnorm        no      no
    -verbose        no      no
    -warp_params                
    -warp_type      inverse_linear  inverse_linear
    -wbeam          7e-29       7.000000e-29
    -wip            0.65        6.500000e-01
    -wlen           0.025625    2.562500e-02
    
    INFO: feat.c(715): Initializing feature stream to type: '1s_c_d_dd', ceplen=13, CMN='current', VARNORM='no', AGC='none'
    INFO: cmn.c(143): mean[0]= 12.00, mean[1..12]= 0.0
    INFO: acmod.c(164): Using subvector specification 0-12/13-25/26-38
    INFO: mdef.c(518): Reading model definition: /usr/local/share/pocketsphinx/model/en-us/en-us/mdef
    INFO: mdef.c(531): Found byte-order mark BMDF, assuming this is a binary mdef file
    INFO: bin_mdef.c(336): Reading binary model definition: /usr/local/share/pocketsphinx/model/en-us/en-us/mdef
    INFO: bin_mdef.c(516): 42 CI-phone, 137053 CD-phone, 3 emitstate/phone, 126 CI-sen, 5126 Sen, 29324 Sen-Seq
    INFO: tmat.c(206): Reading HMM transition probability matrices: /usr/local/share/pocketsphinx/model/en-us/en-us/transition_matrices
    INFO: acmod.c(117): Attempting to use PTM computation module
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /usr/local/share/pocketsphinx/model/en-us/en-us/means
    INFO: ms_gauden.c(292): 42 codebook, 3 feature, size: 
    INFO: ms_gauden.c(294):  128x13
    INFO: ms_gauden.c(294):  128x13
    INFO: ms_gauden.c(294):  128x13
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /usr/local/share/pocketsphinx/model/en-us/en-us/variances
    INFO: ms_gauden.c(292): 42 codebook, 3 feature, size: 
    INFO: ms_gauden.c(294):  128x13
    INFO: ms_gauden.c(294):  128x13
    INFO: ms_gauden.c(294):  128x13
    INFO: ms_gauden.c(354): 222 variance values floored
    INFO: ptm_mgau.c(476): Loading senones from dump file /usr/local/share/pocketsphinx/model/en-us/en-us/sendump
    INFO: ptm_mgau.c(500): BEGIN FILE FORMAT DESCRIPTION
    INFO: ptm_mgau.c(563): Rows: 128, Columns: 5126
    INFO: ptm_mgau.c(595): Using memory-mapped I/O for senones
    INFO: ptm_mgau.c(835): Maximum top-N: 4
    INFO: phone_loop_search.c(114): State beam -225 Phone exit beam -225 Insertion penalty 0
    INFO: dict.c(320): Allocating 138623 * 20 bytes (2707 KiB) for word entries
    INFO: dict.c(333): Reading main dictionary: /usr/local/share/pocketsphinx/model/en-us/cmudict-en-us.dict
    INFO: dict.c(213): Allocated 1014 KiB for strings, 1677 KiB for phones
    INFO: dict.c(336): 134522 words read
    INFO: dict.c(358): Reading filler dictionary: /usr/local/share/pocketsphinx/model/en-us/en-us/noisedict
    INFO: dict.c(213): Allocated 0 KiB for strings, 0 KiB for phones
    INFO: dict.c(361): 5 words read
    INFO: dict2pid.c(396): Building PID tables for dictionary
    INFO: dict2pid.c(406): Allocating 42^3 * 2 bytes (144 KiB) for word-initial triphones
    INFO: dict2pid.c(132): Allocated 21336 bytes (20 KiB) for word-final triphones
    INFO: dict2pid.c(196): Allocated 21336 bytes (20 KiB) for single-phone word triphones
    INFO: ngram_model_trie.c(456): Trying to read LM in trie binary format
    INFO: ngram_search_fwdtree.c(99): 790 unique initial diphones
    INFO: ngram_search_fwdtree.c(148): 0 root, 0 non-root channels, 57 single-phone words
    INFO: ngram_search_fwdtree.c(186): Creating search tree
    INFO: ngram_search_fwdtree.c(192): before: 0 root, 0 non-root channels, 57 single-phone words
    INFO: ngram_search_fwdtree.c(326): after: max nonroot chan increased to 152144
    INFO: ngram_search_fwdtree.c(339): after: 722 root, 152016 non-root channels, 53 single-phone words
    INFO: ngram_search_fwdflat.c(157): fwdflat: min_ef_width = 4, max_sf_win = 25
    INFO: continuous.c(305): /home/pi/pocketsphinx-5prealpha/src/programs/.libs/lt-pocketsphinx_continuous COMPILED ON: Sep 27 2015, AT: 13:43:33
    
    READY....
    
     
    • Ludo

      Ludo - 2015-09-29

      Hello all,

      I reinstalled PulseAudio and the other dependencies before installing Sphinx, and it now seems better, but it still doesn't work. Does anybody know how to make sure Sphinx is capturing the stream from the microphone?
      Below is the command-line output. It just loops through READY, Listening and INFO messages, and that is all.
      Thank you.

      pi@raspberrypi ~/pocketsphinx-5prealpha/src/programs $ sudo ./pocketsphinx_continuous -inmic yes
      INFO: pocketsphinx.c(145): Parsed model-specific feature parameters from /usr/local/share/pocketsphinx/model/en-us/en-us/feat.params
      Current configuration:
      [NAME]          [DEFLT]     [VALUE]
      -agc            none        none
      -agcthresh      2.0     2.000000e+00
      -allphone               
      -allphone_ci        no      no
      -alpha          0.97        9.700000e-01
      -ascale         20.0        2.000000e+01
      -aw         1       1
      -backtrace      no      no
      -beam           1e-48       1.000000e-48
      -bestpath       yes     yes
      -bestpathlw     9.5     9.500000e+00
      -ceplen         13      13
      -cmn            current     current
      -cmninit        8.0     40,3,-1
      -compallsen     no      no
      -debug                  0
      -dict                   /usr/local/share/pocketsphinx/model/en-us/cmudict-en-us.dict
      -dictcase       no      no
      -dither         no      no
      -doublebw       no      no
      -ds         1       1
      -fdict                  /usr/local/share/pocketsphinx/model/en-us/en-us/noisedict
      -feat           1s_c_d_dd   1s_c_d_dd
      -featparams             /usr/local/share/pocketsphinx/model/en-us/en-us/feat.params
      -fillprob       1e-8        1.000000e-08
      -frate          100     100
      -fsg                    
      -fsgusealtpron      yes     yes
      -fsgusefiller       yes     yes
      -fwdflat        yes     yes
      -fwdflatbeam        1e-64       1.000000e-64
      -fwdflatefwid       4       4
      -fwdflatlw      8.5     8.500000e+00
      -fwdflatsfwin       25      25
      -fwdflatwbeam       7e-29       7.000000e-29
      -fwdtree        yes     yes
      -hmm                    /usr/local/share/pocketsphinx/model/en-us/en-us
      -input_endian       little      little
      -jsgf                   
      -keyphrase              
      -kws                    
      -kws_delay      10      10
      -kws_plp        1e-1        1.000000e-01
      -kws_threshold      1       1.000000e+00
      -latsize        5000        5000
      -lda                    
      -ldadim         0       0
      -lifter         0       22
      -lm                 /usr/local/share/pocketsphinx/model/en-us/en-us.lm.bin
      -lmctl                  
      -lmname                 
      -logbase        1.0001      1.000100e+00
      -logfn                  
      -logspec        no      no
      -lowerf         133.33334   1.300000e+02
      -lpbeam         1e-40       1.000000e-40
      -lponlybeam     7e-29       7.000000e-29
      -lw         6.5     6.500000e+00
      -maxhmmpf       30000       30000
      -maxwpf         -1      -1
      -mdef                   /usr/local/share/pocketsphinx/model/en-us/en-us/mdef
      -mean                   /usr/local/share/pocketsphinx/model/en-us/en-us/means
      -mfclogdir              
      -min_endfr      0       0
      -mixw                   
      -mixwfloor      0.0000001   1.000000e-07
      -mllr                   
      -mmap           yes     yes
      -ncep           13      13
      -nfft           512     512
      -nfilt          40      25
      -nwpen          1.0     1.000000e+00
      -pbeam          1e-48       1.000000e-48
      -pip            1.0     1.000000e+00
      -pl_beam        1e-10       1.000000e-10
      -pl_pbeam       1e-10       1.000000e-10
      -pl_pip         1.0     1.000000e+00
      -pl_weight      3.0     3.000000e+00
      -pl_window      5       5
      -rawlogdir              
      -remove_dc      no      no
      -remove_noise       yes     yes
      -remove_silence     yes     yes
      -round_filters      yes     yes
      -samprate       16000       1.600000e+04
      -seed           -1      -1
      -sendump                /usr/local/share/pocketsphinx/model/en-us/en-us/sendump
      -senlogdir              
      -senmgau                
      -silprob        0.005       5.000000e-03
      -smoothspec     no      no
      -svspec                 0-12/13-25/26-38
      -tmat                   /usr/local/share/pocketsphinx/model/en-us/en-us/transition_matrices
      -tmatfloor      0.0001      1.000000e-04
      -topn           4       4
      -topn_beam      0       0
      -toprule                
      -transform      legacy      dct
      -unit_area      yes     yes
      -upperf         6855.4976   6.800000e+03
      -uw         1.0     1.000000e+00
      -vad_postspeech     50      50
      -vad_prespeech      20      20
      -vad_startspeech    10      10
      -vad_threshold      2.0     2.000000e+00
      -var                    /usr/local/share/pocketsphinx/model/en-us/en-us/variances
      -varfloor       0.0001      1.000000e-04
      -varnorm        no      no
      -verbose        no      no
      -warp_params                
      -warp_type      inverse_linear  inverse_linear
      -wbeam          7e-29       7.000000e-29
      -wip            0.65        6.500000e-01
      -wlen           0.025625    2.562500e-02
      
      INFO: feat.c(715): Initializing feature stream to type: '1s_c_d_dd', ceplen=13, CMN='current', VARNORM='no', AGC='none'
      INFO: cmn.c(143): mean[0]= 12.00, mean[1..12]= 0.0
      INFO: acmod.c(164): Using subvector specification 0-12/13-25/26-38
      INFO: mdef.c(518): Reading model definition: /usr/local/share/pocketsphinx/model/en-us/en-us/mdef
      INFO: mdef.c(531): Found byte-order mark BMDF, assuming this is a binary mdef file
      INFO: bin_mdef.c(336): Reading binary model definition: /usr/local/share/pocketsphinx/model/en-us/en-us/mdef
      INFO: bin_mdef.c(516): 42 CI-phone, 137053 CD-phone, 3 emitstate/phone, 126 CI-sen, 5126 Sen, 29324 Sen-Seq
      INFO: tmat.c(206): Reading HMM transition probability matrices: /usr/local/share/pocketsphinx/model/en-us/en-us/transition_matrices
      INFO: acmod.c(117): Attempting to use PTM computation module
      INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /usr/local/share/pocketsphinx/model/en-us/en-us/means
      INFO: ms_gauden.c(292): 42 codebook, 3 feature, size: 
      INFO: ms_gauden.c(294):  128x13
      INFO: ms_gauden.c(294):  128x13
      INFO: ms_gauden.c(294):  128x13
      INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /usr/local/share/pocketsphinx/model/en-us/en-us/variances
      INFO: ms_gauden.c(292): 42 codebook, 3 feature, size: 
      INFO: ms_gauden.c(294):  128x13
      INFO: ms_gauden.c(294):  128x13
      INFO: ms_gauden.c(294):  128x13
      INFO: ms_gauden.c(354): 222 variance values floored
      INFO: ptm_mgau.c(476): Loading senones from dump file /usr/local/share/pocketsphinx/model/en-us/en-us/sendump
      INFO: ptm_mgau.c(500): BEGIN FILE FORMAT DESCRIPTION
      INFO: ptm_mgau.c(563): Rows: 128, Columns: 5126
      INFO: ptm_mgau.c(595): Using memory-mapped I/O for senones
      INFO: ptm_mgau.c(835): Maximum top-N: 4
      INFO: phone_loop_search.c(114): State beam -225 Phone exit beam -225 Insertion penalty 0
      INFO: dict.c(320): Allocating 138623 * 20 bytes (2707 KiB) for word entries
      INFO: dict.c(333): Reading main dictionary: /usr/local/share/pocketsphinx/model/en-us/cmudict-en-us.dict
      INFO: dict.c(213): Allocated 1014 KiB for strings, 1677 KiB for phones
      INFO: dict.c(336): 134522 words read
      INFO: dict.c(358): Reading filler dictionary: /usr/local/share/pocketsphinx/model/en-us/en-us/noisedict
      INFO: dict.c(213): Allocated 0 KiB for strings, 0 KiB for phones
      INFO: dict.c(361): 5 words read
      INFO: dict2pid.c(396): Building PID tables for dictionary
      INFO: dict2pid.c(406): Allocating 42^3 * 2 bytes (144 KiB) for word-initial triphones
      INFO: dict2pid.c(132): Allocated 21336 bytes (20 KiB) for word-final triphones
      INFO: dict2pid.c(196): Allocated 21336 bytes (20 KiB) for single-phone word triphones
      INFO: ngram_model_trie.c(456): Trying to read LM in trie binary format
      INFO: ngram_search_fwdtree.c(99): 790 unique initial diphones
      INFO: ngram_search_fwdtree.c(148): 0 root, 0 non-root channels, 57 single-phone words
      INFO: ngram_search_fwdtree.c(186): Creating search tree
      INFO: ngram_search_fwdtree.c(192): before: 0 root, 0 non-root channels, 57 single-phone words
      INFO: ngram_search_fwdtree.c(326): after: max nonroot chan increased to 152144
      INFO: ngram_search_fwdtree.c(339): after: 722 root, 152016 non-root channels, 53 single-phone words
      INFO: ngram_search_fwdflat.c(157): fwdflat: min_ef_width = 4, max_sf_win = 25
      INFO: continuous.c(305): /home/pi/pocketsphinx-5prealpha/src/programs/.libs/lt-pocketsphinx_continuous COMPILED ON: Sep 28 2015, AT: 20:12:20
      
      READY....
      Listening...
      INFO: cmn_prior.c(131): cmn_prior_update: from < 40.00  3.00 -1.00  0.00  0.00  0.00  0.00  0.00  0.00  0.00  0.00  0.00  0.00 >
      INFO: cmn_prior.c(149): cmn_prior_update: to   < 87.50 -3.33 -7.86 -9.85  1.77 -1.52 -4.61 -3.89 -11.73 -4.48 -2.11  4.52 -1.47 >
      INFO: ngram_search_fwdtree.c(1553):      941 words recognized (12/fr)
      INFO: ngram_search_fwdtree.c(1555):   139522 senones evaluated (1812/fr)
      INFO: ngram_search_fwdtree.c(1559):   887375 channels searched (11524/fr), 25915 1st, 35703 last
      INFO: ngram_search_fwdtree.c(1562):     2303 words for which last channels evaluated (29/fr)
      INFO: ngram_search_fwdtree.c(1564):    68624 candidate words for entering last phone (891/fr)
      INFO: ngram_search_fwdtree.c(1567): fwdtree 9.45 CPU 12.273 xRT
      INFO: ngram_search_fwdtree.c(1570): fwdtree 14.22 wall 18.464 xRT
      INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 45 words
      INFO: ngram_search_fwdflat.c(948):      559 words recognized (7/fr)
      INFO: ngram_search_fwdflat.c(950):    38794 senones evaluated (504/fr)
      INFO: ngram_search_fwdflat.c(952):    34229 channels searched (444/fr)
      INFO: ngram_search_fwdflat.c(954):     2303 words searched (29/fr)
      INFO: ngram_search_fwdflat.c(957):     1854 word transitions (24/fr)
      INFO: ngram_search_fwdflat.c(960): fwdflat 0.84 CPU 1.091 xRT
      INFO: ngram_search_fwdflat.c(963): fwdflat 1.05 wall 1.360 xRT
      INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.21
      INFO: ngram_search.c(1279): Eliminated 1 nodes before end node
      INFO: ngram_search.c(1384): Lattice has 159 nodes, 188 links
      INFO: ps_lattice.c(1380): Bestpath score: -1452
      INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:21:75) = -57968
      INFO: ps_lattice.c(1441): Joint P(O,S) = -124949 P(S|O) = -66981
      INFO: ngram_search.c(875): bestpath 0.01 CPU 0.013 xRT
      INFO: ngram_search.c(878): bestpath 0.02 wall 0.032 xRT
      uh
      READY....
      Listening...
      INFO: cmn_prior.c(131): cmn_prior_update: from < 87.50 -3.33 -7.86 -9.85  1.77 -1.52 -4.61 -3.89 -11.73 -4.48 -2.11  4.52 -1.47 >
      INFO: cmn_prior.c(149): cmn_prior_update: to   < 87.61 -2.42 -2.78 -4.75 -0.14 -3.80 -6.73 -4.92 -3.08 -1.35 -5.07  0.92  0.82 >
      INFO: ngram_search_fwdtree.c(1553):     1694 words recognized (18/fr)
      INFO: ngram_search_fwdtree.c(1555):   231360 senones evaluated (2488/fr)
      INFO: ngram_search_fwdtree.c(1559):  1714878 channels searched (18439/fr), 48265 1st, 51371 last
      INFO: ngram_search_fwdtree.c(1562):     3447 words for which last channels evaluated (37/fr)
      INFO: ngram_search_fwdtree.c(1564):   202674 candidate words for entering last phone (2179/fr)
      INFO: ngram_search_fwdtree.c(1567): fwdtree 22.58 CPU 24.280 xRT
      INFO: ngram_search_fwdtree.c(1570): fwdtree 36.01 wall 38.720 xRT
      INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 60 words
      INFO: ngram_search_fwdflat.c(948):     1041 words recognized (11/fr)
      INFO: ngram_search_fwdflat.c(950):    51122 senones evaluated (550/fr)
      INFO: ngram_search_fwdflat.c(952):    65836 channels searched (707/fr)
      INFO: ngram_search_fwdflat.c(954):     3584 words searched (38/fr)
      INFO: ngram_search_fwdflat.c(957):     3024 word transitions (32/fr)
      INFO: ngram_search_fwdflat.c(960): fwdflat 1.32 CPU 1.419 xRT
      INFO: ngram_search_fwdflat.c(963): fwdflat 1.63 wall 1.751 xRT
      INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.42
      INFO: ngram_search.c(1279): Eliminated 1 nodes before end node
      INFO: ngram_search.c(1384): Lattice has 247 nodes, 1745 links
      INFO: ps_lattice.c(1380): Bestpath score: -1968
      INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:42:91) = -92079
      INFO: ps_lattice.c(1441): Joint P(O,S) = -140311 P(S|O) = -48232
      INFO: ngram_search.c(875): bestpath 0.15 CPU 0.163 xRT
      INFO: ngram_search.c(878): bestpath 0.41 wall 0.451 xRT
      and
      READY....
      Listening...
      INFO: cmn_prior.c(131): cmn_prior_update: from < 87.61 -2.42 -2.78 -4.75 -0.14 -3.80 -6.73 -4.92 -3.08 -1.35 -5.07  0.92  0.82 >
      INFO: cmn_prior.c(149): cmn_prior_update: to   < 89.69 -0.72 -1.70 -2.72 -0.20 -2.61 -4.95 -3.35 -1.59 -0.24 -2.67  1.06  0.69 >
      INFO: ngram_search_fwdtree.c(1553):     1567 words recognized (15/fr)
      INFO: ngram_search_fwdtree.c(1555):   255683 senones evaluated (2435/fr)
      INFO: ngram_search_fwdtree.c(1559):  1645778 channels searched (15674/fr), 46634 1st, 56745 last
      INFO: ngram_search_fwdtree.c(1562):     3641 words for which last channels evaluated (34/fr)
      INFO: ngram_search_fwdtree.c(1564):   185252 candidate words for entering last phone (1764/fr)
      INFO: ngram_search_fwdtree.c(1567): fwdtree 19.21 CPU 18.295 xRT
      INFO: ngram_search_fwdtree.c(1570): fwdtree 46.22 wall 44.018 xRT
      INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 58 words
      INFO: ngram_search_fwdflat.c(948):      774 words recognized (7/fr)
      INFO: ngram_search_fwdflat.c(950):    51553 senones evaluated (491/fr)
      INFO: ngram_search_fwdflat.c(952):    52515 channels searched (500/fr)
      INFO: ngram_search_fwdflat.c(954):     3219 words searched (30/fr)
      INFO: ngram_search_fwdflat.c(957):     2575 word transitions (24/fr)
      INFO: ngram_search_fwdflat.c(960): fwdflat 1.18 CPU 1.124 xRT
      INFO: ngram_search_fwdflat.c(963): fwdflat 1.45 wall 1.386 xRT
      INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.29
      INFO: ngram_search.c(1279): Eliminated 1 nodes before end node
      INFO: ngram_search.c(1384): Lattice has 249 nodes, 601 links
      INFO: ps_lattice.c(1380): Bestpath score: -1790
      INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:29:103) = -55938
      INFO: ps_lattice.c(1441): Joint P(O,S) = -133390 P(S|O) = -77452
      INFO: ngram_search.c(875): bestpath 0.03 CPU 0.029 xRT
      INFO: ngram_search.c(878): bestpath 0.13 wall 0.129 xRT
      
      READY....
      Listening...
      INFO: cmn_prior.c(131): cmn_prior_update: from < 89.69 -0.72 -1.70 -2.72 -0.20 -2.61 -4.95 -3.35 -1.59 -0.24 -2.67  1.06  0.69 >
      INFO: cmn_prior.c(149): cmn_prior_update: to   < 89.73 -0.85 -0.67 -1.10  0.16 -1.98 -3.85 -2.24 -0.80  0.03 -1.99 -0.01 -0.09 >
      INFO: ngram_search_fwdtree.c(1553):     1697 words recognized (16/fr)
      INFO: ngram_search_fwdtree.c(1555):   307031 senones evaluated (2981/fr)
      INFO: ngram_search_fwdtree.c(1559):  2006339 channels searched (19479/fr), 61002 1st, 50104 last
      INFO: ngram_search_fwdtree.c(1562):     3601 words for which last channels evaluated (34/fr)
      INFO: ngram_search_fwdtree.c(1564):   145241 candidate words for entering last phone (1410/fr)
      INFO: ngram_search_fwdtree.c(1567): fwdtree 19.32 CPU 18.757 xRT
      INFO: ngram_search_fwdtree.c(1570): fwdtree 31.96 wall 31.026 xRT
      INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 53 words
      INFO: ngram_search_fwdflat.c(948):     1049 words recognized (10/fr)
      INFO: ngram_search_fwdflat.c(950):    59194 senones evaluated (575/fr)
      INFO: ngram_search_fwdflat.c(952):    67676 channels searched (657/fr)
      INFO: ngram_search_fwdflat.c(954):     3679 words searched (35/fr)
      INFO: ngram_search_fwdflat.c(957):     2394 word transitions (23/fr)
      INFO: ngram_search_fwdflat.c(960): fwdflat 1.47 CPU 1.427 xRT
      INFO: ngram_search_fwdflat.c(963): fwdflat 2.36 wall 2.295 xRT
      INFO: ngram_search.c(1253): lattice start node <s>.0 end node </s>.21
      INFO: ngram_search.c(1279): Eliminated 1 nodes before end node
      INFO: ngram_search.c(1384): Lattice has 279 nodes, 204 links
      INFO: ps_lattice.c(1380): Bestpath score: -1458
      INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(</s>:21:101) = -92672
      INFO: ps_lattice.c(1441): Joint P(O,S) = -140934 P(S|O) = -48262
      INFO: ngram_search.c(875): bestpath 0.01 CPU 0.010 xRT
      INFO: ngram_search.c(878): bestpath 0.03 wall 0.025 xRT
      no
      READY....
      Listening...
      
       
      • Nickolay V. Shmyrev

        To debug this issue, you can add

        -rawlogdir <dir>
        

        to the command line; it will then dump the raw audio files to the folder <dir>. You can share those files to get help.
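
Before sharing the dumped files, it can help to check whether they contain any signal at all. The sketch below is not part of pocketsphinx; `pcm_stats` is a hypothetical helper, and it assumes the dumps are headerless 16-bit signed little-endian mono PCM recorded at the configured -samprate (16000 in the logs above). If the files are in another format, the numbers will be meaningless.

```python
import math
import struct


def pcm_stats(raw_bytes, samprate=16000):
    """Summarize raw audio, ASSUMING headerless 16-bit little-endian mono PCM."""
    n = len(raw_bytes) // 2  # two bytes per sample; a trailing odd byte is ignored
    samples = struct.unpack("<%dh" % n, raw_bytes[: n * 2])
    peak = max((abs(s) for s in samples), default=0)
    rms = math.sqrt(sum(s * s for s in samples) / n) if n else 0.0
    return {"samples": n, "seconds": n / samprate, "peak": peak, "rms": rms}


# Synthetic example: four samples, the loudest of which is -2000.
demo = struct.pack("<4h", 0, 1000, -2000, 0)
print(pcm_stats(demo))
```

To inspect a real dump, pass the file contents instead, for example `pcm_stats(open(path, "rb").read())`. A peak at or near 0 would suggest that only silence is being captured, while a peak pinned at 32767 would suggest clipping.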

        Most likely the hardware does not support recording at the required sample rate of 16 kHz. You might need to specify the sample rate on the command line, something like -samprate 48000 -nfft 2048.
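
The pairing of -samprate 48000 with -nfft 2048 can be checked with a little arithmetic. The sketch below assumes, as far as I understand the front end, that the FFT size must be a power of two at least as large as one analysis window in samples; the window length 0.025625 s is the -wlen value from the configuration dump above. `min_nfft` is a hypothetical helper, not a pocketsphinx function.

```python
def min_nfft(samprate, wlen=0.025625):
    """Smallest power of two that holds one analysis window of wlen seconds."""
    frame_size = round(samprate * wlen)  # window length in samples
    nfft = 1
    while nfft < frame_size:
        nfft *= 2
    return nfft


for rate in (16000, 48000):
    print(rate, round(rate * 0.025625), min_nfft(rate))
```

At 16000 Hz the window is 410 samples, which fits the default -nfft 512; at 48000 Hz it is 1230 samples, so 512 and 1024 are too small and 2048 is the next power of two.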

         
