I tried to run PocketsphinxAndroidDemo with the acoustic model
"zh_broadcastnews_ptm256_8000" . It loaded well, but it would automatically
exit when start listening. The log file is as follows:
INFO: cmd_ln.c(557): Parsing command line:
Current configuration:
-agc none none
-agcthresh 2.0 2.000000e+00
-alpha 0.97 9.700000e-01
-ascale 20.0 2.000000e+01
-aw 1 1
-backtrace no no
-beam 1e-48 1.000000e-48
-bestpath yes yes
-bestpathlw 9.5 9.500000e+00
-bghist no no
-ceplen 13 13
-cmn current current
-cmninit 8.0 8.0
-compallsen no no
-debug 0
-dict
-dictcase no no
-dither no no
-doublebw no no
-ds 1 1
-fdict
-feat 1s_c_d_dd 1s_c_d_dd
-featparams
-fillprob 1e-8 1.000000e-08
-frate 100 100
-fsg
-fsgusealtpron yes yes
-fsgusefiller yes yes
-fwdflat yes yes
-fwdflatbeam 1e-64 1.000000e-64
-fwdflatefwid 4 4
-fwdflatlw 8.5 8.500000e+00
-fwdflatsfwin 25 25
-fwdflatwbeam 7e-29 7.000000e-29
-fwdtree yes yes
-hmm
-input_endian little little
-jsgf
-kdmaxbbi -1 -1
-kdmaxdepth 0 0
-kdtree
-latsize 5000 5000
-lda
-ldadim 0 0
-lextreedump 0 0
-lifter 0 0
-lm
-lmctl
-lmname default default
-logbase 1.0001 1.000100e+00
-logfn
-logspec no no
-lowerf 133.33334 1.333333e+02
-lpbeam 1e-40 1.000000e-40
-lponlybeam 7e-29 7.000000e-29
-lw 6.5 6.500000e+00
-maxhmmpf -1 -1
-maxnewoov 20 20
-maxwpf -1 -1
-mdef
-mean
-mfclogdir
-min_endfr 0 0
-mixw
-mixwfloor 0.0000001 1.000000e-07
-mllr
-mmap yes yes
-ncep 13 13
-nfft 512 512
-nfilt 40 40
-nwpen 1.0 1.000000e+00
-pbeam 1e-48 1.000000e-48
-pip 1.0 1.000000e+00
-pl_beam 1e-10 1.000000e-10
-pl_pbeam 1e-5 1.000000e-05
-pl_window 0 0
-rawlogdir
-remove_dc no no
-round_filters yes yes
-samprate 16000 1.600000e+04
-seed -1 -1
-sendump
-senlogdir
-senmgau
-silprob 0.005 5.000000e-03
-smoothspec no no
-svspec
-tmat
-tmatfloor 0.0001 1.000000e-04
-topn 4 4
-topn_beam 0 0
-toprule
-transform legacy legacy
-unit_area yes yes
-upperf 6855.4976 6.855498e+03
-usewdphones no no
-uw 1.0 1.000000e+00
-var
-varfloor 0.0001 1.000000e-04
-varnorm no no
-verbose no no
-warp_params
-warp_type inverse_linear inverse_linear
-wbeam 7e-29 7.000000e-29
-wip 0.65 6.500000e-01
-wlen 0.025625 2.562500e-02
Looks like it crashed. Try to run pocketsphinx_continuous with the same
arguments on Linux, see what happens. Try to debug an application on device
and get a stack trace. Check that all words from fsg are present in your
dictionary and so on.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Hi Nickolay,
I tried to run PocketsphinxAndroidDemo with the acoustic model
"zh_broadcastnews_ptm256_8000" . It loaded well, but it would automatically
exit when start listening. The log file is as follows:
INFO: cmd_ln.c(557): Parsing command line:
Current configuration:
-agc none none
-agcthresh 2.0 2.000000e+00
-alpha 0.97 9.700000e-01
-ascale 20.0 2.000000e+01
-aw 1 1
-backtrace no no
-beam 1e-48 1.000000e-48
-bestpath yes yes
-bestpathlw 9.5 9.500000e+00
-bghist no no
-ceplen 13 13
-cmn current current
-cmninit 8.0 8.0
-compallsen no no
-debug 0
-dict
-dictcase no no
-dither no no
-doublebw no no
-ds 1 1
-fdict
-feat 1s_c_d_dd 1s_c_d_dd
-featparams
-fillprob 1e-8 1.000000e-08
-frate 100 100
-fsg
-fsgusealtpron yes yes
-fsgusefiller yes yes
-fwdflat yes yes
-fwdflatbeam 1e-64 1.000000e-64
-fwdflatefwid 4 4
-fwdflatlw 8.5 8.500000e+00
-fwdflatsfwin 25 25
-fwdflatwbeam 7e-29 7.000000e-29
-fwdtree yes yes
-hmm
-input_endian little little
-jsgf
-kdmaxbbi -1 -1
-kdmaxdepth 0 0
-kdtree
-latsize 5000 5000
-lda
-ldadim 0 0
-lextreedump 0 0
-lifter 0 0
-lm
-lmctl
-lmname default default
-logbase 1.0001 1.000100e+00
-logfn
-logspec no no
-lowerf 133.33334 1.333333e+02
-lpbeam 1e-40 1.000000e-40
-lponlybeam 7e-29 7.000000e-29
-lw 6.5 6.500000e+00
-maxhmmpf -1 -1
-maxnewoov 20 20
-maxwpf -1 -1
-mdef
-mean
-mfclogdir
-min_endfr 0 0
-mixw
-mixwfloor 0.0000001 1.000000e-07
-mllr
-mmap yes yes
-ncep 13 13
-nfft 512 512
-nfilt 40 40
-nwpen 1.0 1.000000e+00
-pbeam 1e-48 1.000000e-48
-pip 1.0 1.000000e+00
-pl_beam 1e-10 1.000000e-10
-pl_pbeam 1e-5 1.000000e-05
-pl_window 0 0
-rawlogdir
-remove_dc no no
-round_filters yes yes
-samprate 16000 1.600000e+04
-seed -1 -1
-sendump
-senlogdir
-senmgau
-silprob 0.005 5.000000e-03
-smoothspec no no
-svspec
-tmat
-tmatfloor 0.0001 1.000000e-04
-topn 4 4
-topn_beam 0 0
-toprule
-transform legacy legacy
-unit_area yes yes
-upperf 6855.4976 6.855498e+03
-usewdphones no no
-uw 1.0 1.000000e+00
-var
-varfloor 0.0001 1.000000e-04
-varnorm no no
-verbose no no
-warp_params
-warp_type inverse_linear inverse_linear
-wbeam 7e-29 7.000000e-29
-wip 0.65 6.500000e-01
-wlen 0.025625 2.562500e-02
INFO: cmd_ln.c(557): Parsing command line:
\
-alpha 0.97 \
-doublebw no \
-nfilt 40 \
-ncep 13 \
-lowerf 133.33334 \
-upperf 6855.4976 \
-nfft 512 \
-wlen 0.0256 \
-transform legacy \
-feat s2_4x \
-agc none \
-cmn current \
-varnorm no
Current configuration:
-agc none none
-agcthresh 2.0 2.000000e+00
-alpha 0.97 9.700000e-01
-ceplen 13 13
-cmn current current
-cmninit 8.0 8.0
-dither no no
-doublebw no no
-feat 1s_c_d_dd s2_4x
-frate 100 100
-input_endian little little
-lda
-ldadim 0 0
-lifter 0 0
-logspec no no
-lowerf 133.33334 1.333333e+02
-ncep 13 13
-nfft 512 512
-nfilt 40 40
-remove_dc no no
-round_filters yes yes
-samprate 16000 8.000000e+03
-seed -1 -1
-smoothspec no no
-svspec
-transform legacy legacy
-unit_area yes yes
-upperf 6855.4976 6.855498e+03
-varnorm no no
-verbose no no
-warp_params
-warp_type inverse_linear inverse_linear
-wlen 0.025625 2.560000e-02
INFO: acmod.c(238): Parsed model-specific feature parameters from /sdcard/Andr
oid/data/edu.cmu.pocketsphinx/hmm/zh/zh_broadcastnews_ptm256_8000/feat.params
INFO: feat.c(860): Initializing feature stream to type: 's2_4x', ceplen=13,
CMN='current', VARNORM='no', AGC='none'
INFO: cmn.c(142): mean= 12.00, mean= 0.0
INFO: mdef.c(520): Reading model definition: /sdcard/Android/data/edu.cmu.pock
etsphinx/hmm/zh/zh_broadcastnews_ptm256_8000/mdef
INFO: bin_mdef.c(173): Allocating 68760 * 8 bytes (537 KiB) for CD tree
INFO: tmat.c(205): Reading HMM transition probability matrices: /sdcard/Androi
d/data/edu.cmu.pocketsphinx/hmm/zh/zh_broadcastnews_ptm256_8000/transition_mat
rices
INFO: acmod.c(117): Attempting to use SCHMM computation module
INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /sdcard/Android/da
ta/edu.cmu.pocketsphinx/hmm/zh/zh_broadcastnews_ptm256_8000/means
INFO: ms_gauden.c(292): 70 codebook, 4 feature, size:
INFO: ms_gauden.c(294): 256x12
INFO: ms_gauden.c(294): 256x24
INFO: ms_gauden.c(294): 256x3
INFO: ms_gauden.c(294): 256x12
INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /sdcard/Android/da
ta/edu.cmu.pocketsphinx/hmm/zh/zh_broadcastnews_ptm256_8000/variances
INFO: ms_gauden.c(292): 70 codebook, 4 feature, size:
INFO: ms_gauden.c(294): 256x12
INFO: ms_gauden.c(294): 256x24
INFO: ms_gauden.c(294): 256x3
INFO: ms_gauden.c(294): 256x12
INFO: ms_gauden.c(354): 24440 variance values floored
INFO: acmod.c(119): Attempting to use PTHMM computation module
INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /sdcard/Android/da
ta/edu.cmu.pocketsphinx/hmm/zh/zh_broadcastnews_ptm256_8000/means
INFO: ms_gauden.c(292): 70 codebook, 4 feature, size:
INFO: ms_gauden.c(294): 256x12
INFO: ms_gauden.c(294): 256x24
INFO: ms_gauden.c(294): 256x3
INFO: ms_gauden.c(294): 256x12
INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /sdcard/Android/da
ta/edu.cmu.pocketsphinx/hmm/zh/zh_broadcastnews_ptm256_8000/variances
INFO: ms_gauden.c(292): 70 codebook, 4 feature, size:
INFO: ms_gauden.c(294): 256x12
INFO: ms_gauden.c(294): 256x24
INFO: ms_gauden.c(294): 256x3
INFO: ms_gauden.c(294): 256x12
INFO: ms_gauden.c(354): 24440 variance values floored
INFO: ptm_mgau.c(472): Loading senones from dump file /sdcard/Android/data/edu
.cmu.pocketsphinx/hmm/zh/zh_broadcastnews_ptm256_8000/sendump
INFO: ptm_mgau.c(496): BEGIN FILE FORMAT DESCRIPTION
INFO: ptm_mgau.c(559): Rows: 256, Columns: 8210
INFO: ptm_mgau.c(591): Using memory-mapped I/O for senones
INFO: ptm_mgau.c(830): Maximum top-N: 4
INFO: phone_loop_search.c(105): State beam -230231 Phone exit beam -115115
Insertion penalty 0
INFO: dict.c(306): Allocating 4113 * 20 bytes (80 KiB) for word entries
INFO: dict.c(321): Reading main dictionary:
/sdcard/Android/data/edu.cmu.pocketsphinx/lm/idea/testdic.dic
INFO: dict.c(212): Allocated 0 KiB for strings, 0 KiB for phones
INFO: dict.c(324): 9 words read
INFO: dict.c(330): Reading filler dictionary: /sdcard/Android/data/edu.cmu.poc
ketsphinx/hmm/zh/zh_broadcastnews_ptm256_8000/noisedict
INFO: dict.c(212): Allocated 0 KiB for strings, 0 KiB for phones
INFO: dict.c(333): 8 words read
INFO: dict2pid.c(396): Building PID tables for dictionary
INFO: dict2pid.c(404): Allocating 70^3 * 2 bytes (669 KiB) for word-initial
triphones
INFO: dict2pid.c(131): Allocated 59080 bytes (57 KiB) for word-final triphones
INFO: dict2pid.c(195): Allocated 59080 bytes (57 KiB) for single-phone word
triphones
INFO: fsg_search.c(145): FSG(beam: -1080, pbeam: -1080, wbeam: -634; wip: -26,
pip: 0)
INFO: jsgf.c(546): Defined rule: <test.g00000>
INFO: jsgf.c(546): Defined rule: <test.g00001>
INFO: jsgf.c(546): Defined rule: PUBLIC <test.char>
INFO: jsgf.c(349): Right recursion <test.g00001> 6 => 2
INFO: fsg_model.c(213): Computing transitive closure for null transitions
INFO: fsg_model.c(264): 55 null transitions added
INFO: fsg_model.c(411): Adding silence transitions for <sil> to FSG
INFO: fsg_model.c(431): Added 16 silence word transitions
INFO: fsg_model.c(411): Adding silence transitions for <sil> to FSG
INFO: fsg_model.c(431): Added 16 silence word transitions
INFO: fsg_model.c(411): Adding silence transitions for ++laugh++ to FSG
INFO: fsg_model.c(431): Added 16 silence word transitions
INFO: fsg_model.c(411): Adding silence transitions for ++lipsmack++ to FSG
INFO: fsg_model.c(431): Added 16 silence word transitions
INFO: fsg_model.c(411): Adding silence transitions for ++cough++ to FSG
INFO: fsg_model.c(431): Added 16 silence word transitions
INFO: fsg_model.c(411): Adding silence transitions for ++breath++ to FSG
INFO: fsg_model.c(431): Added 16 silence word transitions
INFO: fsg_search.c(360): Added 0 alternate word transitions
INFO: fsg_lextree.c(108): Allocated 2272 bytes (2 KiB) for left and right
context phones
INFO: fsg_lextree.c(251): 145 HMM nodes in lextree (137 leaves)
INFO: fsg_lextree.c(253): Allocated 16820 bytes (16 KiB) for all lextree nodes
INFO: fsg_lextree.c(256): Allocated 15892 bytes (15 KiB) for lextree leafnodes </sil></sil></test.g00001></test.char></test.g00001></test.g00000>
Hello
Looks like it crashed. Try to run pocketsphinx_continuous with the same
arguments on Linux, see what happens. Try to debug an application on device
and get a stack trace. Check that all words from fsg are present in your
dictionary and so on.
Hi,
pocketsphinx_continuous is OK with it. But with my emulator, the recognition
seems to be everlasting(more than 15 minutes).