AndroidDemo with PTM model problem

Help
z x
2011-04-13
2012-09-22
  • z x

    z x - 2011-04-13

    Hi Nickolay,

    I tried to run PocketsphinxAndroidDemo with the acoustic model
    "zh_broadcastnews_ptm256_8000". It loaded fine, but the app exits on its
    own as soon as it starts listening. The log is as follows:

    INFO: cmd_ln.c(557): Parsing command line:

    Current configuration:

    -agc none none
    -agcthresh 2.0 2.000000e+00
    -alpha 0.97 9.700000e-01
    -ascale 20.0 2.000000e+01
    -aw 1 1
    -backtrace no no
    -beam 1e-48 1.000000e-48
    -bestpath yes yes
    -bestpathlw 9.5 9.500000e+00
    -bghist no no
    -ceplen 13 13
    -cmn current current
    -cmninit 8.0 8.0
    -compallsen no no
    -debug 0
    -dict
    -dictcase no no
    -dither no no
    -doublebw no no
    -ds 1 1
    -fdict
    -feat 1s_c_d_dd 1s_c_d_dd
    -featparams
    -fillprob 1e-8 1.000000e-08
    -frate 100 100
    -fsg
    -fsgusealtpron yes yes
    -fsgusefiller yes yes
    -fwdflat yes yes
    -fwdflatbeam 1e-64 1.000000e-64
    -fwdflatefwid 4 4
    -fwdflatlw 8.5 8.500000e+00
    -fwdflatsfwin 25 25
    -fwdflatwbeam 7e-29 7.000000e-29
    -fwdtree yes yes
    -hmm
    -input_endian little little
    -jsgf
    -kdmaxbbi -1 -1
    -kdmaxdepth 0 0
    -kdtree
    -latsize 5000 5000
    -lda
    -ldadim 0 0
    -lextreedump 0 0
    -lifter 0 0
    -lm
    -lmctl
    -lmname default default
    -logbase 1.0001 1.000100e+00
    -logfn
    -logspec no no
    -lowerf 133.33334 1.333333e+02
    -lpbeam 1e-40 1.000000e-40
    -lponlybeam 7e-29 7.000000e-29
    -lw 6.5 6.500000e+00
    -maxhmmpf -1 -1
    -maxnewoov 20 20
    -maxwpf -1 -1
    -mdef
    -mean
    -mfclogdir
    -min_endfr 0 0
    -mixw
    -mixwfloor 0.0000001 1.000000e-07
    -mllr
    -mmap yes yes
    -ncep 13 13
    -nfft 512 512
    -nfilt 40 40
    -nwpen 1.0 1.000000e+00
    -pbeam 1e-48 1.000000e-48
    -pip 1.0 1.000000e+00
    -pl_beam 1e-10 1.000000e-10
    -pl_pbeam 1e-5 1.000000e-05
    -pl_window 0 0
    -rawlogdir
    -remove_dc no no
    -round_filters yes yes
    -samprate 16000 1.600000e+04
    -seed -1 -1
    -sendump
    -senlogdir
    -senmgau
    -silprob 0.005 5.000000e-03
    -smoothspec no no
    -svspec
    -tmat
    -tmatfloor 0.0001 1.000000e-04
    -topn 4 4
    -topn_beam 0 0
    -toprule
    -transform legacy legacy
    -unit_area yes yes
    -upperf 6855.4976 6.855498e+03
    -usewdphones no no
    -uw 1.0 1.000000e+00
    -var
    -varfloor 0.0001 1.000000e-04
    -varnorm no no
    -verbose no no
    -warp_params
    -warp_type inverse_linear inverse_linear
    -wbeam 7e-29 7.000000e-29
    -wip 0.65 6.500000e-01
    -wlen 0.025625 2.562500e-02

    INFO: cmd_ln.c(557): Parsing command line:
    \
    -alpha 0.97 \
    -doublebw no \
    -nfilt 40 \
    -ncep 13 \
    -lowerf 133.33334 \
    -upperf 6855.4976 \
    -nfft 512 \
    -wlen 0.0256 \
    -transform legacy \
    -feat s2_4x \
    -agc none \
    -cmn current \
    -varnorm no

    Current configuration:

    -agc none none
    -agcthresh 2.0 2.000000e+00
    -alpha 0.97 9.700000e-01
    -ceplen 13 13
    -cmn current current
    -cmninit 8.0 8.0
    -dither no no
    -doublebw no no
    -feat 1s_c_d_dd s2_4x
    -frate 100 100
    -input_endian little little
    -lda
    -ldadim 0 0
    -lifter 0 0
    -logspec no no
    -lowerf 133.33334 1.333333e+02
    -ncep 13 13
    -nfft 512 512
    -nfilt 40 40
    -remove_dc no no
    -round_filters yes yes
    -samprate 16000 8.000000e+03
    -seed -1 -1
    -smoothspec no no
    -svspec
    -transform legacy legacy
    -unit_area yes yes
    -upperf 6855.4976 6.855498e+03
    -varnorm no no
    -verbose no no
    -warp_params
    -warp_type inverse_linear inverse_linear
    -wlen 0.025625 2.560000e-02

    INFO: acmod.c(238): Parsed model-specific feature parameters from /sdcard/Android/data/edu.cmu.pocketsphinx/hmm/zh/zh_broadcastnews_ptm256_8000/feat.params
    INFO: feat.c(860): Initializing feature stream to type: 's2_4x', ceplen=13, CMN='current', VARNORM='no', AGC='none'
    INFO: cmn.c(142): mean= 12.00, mean= 0.0
    INFO: mdef.c(520): Reading model definition: /sdcard/Android/data/edu.cmu.pocketsphinx/hmm/zh/zh_broadcastnews_ptm256_8000/mdef
    INFO: bin_mdef.c(173): Allocating 68760 * 8 bytes (537 KiB) for CD tree
    INFO: tmat.c(205): Reading HMM transition probability matrices: /sdcard/Android/data/edu.cmu.pocketsphinx/hmm/zh/zh_broadcastnews_ptm256_8000/transition_matrices
    INFO: acmod.c(117): Attempting to use SCHMM computation module
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /sdcard/Android/data/edu.cmu.pocketsphinx/hmm/zh/zh_broadcastnews_ptm256_8000/means
    INFO: ms_gauden.c(292): 70 codebook, 4 feature, size:
    INFO: ms_gauden.c(294): 256x12
    INFO: ms_gauden.c(294): 256x24
    INFO: ms_gauden.c(294): 256x3
    INFO: ms_gauden.c(294): 256x12
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /sdcard/Android/data/edu.cmu.pocketsphinx/hmm/zh/zh_broadcastnews_ptm256_8000/variances
    INFO: ms_gauden.c(292): 70 codebook, 4 feature, size:
    INFO: ms_gauden.c(294): 256x12
    INFO: ms_gauden.c(294): 256x24
    INFO: ms_gauden.c(294): 256x3
    INFO: ms_gauden.c(294): 256x12
    INFO: ms_gauden.c(354): 24440 variance values floored
    INFO: acmod.c(119): Attempting to use PTHMM computation module
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /sdcard/Android/data/edu.cmu.pocketsphinx/hmm/zh/zh_broadcastnews_ptm256_8000/means
    INFO: ms_gauden.c(292): 70 codebook, 4 feature, size:
    INFO: ms_gauden.c(294): 256x12
    INFO: ms_gauden.c(294): 256x24
    INFO: ms_gauden.c(294): 256x3
    INFO: ms_gauden.c(294): 256x12
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter: /sdcard/Android/data/edu.cmu.pocketsphinx/hmm/zh/zh_broadcastnews_ptm256_8000/variances
    INFO: ms_gauden.c(292): 70 codebook, 4 feature, size:
    INFO: ms_gauden.c(294): 256x12
    INFO: ms_gauden.c(294): 256x24
    INFO: ms_gauden.c(294): 256x3
    INFO: ms_gauden.c(294): 256x12
    INFO: ms_gauden.c(354): 24440 variance values floored
    INFO: ptm_mgau.c(472): Loading senones from dump file /sdcard/Android/data/edu.cmu.pocketsphinx/hmm/zh/zh_broadcastnews_ptm256_8000/sendump
    INFO: ptm_mgau.c(496): BEGIN FILE FORMAT DESCRIPTION
    INFO: ptm_mgau.c(559): Rows: 256, Columns: 8210
    INFO: ptm_mgau.c(591): Using memory-mapped I/O for senones
    INFO: ptm_mgau.c(830): Maximum top-N: 4
    INFO: phone_loop_search.c(105): State beam -230231 Phone exit beam -115115 Insertion penalty 0
    INFO: dict.c(306): Allocating 4113 * 20 bytes (80 KiB) for word entries
    INFO: dict.c(321): Reading main dictionary: /sdcard/Android/data/edu.cmu.pocketsphinx/lm/idea/testdic.dic
    INFO: dict.c(212): Allocated 0 KiB for strings, 0 KiB for phones
    INFO: dict.c(324): 9 words read
    INFO: dict.c(330): Reading filler dictionary: /sdcard/Android/data/edu.cmu.pocketsphinx/hmm/zh/zh_broadcastnews_ptm256_8000/noisedict
    INFO: dict.c(212): Allocated 0 KiB for strings, 0 KiB for phones
    INFO: dict.c(333): 8 words read
    INFO: dict2pid.c(396): Building PID tables for dictionary
    INFO: dict2pid.c(404): Allocating 70^3 * 2 bytes (669 KiB) for word-initial triphones
    INFO: dict2pid.c(131): Allocated 59080 bytes (57 KiB) for word-final triphones
    INFO: dict2pid.c(195): Allocated 59080 bytes (57 KiB) for single-phone word triphones
    INFO: fsg_search.c(145): FSG(beam: -1080, pbeam: -1080, wbeam: -634; wip: -26, pip: 0)
    INFO: jsgf.c(546): Defined rule: <test.g00000>
    INFO: jsgf.c(546): Defined rule: <test.g00001>
    INFO: jsgf.c(546): Defined rule: PUBLIC <test.char>
    INFO: jsgf.c(349): Right recursion <test.g00001> 6 => 2
    INFO: fsg_model.c(213): Computing transitive closure for null transitions
    INFO: fsg_model.c(264): 55 null transitions added
    INFO: fsg_model.c(411): Adding silence transitions for <sil> to FSG
    INFO: fsg_model.c(431): Added 16 silence word transitions
    INFO: fsg_model.c(411): Adding silence transitions for <sil> to FSG
    INFO: fsg_model.c(431): Added 16 silence word transitions
    INFO: fsg_model.c(411): Adding silence transitions for ++laugh++ to FSG
    INFO: fsg_model.c(431): Added 16 silence word transitions
    INFO: fsg_model.c(411): Adding silence transitions for ++lipsmack++ to FSG
    INFO: fsg_model.c(431): Added 16 silence word transitions
    INFO: fsg_model.c(411): Adding silence transitions for ++cough++ to FSG
    INFO: fsg_model.c(431): Added 16 silence word transitions
    INFO: fsg_model.c(411): Adding silence transitions for ++breath++ to FSG
    INFO: fsg_model.c(431): Added 16 silence word transitions
    INFO: fsg_search.c(360): Added 0 alternate word transitions
    INFO: fsg_lextree.c(108): Allocated 2272 bytes (2 KiB) for left and right context phones
    INFO: fsg_lextree.c(251): 145 HMM nodes in lextree (137 leaves)
    INFO: fsg_lextree.c(253): Allocated 16820 bytes (16 KiB) for all lextree nodes
    INFO: fsg_lextree.c(256): Allocated 15892 bytes (15 KiB) for lextree leafnodes

     
  • Nickolay V. Shmyrev

    Hello

    It looks like it crashed. Try running pocketsphinx_continuous with the same
    arguments on Linux and see what happens. Try debugging the application on
    the device to get a stack trace. Also check that all words in the FSG are
    present in your dictionary, and so on.
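
The dictionary check suggested above can be scripted. The following is a minimal sketch, not part of PocketSphinx: it assumes the grammar is a JSGF file (the log shows rules being defined by jsgf.c) and that the dictionary uses the usual "word phone phone ..." layout with alternate pronunciations written as "word(2)". The function names, the naive tokenizer, and the sample words are all hypothetical; the real grammar and testdic.dic are not shown in this thread. The tokenizer ignores quoted tokens and would be confused by tags containing "=" or ";".

```python
import re


def jsgf_words(grammar_text):
    """Return the set of terminal words in a JSGF grammar (naive tokenizer)."""
    text = re.sub(r"/\*.*?\*/", " ", grammar_text, flags=re.S)  # block comments
    text = re.sub(r"//[^\n]*", " ", text)                        # line comments
    words = set()
    for statement in text.split(";"):
        if "=" not in statement:
            continue  # header line, "grammar ..." or "import ..." statement
        body = statement.split("=", 1)[1]                 # right-hand side of the rule
        body = re.sub(r"\{.*?\}", " ", body, flags=re.S)  # tags
        body = re.sub(r"<[^>]*>", " ", body)              # rule references
        body = re.sub(r"/[^/]*/", " ", body)              # weights such as /0.5/
        body = re.sub(r"[()\[\]|*+]", " ", body)          # grouping and operators
        words.update(body.split())
    return words


def dict_words(dict_text):
    """Return the set of head words in a pronunciation dictionary."""
    words = set()
    for line in dict_text.splitlines():
        fields = line.split()
        if not fields:
            continue
        # "two(2)" is an alternate pronunciation of "two"
        words.add(re.sub(r"\(\d+\)$", "", fields[0]))
    return words


def missing_words(grammar_text, dict_text):
    """List grammar words that have no dictionary entry (exact match)."""
    return sorted(jsgf_words(grammar_text) - dict_words(dict_text))


if __name__ == "__main__":
    # Hypothetical sample data, for illustration only.
    grammar = """#JSGF V1.0;
grammar test;
public <test.char> = ( one | two | <test.g00000> ) *;
<test.g00000> = /0.5/ three [ four ];
"""
    dictionary = "one W AH N\ntwo T UW\ntwo(2) T AH\nthree TH R IY\n"
    print(missing_words(grammar, dictionary))
```

To use it on real files, read the grammar and the .dic file into strings and pass them to `missing_words`; an empty list means every grammar word has a dictionary entry under these assumptions. The comparison is an exact string match, so case differences are reported as missing.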

     
  • z x

    z x - 2011-04-13

    Hi,
    pocketsphinx_continuous works fine with it. On my emulator, though,
    recognition seems to run forever (more than 15 minutes).

     
