Menu

Adapting acoustic model

Help
2016-04-08
2016-04-12
  • Himanshu Srivastava

    i am done with acoustic model adaptation following tutorial at http://cmusphinx.sourceforge.net/wiki/tutorialadapt
    but i am little confused what to do after that.
    i run command
    pocketsphinx_continuous -hmm /en-us/en-us -infile /fle.wav -lm /en-us.lm -dict /cmudict-en-us.dict
    but i didn't unedrstand what this command does exactly.
    Now i am running fo9llowing command

    F:\PocketSphinx\pocketsphinx\bin\Release\x64>pocketsphinx_continuous -hmm F:\PocketSphinx\pocketsphinx\model\en-us\en-us -infile F:\PocketSphinx\new_acoustic -keyphrase HELP -kws_threshold \1e-1 -time
    yes

    this command is unable to find given word in the dictionary

    INFO: pocketsphinx.c(152): Parsed model-specific feature parameters from F:\PocketSphinx\pocketsphinx\model\en-us\en-us/feat.params
    Current configuration:
    [NAME] [DEFLT] [VALUE]
    -agc none none
    -agcthresh 2.0 2.000000e+000
    -allphone
    -allphone_ci no no
    -alpha 0.97 9.700000e-001
    -ascale 20.0 2.000000e+001
    -aw 1 1
    -backtrace no no
    -beam 1e-48 1.000000e-048
    -bestpath yes yes
    -bestpathlw 9.5 9.500000e+000
    -ceplen 13 13
    -cmn current current
    -cmninit 8.0 40,3,-1
    -compallsen no no
    -debug 0
    -dict
    -dictcase no no
    -dither no no
    -doublebw no no
    -ds 1 1
    -fdict
    -feat 1s_c_d_dd 1s_c_d_dd
    -featparams
    -fillprob 1e-8 1.000000e-008
    -frate 100 100
    -fsg
    -fsgusealtpron yes yes
    -fsgusefiller yes yes
    -fwdflat yes yes
    -fwdflatbeam 1e-64 1.000000e-064
    -fwdflatefwid 4 4
    -fwdflatlw 8.5 8.500000e+000
    -fwdflatsfwin 25 25
    -fwdflatwbeam 7e-29 7.000000e-029
    -fwdtree yes yes
    -hmm F:\PocketSphinx\pocketsphinx\model\en-us\en-us
    -input_endian little little
    -jsgf
    -keyphrase HELP
    -kws
    -kws_delay 10 10
    -kws_plp 1e-1 1.000000e-001
    -kws_threshold 1 0.000000e+000
    -latsize 5000 5000
    -lda
    -ldadim 0 0
    -lifter 0 22
    -lm
    -lmctl
    -lmname
    -logbase 1.0001 1.000100e+000
    -logfn
    -logspec no no
    -lowerf 133.33334 1.300000e+002
    -lpbeam 1e-40 1.000000e-040
    -lponlybeam 7e-29 7.000000e-029
    -lw 6.5 6.500000e+000
    -maxhmmpf 30000 30000
    -maxwpf -1 -1
    -mdef
    -mean
    -mfclogdir
    -min_endfr 0 0
    -mixw
    -mixwfloor 0.0000001 1.000000e-007
    -mllr
    -mmap yes yes
    -ncep 13 13
    -nfft 512 512
    -nfilt 40 25
    -nwpen 1.0 1.000000e+000
    -pbeam 1e-48 1.000000e-048
    -pip 1.0 1.000000e+000
    -pl_beam 1e-10 1.000000e-010
    -pl_pbeam 1e-10 1.000000e-010
    -pl_pip 1.0 1.000000e+000
    -pl_weight 3.0 3.000000e+000
    -pl_window 5 5
    -rawlogdir
    -remove_dc no no
    -remove_noise yes yes
    -remove_silence yes yes
    -round_filters yes yes
    -samprate 16000 1.600000e+004
    -seed -1 -1
    -sendump
    -senlogdir
    -senmgau
    -silprob 0.005 5.000000e-003
    -smoothspec no no
    -svspec 0-12/13-25/26-38
    -tmat
    -tmatfloor 0.0001 1.000000e-004
    -topn 4 4
    -topn_beam 0 0
    -toprule
    -transform legacy dct
    -unit_area yes yes
    -upperf 6855.4976 6.800000e+003
    -uw 1.0 1.000000e+000
    -vad_postspeech 50 50
    -vad_prespeech 20 20
    -vad_startspeech 10 10
    -vad_threshold 2.0 2.000000e+000
    -var
    -varfloor 0.0001 1.000000e-004
    -varnorm no no
    -verbose no no
    -warp_params
    -warp_type inverse_linear inverse_linear
    -wbeam 7e-29 7.000000e-029
    -wip 0.65 6.500000e-001
    -wlen 0.025625 2.562500e-002

    INFO: feat.c(715): Initializing feature stream to type: '1s_c_d_dd', ceplen=13, CMN='current', VARNORM='no', AGC='none'
    INFO: cmn.c(143): mean[0]= 12.00, mean[1..12]= 0.0
    INFO: acmod.c(164): Using subvector specification 0-12/13-25/26-38
    INFO: mdef.c(518): Reading model definition: F:\PocketSphinx\pocketsphinx\model\en-us\en-us/mdef
    INFO: mdef.c(531): Found byte-order mark BMDF, assuming this is a binary mdef file
    INFO: bin_mdef.c(336): Reading binary model definition: F:\PocketSphinx\pocketsphinx\model\en-us\en-us/mdef
    INFO: bin_mdef.c(516): 42 CI-phone, 137053 CD-phone, 3 emitstate/phone, 126 CI-sen, 5126 Sen, 29324 Sen-Seq
    INFO: tmat.c(206): Reading HMM transition probability matrices: F:\PocketSphinx\pocketsphinx\model\en-us\en-us/transition_matrices
    INFO: acmod.c(117): Attempting to use PTM computation module
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter: F:\PocketSphinx\pocketsphinx\model\en-us\en-us/means
    INFO: ms_gauden.c(292): 42 codebook, 3 feature, size:
    INFO: ms_gauden.c(294): 128x13
    INFO: ms_gauden.c(294): 128x13
    INFO: ms_gauden.c(294): 128x13
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter: F:\PocketSphinx\pocketsphinx\model\en-us\en-us/variances
    INFO: ms_gauden.c(292): 42 codebook, 3 feature, size:
    INFO: ms_gauden.c(294): 128x13
    INFO: ms_gauden.c(294): 128x13
    INFO: ms_gauden.c(294): 128x13
    INFO: ms_gauden.c(354): 222 variance values floored
    INFO: ptm_mgau.c(476): Loading senones from dump file F:\PocketSphinx\pocketsphinx\model\en-us\en-us/sendump
    INFO: ptm_mgau.c(500): BEGIN FILE FORMAT DESCRIPTION
    INFO: ptm_mgau.c(563): Rows: 128, Columns: 5126
    INFO: ptm_mgau.c(595): Using memory-mapped I/O for senones
    INFO: ptm_mgau.c(835): Maximum top-N: 4
    INFO: phone_loop_search.c(114): State beam -225 Phone exit beam -225 Insertion penalty 0
    INFO: dict.c(320): Allocating 4101 * 32 bytes (128 KiB) for word entries
    INFO: dict.c(358): Reading filler dictionary: F:\PocketSphinx\pocketsphinx\model\en-us\en-us/noisedict
    INFO: dict.c(213): Dictionary size 5, allocated 0 KiB for strings, 0 KiB for phones
    INFO: dict.c(361): 5 words read
    INFO: dict2pid.c(396): Building PID tables for dictionary
    INFO: dict2pid.c(406): Allocating 42^3 * 2 bytes (144 KiB) for word-initial triphones
    INFO: dict2pid.c(132): Allocated 42672 bytes (41 KiB) for word-final triphones
    INFO: dict2pid.c(196): Allocated 42672 bytes (41 KiB) for single-phone word triphones
    INFO: kws_search.c(420): KWS(beam: -1080, plp: -23, default threshold -524288, delay 10)
    ERROR: "kws_search.c", line 171: The word 'HELP' is missing in the dictionary
    INFO: kws_search.c(467): TOTAL kws 0.00 CPU -1.#IO xRT
    INFO: kws_search.c(470): TOTAL kws 0.00 wall -1.#IO xRT

    Please ,anybody help

     
    • Nickolay V. Shmyrev

      The word 'HELP' is missing in the dictionary

      This error says you need to add word "HELP" to the dicitonary. It might be that the dictionary already contains "help" in lowercase, so you need to use "help" in lowercase instead of "HELP" in uppercase.

       

Log in to post a comment.

Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.