Menu

Problem in getting nbest list

Help
Pankaj
2011-07-13
2012-09-22
  • Pankaj

    Pankaj - 2011-07-13

    Hi,
    I am trying to get n-best list using pocketsphinx (0.6.1) with the following
    configuration, but not getting it. What could be wrong with the command
    -hmm I:\sphinx\pocketsphinx\model\hmm\en_US\hub4wsj_sc_8k
    -dict I:\sphinx\testdata\2803.dic
    -lm I:\sphinx\testdata\2803.lm
    -hyp I:\sphinx\testdata\ngram.hyp
    -fwdflat 1
    -fwdtree 1
    -adcin yes
    -nbestdir I:\sphinx\testdata\nbestdir
    -ctl I:\sphinx\testdata\2803.ctl
    -cepext .raw
    -cepdir I:\sphinx\testdata.

    After typing the command : I am getting following output on the console:

    INFO: cmd_ln.c(512): Parsing command line:
    \
    -hmm I:\sphinx\pocketsphinx\model\hmm\en_US\hub4wsj_sc_8k \
    -dict I:\sphinx\testdata\2803.dic \
    -lm I:\sphinx\testdata\2803.lm \
    -hyp I:\sphinx\testdata\ngram.hyp \
    -fwdflat 1 \
    -fwdtree 1 \
    -adcin yes \
    -nbestdir I:\sphinx\testdata\nbestdir \
    -ctl I:\sphinx\testdata\2803.ctl \
    -cepext .raw \
    -cepdir I:\sphinx\testdata

    Current configuration:

    -adchdr 0 0
    -adcin no yes
    -agc none none
    -agcthresh 2.0 2.000000e+000
    -alpha 0.97 9.700000e-001
    -argfile
    -ascale 20.0 2.000000e+001
    -backtrace no no
    -beam 1e-48 1.000000e-048
    -bestpath yes yes
    -bestpathlw 9.5 9.500000e+000
    -bghist no no
    -build_outdirs yes yes
    -cepdir I:\sphinx\testdata
    -cepext .mfc .raw
    -ceplen 13 13
    -cmn current current
    -cmninit 8.0 8.0
    -compallsen no no
    -ctl I:\sphinx\testdata\2803.ctl
    -ctlcount -1 -1
    -ctlincr 1 1
    -ctloffset 0 0
    -ctm
    -debug 0
    -dict I:\sphinx\testdata\2803.dic
    -dictcase no no
    -dither no no
    -doublebw no no
    -ds 1 1
    -fdict
    -feat 1s_c_d_dd 1s_c_d_dd
    -featparams
    -fillprob 1e-8 1.000000e-008
    -frate 100 100
    -fsg
    -fsgctl
    -fsgdir
    -fsgext
    -fsgusealtpron yes yes
    -fsgusefiller yes yes
    -fwdflat yes yes
    -fwdflatbeam 1e-64 1.000000e-064
    -fwdflatefwid 4 4
    -fwdflatlw 8.5 8.500000e+000
    -fwdflatsfwin 25 25
    -fwdflatwbeam 7e-29 7.000000e-029
    -fwdtree yes yes
    -hmm I:\sphinx\pocketsphinx\model\hmm\en_US\hub
    4wsj_sc_8k
    -hyp I:\sphinx\testdata\ngram.hyp
    -hypseg
    -input_endian little little
    -jsgf
    -kdmaxbbi -1 -1
    -kdmaxdepth 0 0
    -kdtree
    -latsize 5000 5000
    -lda
    -ldadim 0 0
    -lextreedump 0 0
    -lifter 0 0
    -lm I:\sphinx\testdata\2803.lm
    -lmctl
    -lmname default default
    -lmnamectl
    -logbase 1.0001 1.000100e+000
    -logfn
    -logspec no no
    -lowerf 133.33334 1.333333e+002
    -lpbeam 1e-40 1.000000e-040
    -lponlybeam 7e-29 7.000000e-029
    -lw 6.5 6.500000e+000
    -maxhmmpf -1 -1
    -maxnewoov 20 20
    -maxwpf -1 -1
    -mdef
    -mean
    -mfclogdir
    -mixw
    -mixwfloor 0.0000001 1.000000e-007
    -mllr
    -mllrctl
    -mllrdir
    -mllrext
    -mmap yes yes
    -nbest 0 0
    -nbestdir I:\sphinx\testdata\nbestdir
    -nbestext .hyp .hyp
    -ncep 13 13
    -nfft 512 512
    -nfilt 40 40
    -nwpen 1.0 1.000000e+000
    -outlatdir
    -pbeam 1e-48 1.000000e-048
    -pip 1.0 1.000000e+000
    -pl_beam 1e-10 1.000000e-010
    -pl_pbeam 1e-5 1.000000e-005
    -pl_window 0 0
    -rawlogdir
    -remove_dc no no
    -round_filters yes yes
    -samprate 16000 1.600000e+004
    -seed -1 -1
    -sendump
    -senmgau
    -silprob 0.005 5.000000e-003
    -smoothspec no no
    -svspec
    -tmat
    -tmatfloor 0.0001 1.000000e-004
    -topn 4 4
    -topn_beam 0 0
    -toprule
    -transform legacy legacy
    -unit_area yes yes
    -upperf 6855.4976 6.855498e+003
    -usewdphones no no
    -uw 1.0 1.000000e+000
    -var
    -varfloor 0.0001 1.000000e-004
    -varnorm no no
    -verbose no no
    -warp_params
    -warp_type inverse_linear inverse_linear
    -wbeam 7e-29 7.000000e-029
    -wip 0.65 6.500000e-001
    -wlen 0.025625 2.562500e-002

    INFO: cmd_ln.c(512): Parsing command line:
    \
    -nfilt 20 \
    -lowerf 1 \
    -upperf 4000 \
    -wlen 0.025 \
    -transform dct \
    -round_filters no \
    -remove_dc yes \
    -svspec 0-12/13-25/26-38 \
    -feat 1s_c_d_dd \
    -agc none \
    -cmn current \
    -cmninit 56,-3,1 \
    -varnorm no

    Current configuration:

    -agc none none
    -agcthresh 2.0 2.000000e+000
    -alpha 0.97 9.700000e-001
    -ceplen 13 13
    -cmn current current
    -cmninit 8.0 56,-3,1
    -dither no no
    -doublebw no no
    -feat 1s_c_d_dd 1s_c_d_dd
    -frate 100 100
    -input_endian little little
    -lda
    -ldadim 0 0
    -lifter 0 0
    -logspec no no
    -lowerf 133.33334 1.000000e+000
    -ncep 13 13
    -nfft 512 512
    -nfilt 40 20
    -remove_dc no yes
    -round_filters yes no
    -samprate 16000 1.600000e+004
    -seed -1 -1
    -smoothspec no no
    -svspec 0-12/13-25/26-38
    -transform legacy dct
    -unit_area yes yes
    -upperf 6855.4976 4.000000e+003
    -varnorm no no
    -verbose no no
    -warp_params
    -warp_type inverse_linear inverse_linear
    -wlen 0.025625 2.500000e-002

    INFO: acmod.c(238): Parsed model-specific feature parameters from
    I:\sphinx\po
    cketsphinx\model\hmm\en_US\hub4wsj_sc_8k/feat.params
    INFO: feat.c(848): Initializing feature stream to type: '1s_c_d_dd',
    ceplen=13,
    CMN='current', VARNORM='no', AGC='none'
    INFO: cmn.c(142): mean= 12.00, mean= 0.0
    INFO: acmod.c(163): Using subvector specification 0-12/13-25/26-38
    INFO: mdef.c(520): Reading model definition:
    I:\sphinx\pocketsphinx\model\hm
    m\en_US\hub4wsj_sc_8k/mdef
    INFO: mdef.c(531): Found byte-order mark BMDF, assuming this is a binary mdef
    fi
    le
    INFO: bin_mdef.c(330): Reading binary model definition:
    I:\sphinx\pocketsphinx
    \model\hmm\en_US\hub4wsj_sc_8k/mdef
    INFO: bin_mdef.c(508): 50 CI-phone, 143047 CD-phone, 3 emitstate/phone, 150
    CI-s
    en, 5150 Sen, 27135 Sen-Seq
    INFO: tmat.c(205): Reading HMM transition probability matrices:
    I:\sphinx\pock
    etsphinx\model\hmm\en_US\hub4wsj_sc_8k/transition_matrices
    INFO: acmod.c(117): Attempting to use SCHMM computation module
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
    I:\sphinx\pocketsp
    hinx\model\hmm\en_US\hub4wsj_sc_8k/means
    INFO: ms_gauden.c(292): 1 codebook, 3 feature, size
    256x13 256x13 256x13
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
    I:\sphinx\pocketsp
    hinx\model\hmm\en_US\hub4wsj_sc_8k/variances
    INFO: ms_gauden.c(292): 1 codebook, 3 feature, size
    256x13 256x13 256x13
    INFO: ms_gauden.c(356): 0 variance values floored
    INFO: s2_semi_mgau.c(897): Loading senones from dump file
    I:\sphinx\pocketsphi
    nx\model\hmm\en_US\hub4wsj_sc_8k/sendump
    INFO: s2_semi_mgau.c(921): BEGIN FILE FORMAT DESCRIPTION
    INFO: s2_semi_mgau.c(1016): Using memory-mapped I/O for senones
    INFO: s2_semi_mgau.c(1293): Maximum top-N: 4 Top-N beams: 0 0 0
    INFO: dict.c(294): Allocating 4122 * 20 bytes (80 KiB) for word entries
    INFO: dict.c(306): Reading main dictionary: I:\sphinx\testdata\2803.dic
    INFO: dict.c(206): Allocated 0 KiB for strings, 0 KiB for phones
    INFO: dict.c(309): 15 words read
    INFO: dict.c(314): Reading filler dictionary:
    I:\sphinx\pocketsphinx\model\h
    mm\en_US\hub4wsj_sc_8k/noisedict
    INFO: dict.c(206): Allocated 0 KiB for strings, 0 KiB for phones
    INFO: dict.c(317): 11 words read
    INFO: dict2pid.c(396): Building PID tables for dictionary
    INFO: dict2pid.c(405): Allocating 50^3 * 2 bytes (244 KiB) for word-initial
    trip
    hones
    INFO: dict2pid.c(131): Allocated 30200 bytes (29 KiB) for word-final triphones
    INFO: dict2pid.c(195): Allocated 30200 bytes (29 KiB) for single-phone word
    trip
    hones
    INFO: ngram_model_arpa.c(476): ngrams 1=13, 2=18, 3=13
    INFO: ngram_model_arpa.c(135): Reading unigrams
    INFO: ngram_model_arpa.c(515): 13 = #unigrams created
    INFO: ngram_model_arpa.c(194): Reading bigrams
    INFO: ngram_model_arpa.c(531): 18 = #bigrams created
    INFO: ngram_model_arpa.c(532): 5 = #prob2 entries
    INFO: ngram_model_arpa.c(539): 3 = #bo_wt2 entries
    INFO: ngram_model_arpa.c(291): Reading trigrams
    INFO: ngram_model_arpa.c(552): 13 = #trigrams created
    INFO: ngram_model_arpa.c(553): 3 = #prob3 entries
    INFO: ngram_search_fwdtree.c(99): 13 unique initial diphones
    INFO: ngram_search_fwdtree.c(147): 0 root, 0 non-root channels, 12 single-
    phone
    words
    INFO: ngram_search_fwdtree.c(186): Creating search tree
    INFO: ngram_search_fwdtree.c(191): before: 0 root, 0 non-root channels, 12
    singl
    e-phone words
    INFO: ngram_search_fwdtree.c(324): after: max nonroot chan increased to 160
    INFO: ngram_search_fwdtree.c(333): after: 13 root, 32 non-root channels, 11
    sing
    le-phone words
    INFO: ngram_search_fwdflat.c(153): fwdflat: min_ef_width = 4, max_sf_win = 25
    INFO: cmn.c(175): CMN: 32.27 0.79 1.00 -0.22 0.27 0.28 0.36 0.55 0.13 0.
    43 0.74 0.62 0.59
    INFO: ngram_search.c(407): Resized backpointer table to 10000 entries
    INFO: ngram_search.c(407): Resized backpointer table to 20000 entries
    INFO: ngram_search_fwdtree.c(1513): 11865 words recognized (11/fr)
    INFO: ngram_search_fwdtree.c(1515): 189790 senones evaluated (172/fr)
    INFO: ngram_search_fwdtree.c(1517): 83886 channels searched (75/fr), 14194 1s
    t, 43984 last
    INFO: ngram_search_fwdtree.c(1521): 13251 words for which last channels evalu
    ated (12/fr)
    INFO: ngram_search_fwdtree.c(1524): 3957 candidate words for entering last p
    hone (3/fr)
    INFO: ngram_search_fwdflat.c(295): Utterance vocabulary contains 11 words
    INFO: ngram_search_fwdflat.c(912): 2294 words recognized (2/fr)
    INFO: ngram_search_fwdflat.c(914): 106556 senones evaluated (97/fr)
    INFO: ngram_search_fwdflat.c(916): 61801 channels searched (55/fr)
    INFO: ngram_search_fwdflat.c(918): 4007 words searched (3/fr)
    INFO: ngram_search_fwdflat.c(920): 1741 word transitions (1/fr)
    INFO: ngram_search.c(1137): lattice start node .0 end node .887
    INFO: ps_lattice.c(1228): Normalizer P(O) = alpha(:887:1102) = -6109972
    INFO: ps_lattice.c(1266): Joint P(O,S) = -6149875 P(S|O) = -39903
    INFO: batch.c(661): ngram: 11.03 seconds speech, 0.31 seconds CPU, 0.34
    seconds
    wall
    INFO: batch.c(663): ngram: 0.03 xRT (CPU), 0.03 xRT (elapsed)
    INFO: batch.c(675): TOTAL 11.03 seconds speech, 0.31 seconds CPU, 0.34 seconds
    w
    all
    INFO: batch.c(677): AVERAGE 0.03 xRT (CPU), 0.03 xRT (elapsed)
    Press any key to continue . . .

     
  • Nickolay V. Shmyrev

    Hello

    Nbestdir is implemented only in 0.7

     

Log in to post a comment.