Menu

sphinx3

2009-07-07
2012-09-22
  • Padmavathi Jageti

    Hi

    when I decoding my database at command line i got the following error.

    C:\hanusri\sphinx3\bin\debug>sphinx3_decode -hmm \telugu\model_parameters\telugu
    .cd_cont_1000\ -dict \telugu\etc\telugu.dic -fdict \telugu\etc\telugu.filler -ct
    l \telugu\etc\telugu_test.fileids -cepext mfc -senmgau .cont. -cepdir \telugu\fe
    at -agc none -lm \telugu\etc\telugu.ug.lm.DMP -cmn current -feat 1s_s_d_dd

    -ppathdebug no no
    -ptranskip 0 0
    -remove_dc no no
    -round_filters yes yes
    -samprate 16000 1.600000e+004
    -seed -1 -1
    -senmgau .cont. .cont.
    -silprob 0.1 1.000000e-001
    -smoothspec no no
    -spec2cep no no
    -subvq
    -subvqbeam 3.0e-3 3.000000e-003
    -svq4svq no no
    -svspec
    -tighten_factor 0.5 5.000000e-001
    -tmat
    -tmatfloor 0.0001 1.000000e-004
    -topn 4 4
    -tracewhmm
    -transform legacy legacy
    -treeugprob yes yes
    -unit_area yes yes
    -upperf 6855.4976 6.855498e+003
    -utt
    -uw 0.7 7.000000e-001
    -var
    -varfloor 0.0001 1.000000e-004
    -varnorm no no
    -verbose no no
    -vqeval 3 3
    -warp_params
    -warp_type inverse_linear inverse_linear
    -wbeam 1.0e-35 1.000000e-035
    -wend_beam 1.0e-80 1.000000e-080
    -wip 0.7 7.000000e-001
    -wlen 0.025625 2.562500e-002
    -worddumpef 200000000 200000000
    -worddumpsf 200000000 200000000

    INFO: kbcore.c(433): Begin Initialization of Core Models:
    INFO: cmd_ln.c(506): Parsing command line:
    \
    -alpha 0.97 \
    -dither yes \
    -doublebw no \
    -nfilt 40 \
    -ncep 13 \
    -lowerf 133.33334 \
    -upperf 6855.4976 \
    -nfft 512 \
    -wlen 0.0256 \
    -transform legacy \
    -feat 1s_c_d_dd \
    -agc none \
    -cmn current \
    -varnorm no

    Current configuration:
    [NAME] [DEFLT] [VALUE]
    -agc none none
    -agcthresh 2.0 2.000000e+000
    -alpha 0.97 9.700000e-001
    -cep2spec no no
    -ceplen 13 13
    -cmn current current
    -cmninit 8.0 8.0
    -dither no yes
    -doublebw no no
    -feat 1s_c_d_dd 1s_c_d_dd
    -frate 100 100
    -input_endian little little
    -lda
    -ldadim 0 0
    -lifter 0 0
    -logspec no no
    -lowerf 133.33334 1.333333e+002
    -ncep 13 13
    -nfft 512 512
    -nfilt 40 40
    -remove_dc no no
    -round_filters yes yes
    -samprate 16000 1.600000e+004
    -seed -1 -1
    -smoothspec no no
    -spec2cep no no
    -svspec
    -transform legacy legacy
    -unit_area yes yes
    -upperf 6855.4976 6.855498e+003
    -varnorm no no
    -verbose no no
    -warp_params
    -warp_type inverse_linear inverse_linear
    -wlen 0.025625 2.560000e-002

    INFO: Initialization of the log add table
    INFO: Log-Add table size = 29350 x 2 >> 0
    INFO:
    INFO: feat.c(848): Initializing feature stream to type: '1s_c_d_dd', ceplen=13,
    CMN='current', VARNORM='no', AGC='none'
    INFO: cmn.c(142): mean[0]= 12.00, mean[1..12]= 0.0
    INFO: kbcore.c(480): .cont.
    INFO: Initialization of feat_t, report:
    INFO: Feature type = 1s_c_d_dd
    INFO: Cepstral size = 13
    INFO: Number of streams = 1
    INFO: Vector size of stream[0]: 39
    INFO: Number of subvectors = 0
    INFO: Whether CMN is used = 1
    INFO: Whether AGC is used = 0
    INFO: Whether variance is normalized = 0
    INFO:
    INFO: Reading HMM in Sphinx 3 Model format
    INFO: Model Definition File: \telugu\model_parameters\telugu.cd_cont_1000\/mde
    f
    INFO: Mean File: \telugu\model_parameters\telugu.cd_cont_1000\/means
    INFO: Variance File: \telugu\model_parameters\telugu.cd_cont_1000\/variances
    INFO: Mixture Weight File: \telugu\model_parameters\telugu.cd_cont_1000\/mixtu
    re_weights
    INFO: Transition Matrices File: \telugu\model_parameters\telugu.cd_cont_1000\/
    transition_matrices
    INFO: mdef.c(682): Reading model definition: \telugu\model_parameters\telugu.cd_
    cont_1000\/mdef
    INFO: Initialization of mdef_t, report:
    INFO: 23 CI-phone, 180 CD-phone, 3 emitstate/phone, 69 CI-sen, 219 Sen, 78 Sen
    -Seq
    INFO:
    INFO: kbcore.c(288): Using optimized GMM computation for Continuous HMM, -topn w
    ill be ignored
    INFO: cont_mgau.c(163): Reading mixture gaussian file '\telugu\model_parameters\
    telugu.cd_cont_1000\/means'
    INFO: cont_mgau.c(422): 219 mixture Gaussians, 8 components, 1 streams, veclen 3
    9
    INFO: cont_mgau.c(163): Reading mixture gaussian file '\telugu\model_parameters\
    telugu.cd_cont_1000\/variances'
    INFO: cont_mgau.c(422): 219 mixture Gaussians, 8 components, 1 streams, veclen 3
    9
    INFO: cont_mgau.c(510): Reading mixture weights file '\telugu\model_parameters\t
    elugu.cd_cont_1000\/mixture_weights'
    INFO: cont_mgau.c(665): Read 219 x 8 mixture weights
    INFO: cont_mgau.c(693): Removing uninitialized Gaussian densities
    52 75 78 81 116 143 155 168 169 172 176 184 190 195 201
    WARNING: "cont_mgau.c", line 767: 184 densities removed (15 mixtures removed ent
    irely)
    INFO: cont_mgau.c(783): Applying variance floor
    INFO: cont_mgau.c(801): 7975 variance values floored
    INFO: cont_mgau.c(849): Precomputing Mahalanobis distance invariants
    INFO: tmat.c(169): Reading HMM transition probability matrices: \telugu\model_pa
    rameters\telugu.cd_cont_1000\/transition_matrices
    INFO: Initialization of tmat_t, report:
    INFO: Read 23 transition matrices of size 3x4
    INFO:
    INFO: dict.c(475): Reading main dictionary: \telugu\etc\telugu.dic
    INFO: dict.c(478): 14 words read
    INFO: dict.c(483): Reading filler dictionary: \telugu\etc\telugu.filler
    INFO: dict.c(486): 3 words read
    INFO: Initialization of dict_t, report:
    INFO: No of CI phone: 0
    INFO: Max word: 4113
    INFO: No of word: 17
    INFO:
    INFO: lm.c(606): LM read('\telugu\etc\telugu.ug.lm.DMP', lw= 9.50, wip= 0.70, uw
    = 0.70)
    INFO: lm.c(608): Reading LM file \telugu\etc\telugu.ug.lm.DMP (LM name "default"
    )
    INFO: lm_3g_dmp.c(630): Reading LM in 16 bits format
    INFO: lm_3g_dmp.c(686): Read 16 unigrams [in memory]
    INFO: lm_3g_dmp.c(759): 30 bigrams [on disk]
    INFO: lm_3g_dmp.c(832): 42 bigrams [on disk]
    INFO: lm_3g_dmp.c(902): 4 bigram prob entries
    INFO: lm_3g_dmp.c(936): 4 trigram bowt entries
    INFO: lm_3g_dmp.c(967): 3 trigram prob entries
    INFO: lm_3g_dmp.c(998): 4 trigram segtable entries (8 segsize)
    INFO: lm_3g_dmp.c(1053): 16 word strings
    INFO: lm.c(691): The LM routine is operating at 16 bits mode
    ERROR: "wid.c", line 282: AYIDHU is not a word in dictionary and it is not a cla
    ss tag.
    ERROR: "wid.c", line 282: ENMIDI is not a word in dictionary and it is not a cla
    ss tag.
    ERROR: "wid.c", line 282: VOKATI is not a word in dictionary and it is not a cla
    ss tag.
    ERROR: "wid.c", line 282: YEDU is not a word in dictionary and it is not a class
    tag.
    INFO: wid.c(292): 4 LM words not in dictionary; ignored
    INFO: Initialization of fillpen_t, report:
    INFO: Language weight =9.500000
    INFO: Word Insertion Penalty =0.700000
    INFO: Silence probability =0.100000
    INFO: Filler probability =0.100000
    INFO:
    INFO: dict2pid.c(599): Building PID tables for dictionary
    INFO: Initialization of dict2pid_t, report:
    INFO: Dict2pid is in composite triphone mode
    INFO: 63 composite states; 21 composite sseq
    INFO:
    INFO: kbcore.c(632): Inside kbcore: Verifying models consistency ......
    INFO: kbcore.c(654): End of Initialization of Core Models:
    INFO: Initialization of beam_t, report:
    INFO: Parameters used in Beam Pruning of Viterbi Search:
    INFO: Beam=-422133
    INFO: PBeam=-383758
    INFO: WBeam=-268630 (Skip=0)
    INFO: WEndBeam=-614012
    INFO: No of CI Phone assumed=23
    INFO:
    INFO: Initialization of fast_gmm_t, report:
    INFO: Parameters used in Fast GMM computation:
    INFO: Frame-level: Down Sampling Ratio 1, Conditional Down Sampling? 0, Dis
    tance-based Down Sampling? 0
    INFO: GMM-level: CI phone beam -614012. MAX CD 100000
    INFO: Gaussian-level: GS map would be used for Gaussian Selection? =1, SVQ wou
    ld be used as Gaussian Score? =0 SubVQ Beam -19363
    INFO:
    INFO: Initialization of pl_t, report:
    INFO: Parameters used in phoneme lookahead:
    INFO: Phoneme look-ahead type = 0
    INFO: Phoneme look-ahead beam size = 65945
    INFO: No of CI Phones assumed=23
    INFO:
    INFO: Initialization of ascr_t, report:
    INFO: No. of CI senone =69
    INFO: No. of senone = 219
    INFO: No. of composite senone = 63
    INFO: No. of senone sequence = 78
    INFO: No. of composite senone sequence=21
    INFO: Parameters used in phoneme lookahead:
    INFO: Phoneme lookahead window = 1
    INFO:
    INFO: kb.c(306): SEARCH MODE INDEX 4
    INFO: srch.c(373): Search Initialization.
    WARNING: "srch_time_switch_tree.c", line 283: -Nstalextree is omitted in TST sea
    rch.
    INFO: lextree.c(222): Creating Unigram Table for lm (name: default)
    INFO: lextree.c(235): Size of word table after unigram + words in class: 10.
    INFO: lextree.c(244): Size of word table after adding alternative prons: 14.
    INFO: lextree_t, report:
    INFO: Parameters of the lexical tree.
    INFO: Type of the tree 0 (0:unigram, 1: 2g, 2: 3g etc.)
    INFO: Number of left contexts 4
    INFO: Number of node 68
    INFO: Number of links in the tree 80
    INFO: The previous word for this tree
    INFO: The size of a node of the lexical tree 96
    INFO: The size of a gnode_t 16
    INFO:
    INFO: srch_time_switch_tree.c(343): Lextrees (0) for lm 0, its name is default,
    it has 68 nodes(ug)
    INFO: lextree.c(222): Creating Unigram Table for lm (name: default)
    INFO: lextree.c(235): Size of word table after unigram + words in class: 10.
    INFO: lextree.c(244): Size of word table after adding alternative prons: 14.
    INFO: lextree_t, report:
    INFO: Parameters of the lexical tree.
    INFO: Type of the tree 0 (0:unigram, 1: 2g, 2: 3g etc.)
    INFO: Number of left contexts 4
    INFO: Number of node 68
    INFO: Number of links in the tree 80
    INFO: The previous word for this tree
    INFO: The size of a node of the lexical tree 96
    INFO: The size of a gnode_t 16
    INFO:
    INFO: srch_time_switch_tree.c(343): Lextrees (1) for lm 0, its name is default,
    it has 68 nodes(ug)
    INFO: lextree.c(222): Creating Unigram Table for lm (name: default)
    INFO: lextree.c(235): Size of word table after unigram + words in class: 10.
    INFO: lextree.c(244): Size of word table after adding alternative prons: 14.
    INFO: lextree_t, report:
    INFO: Parameters of the lexical tree.
    INFO: Type of the tree 0 (0:unigram, 1: 2g, 2: 3g etc.)
    INFO: Number of left contexts 4
    INFO: Number of node 68
    INFO: Number of links in the tree 80
    INFO: The previous word for this tree
    INFO: The size of a node of the lexical tree 96
    INFO: The size of a gnode_t 16
    INFO:
    INFO: srch_time_switch_tree.c(343): Lextrees (2) for lm 0, its name is default,
    it has 68 nodes(ug)
    INFO: srch_time_switch_tree.c(350): Time for building trees, 0.0000 CPU 0.0000 C
    lk
    INFO: srch_time_switch_tree.c(372): Lextrees(0), 1 nodes(filler)
    INFO: srch_time_switch_tree.c(372): Lextrees(1), 1 nodes(filler)
    INFO: srch_time_switch_tree.c(372): Lextrees(2), 1 nodes(filler)
    INFO: vithist.c(168): Initializing Viterbi-history module
    INFO: Initialization of srch_t, report:
    INFO: Operation Mode = 4, Operation Name = fwdtree
    INFO:

    INFO: utt.c(195): Processing: usr1_0_1
    INFO: feat.c(1148): At directory \telugu\feat
    INFO: feat.c(378): Reading mfc file: '\telugu\feat/project_test_clstk\usr1_0_1mf
    c'[0..-1]
    SYSTEM_ERROR: "pio.c", line 427: stat(\telugu\feat/project_test_clstk\usr1_0_1mf
    c) failed; retrying...
    ; No such file or directory
    ERROR: "feat.c", line 387: stat_retry/fopen(\telugu\feat/project_test_clstk\usr1
    _0_1mfc) failed
    FATAL_ERROR: "utt.c", line 237: Cannot read file project_test_clstk\usr1_0_1. Fo
    rced exit

     
    • eliasmajic

      eliasmajic - 2009-07-07

      I am not sure but , this:

      ERROR: "wid.c", line 282: AYIDHU is not a word in dictionary and it is not a cla
      ss tag.

      Put it in the dictionary or remove it from the transcription.

       
    • Nickolay V. Shmyrev

      And also this line

      ERROR: "feat.c", line 387: stat_retry/fopen(\telugu\feat/project_test_clstk\usr1
      _0_1mfc) failed

      is self explanatory, I think you'll manage to understand it. The issue is that you don't have dot before mfc in -cepext argument. Use it like -cepext .mfc (note the dot here).

       
    • Padmavathi Jageti

      hi

      Thanks for given reply nshm and eliasmajic

      I placed .mfc instead of mfc,its works fine

      when I decode adapt model using sphinx3 I got the following error

           C:\srisai\sphinx3>sphinx3_decode -mean means -var variances -mixw mixture_weight
      

      s -mdef mdef -tmat transition_matrices -dict adapt5.dic -fdict adapt5.filler -ct
      l adapt5_test.fileids -cepext .mfc -senmgau .cont. -agc none -lm adapt5.ug.lm.DM
      P -cmn current -mllr adapt5_matrix -feat 1s_s_d_dd
      INFO: info.c(70): sphinx3_decode Compiled on: Jun 24 2009, AT: 02:26:52

      INFO: cmd_ln.c(506): Parsing command line:
      sphinx3_decode \
      -mean means \
      -var variances \
      -mixw mixture_weights \
      -mdef mdef \
      -tmat transition_matrices \
      -dict adapt5.dic \
      -fdict adapt5.filler \
      -ctl adapt5_test.fileids \
      -cepext .mfc \
      -senmgau .cont. \
      -agc none \
      -lm adapt5.ug.lm.DMP \
      -cmn current \
      -mllr adapt5_matrix \
      -feat 1s_s_d_dd

      Current configuration:
      [NAME] [DEFLT] [VALUE]
      -adchdr 0 0
      -adcin no no
      -agc none none
      -agcthresh 2.0 2.000000e+000
      -alpha 0.97 9.700000e-001
      -backtrace yes yes
      -beam 1.0e-55 1.000000e-055
      -bestpath no no
      -bestpathlw 0.000000e+000
      -bestscoredir
      -bestsenscrdir
      -bghist no no
      -bptbldir
      -bptblsize 32768 32768
      -cb2mllr .1cls. .1cls.
      -cep2spec no no
      -cepdir
      -cepext .mfc .mfc
      -ceplen 13 13
      -ci_pbeam 1e-80 1.000000e-080
      -cmn current current
      -cmninit 8.0 8.0
      -cond_ds no no
      -ctl adapt5_test.fileids
      -ctlcount 1000000000 1000000000
      -ctloffset 0 0
      -ctl_lm
      -ctl_mllr
      -dagfudge 2 2
      -dict adapt5.dic
      -dist_ds no no
      -dither no no
      -doublebw no no
      -ds 1 1
      -epl 3 3
      -fdict adapt5.filler
      -feat 1s_c_d_dd 1s_s_d_dd
      -featparams
      -fillpen
      -fillprob 0.1 1.000000e-001
      -frate 100 100
      -fsg
      -fsgusealtpron yes yes
      -fsgusefiller yes yes
      -gs
      -gs4gs yes yes
      -hmm
      -hmmdump no no
      -hmmdumpef 200000000 200000000
      -hmmdumpsf 200000000 200000000
      -hmmhistbinsize 5000 5000
      -hyp
      -hypseg
      -hypsegscore_unscale yes yes
      -inlatdir
      -inlatwin 50 50
      -input_endian little little
      -kdmaxbbi -1 -1
      -kdmaxdepth 0 0
      -kdtree
      -latcompress yes yes
      -latext lat.gz lat.gz
      -lda
      -ldadim 0 0
      -lextreedump 0 0
      -lifter 0 0
      -lm adapt5.ug.lm.DMP
      -lmctlfn
      -lmdumpdir
      -lmname
      -log3table yes yes
      -logbase 1.0003 1.000300e+000
      -logfn
      -logspec no no
      -lowerf 133.33334 1.333333e+002
      -lts_mismatch no no
      -lw 9.5 9.500000e+000
      -maxcdsenpf 100000 100000
      -maxedge 2000000 2000000
      -maxhistpf 100 100
      -maxhmmpf 20000 20000
      -maxlmop 100000000 100000000
      -maxlpf 40000 40000
      -maxppath 1000000 1000000
      -maxwpf 20 20
      -mdef mdef
      -mean means
      -min_endfr 3 3
      -mixw mixture_weights
      -mixwfloor 0.0000001 1.000000e-007
      -mllr adapt5_matrix
      -mode fwdtree fwdtree
      -nbest 200 200
      -nbestdir
      -nbestext nbest.gz nbest.gz
      -ncep 13 13
      -nfft 512 512
      -nfilt 40 40
      -Nlextree 3 3
      -Nstalextree 25 25
      -op_mode -1 -1
      -outlatdir
      -outlatfmt s3 s3
      -pbeam 1.0e-50 1.000000e-050
      -pheurtype 0 0
      -phonepen 1.0 1.000000e+000
      -phsegdir
      -pl_beam 1.0e-80 1.000000e-080
      -pl_window 1 1
      -ppathdebug no no
      -ptranskip 0 0
      -remove_dc no no
      -round_filters yes yes
      -samprate 16000 1.600000e+004
      -seed -1 -1
      -senmgau .cont. .cont.
      -silprob 0.1 1.000000e-001
      -smoothspec no no
      -spec2cep no no
      -subvq
      -subvqbeam 3.0e-3 3.000000e-003
      -svq4svq no no
      -svspec
      -tighten_factor 0.5 5.000000e-001
      -tmat transition_matrices
      -tmatfloor 0.0001 1.000000e-004
      -topn 4 4
      -tracewhmm
      -transform legacy legacy
      -treeugprob yes yes
      -unit_area yes yes
      -upperf 6855.4976 6.855498e+003
      -utt
      -uw 0.7 7.000000e-001
      -var variances
      -varfloor 0.0001 1.000000e-004
      -varnorm no no
      -verbose no no
      -vqeval 3 3
      -warp_params
      -warp_type inverse_linear inverse_linear
      -wbeam 1.0e-35 1.000000e-035
      -wend_beam 1.0e-80 1.000000e-080
      -wip 0.7 7.000000e-001
      -wlen 0.025625 2.562500e-002
      -worddumpef 200000000 200000000
      -worddumpsf 200000000 200000000

      INFO: kbcore.c(433): Begin Initialization of Core Models:
      INFO: Initialization of the log add table
      INFO: Log-Add table size = 29350 x 2 >> 0
      INFO:
      INFO: feat.c(848): Initializing feature stream to type: '1s_s_d_dd', ceplen=13,
      CMN='current', VARNORM='no', AGC='none'
      INFO: cmn.c(142): mean[0]= 12.00, mean[1..0]= 0.0
      INFO: kbcore.c(480): .cont.
      INFO: Initialization of feat_t, report:
      INFO: Feature type = 1s_s_d_dd
      INFO: Cepstral size = 1
      INFO: Number of streams = 1
      INFO: Vector size of stream[0]: 1
      INFO: Number of subvectors = 0
      INFO: Whether CMN is used = 1
      INFO: Whether AGC is used = 0
      INFO: Whether variance is normalized = 0
      INFO:
      INFO: Reading HMM in Sphinx 3 Model format
      INFO: Model Definition File: mdef
      INFO: Mean File: means
      INFO: Variance File: variances
      INFO: Mixture Weight File: mixture_weights
      INFO: Transition Matrices File: transition_matrices
      INFO: mdef.c(682): Reading model definition: mdef
      SYSTEM_ERROR: "mdef.c", line 687: fopen(mdef,r) failed
      ; No such file or directory

      C:\srisai\sphinx3>

       

Log in to post a comment.