Menu

Can't load voxforge model with pocketsphinx

Help
Anonymous
2011-10-29
2012-09-22
  • Anonymous

    Anonymous - 2011-10-29

    Hello,

    I am using pocketsphinx with the voxforge acoustic model (available on
    voxforge.org), and it was working fine with the 0.7 release. When I try to
    load the model with the latest revision of sphinxbase/pocketsphinx on svn, I
    get an error and pocketsphinx wont initialize. Here is what I get:

    INFO: acmod.c(246): Parsed model-specific feature parameters from
    /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//feat.params
    INFO: fe_interface.c(289): You are using the internal mechanism to generate
    the seed.
    INFO: feat.c(684): Initializing feature stream to type: '1s_c_d_dd',
    ceplen=13, CMN='current', VARNORM='no', AGC='none'
    INFO: cmn.c(142): mean= 12.00, mean= 0.0
    INFO: acmod.c(157): Reading linear feature transformation from
    /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//feature_transform
    INFO: mdef.c(520): Reading model definition:
    /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//mdef
    INFO: bin_mdef.c(179): Allocating 104810 * 8 bytes (818 KiB) for CD tree
    INFO: tmat.c(205): Reading HMM transition probability matrices: /home/sylvain/
    resources/am/voxforge_en_sphinx.cd_cont_5000//transition_matrices
    INFO: acmod.c(121): Attempting to use SCHMM computation module
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
    /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//means
    INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
    INFO: ms_gauden.c(294): 16x29
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
    /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//variances
    INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
    INFO: ms_gauden.c(294): 16x29
    INFO: ms_gauden.c(354): 175 variance values floored
    INFO: acmod.c(123): Attempting to use PTHMM computation module
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
    /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//means
    INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
    INFO: ms_gauden.c(294): 16x29
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
    /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//variances
    INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
    INFO: ms_gauden.c(294): 16x29
    INFO: ms_gauden.c(354): 175 variance values floored
    INFO: ptm_mgau.c(800): Number of codebooks exceeds 256: 5120
    INFO: acmod.c(125): Falling back to general multi-stream GMM computation
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
    /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//means
    INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
    INFO: ms_gauden.c(294): 16x29
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
    /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//variances
    INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
    INFO: ms_gauden.c(294): 16x29
    INFO: ms_gauden.c(354): 175 variance values floored
    ERROR: "ms_mgau.c", line 112: Dimension of stream 29 does not match: 39 !=
    -1261814208

    Is there a problem with the voxforge model? Is there a way to fix it?

    Many thanks in advance,

    Sylvain

     
  • Nickolay V. Shmyrev

    Please provide the full log including the command line parameters

    Please provide the information about your operating system, installation
    details, hardware

     
  • Anonymous

    Anonymous - 2011-10-30

    Thanks for the quick reply.

    I am using Linux, Windows, and Mac OSX and see the same thing on all 3 OSes
    (it was working well on all 3 OSes with 0.7 release, using the same acoustic
    model). What I do not understand is that the model is a continuous model,
    features are 1s_c_d_dd, and it says "Falling back to general multi-stream GMM
    computation", but I thought 1s_c_d_dd means that there is only one stream.
    Here is the complete output, including the command line parameters:

    INFO: cmd_ln.c(691): Parsing command line:
    \
    -dict /home/sylvain/resources/loop.dic \
    -fsg /home/sylvain/resources/loop.fsg \
    -hmm /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000/ \
    -bestpath no

    Current configuration:

    -adcdev
    -agc none none
    -agcthresh 2.0 2.000000e+00
    -alpha 0.97 9.700000e-01
    -argfile
    -ascale 20.0 2.000000e+01
    -aw 1 1
    -backtrace no no
    -beam 1e-48 1.000000e-48
    -bestpath yes no
    -bestpathlw 9.5 9.500000e+00
    -bghist no no
    -ceplen 13 13
    -cmn current current
    -cmninit 8.0 8.0
    -compallsen no no
    -debug 0
    -dict /home/sylvain/resources/loop.dic
    -dictcase no no
    -dither no no
    -doublebw no no
    -ds 1 1
    -fdict
    -feat 1s_c_d_dd 1s_c_d_dd
    -featparams
    -fillprob 1e-8 1.000000e-08
    -frate 100 100
    -fsg /home/sylvain/resources/loop.fsg
    -fsgusealtpron yes yes
    -fsgusefiller yes yes
    -fwdflat yes yes
    -fwdflatbeam 1e-64 1.000000e-64
    -fwdflatefwid 4 4
    -fwdflatlw 8.5 8.500000e+00
    -fwdflatsfwin 25 25
    -fwdflatwbeam 7e-29 7.000000e-29
    -fwdtree yes yes
    -hmm /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000/
    -infile
    -input_endian little little
    -jsgf
    -kdmaxbbi -1 -1
    -kdmaxdepth 0 0
    -kdtree
    -latsize 5000 5000
    -lda
    -ldadim 0 0
    -lextreedump 0 0
    -lifter 0 0
    -lm
    -lmctl
    -lmname default default
    -logbase 1.0001 1.000100e+00
    -logfn
    -logspec no no
    -lowerf 133.33334 1.333333e+02
    -lpbeam 1e-40 1.000000e-40
    -lponlybeam 7e-29 7.000000e-29
    -lw 6.5 6.500000e+00
    -maxhmmpf -1 -1
    -maxnewoov 20 20
    -maxwpf -1 -1
    -mdef
    -mean
    -mfclogdir
    -min_endfr 0 0
    -mixw
    -mixwfloor 0.0000001 1.000000e-07
    -mllr
    -mmap yes yes
    -ncep 13 13
    -nfft 512 512
    -nfilt 40 40
    -nwpen 1.0 1.000000e+00
    -pbeam 1e-48 1.000000e-48
    -pip 1.0 1.000000e+00
    -pl_beam 1e-10 1.000000e-10
    -pl_pbeam 1e-5 1.000000e-05
    -pl_window 0 0
    -rawlogdir
    -remove_dc no no
    -round_filters yes yes
    -samprate 16000 1.600000e+04
    -seed -1 -1
    -sendump
    -senlogdir
    -senmgau
    -silprob 0.005 5.000000e-03
    -smoothspec no no
    -svspec
    -time no no
    -tmat
    -tmatfloor 0.0001 1.000000e-04
    -topn 4 4
    -topn_beam 0 0
    -toprule
    -transform legacy legacy
    -unit_area yes yes
    -upperf 6855.4976 6.855498e+03
    -usewdphones no no
    -uw 1.0 1.000000e+00
    -var
    -varfloor 0.0001 1.000000e-04
    -varnorm no no
    -verbose no no
    -warp_params
    -warp_type inverse_linear inverse_linear
    -wbeam 7e-29 7.000000e-29
    -wip 0.65 6.500000e-01
    -wlen 0.025625 2.562500e-02

    INFO: cmd_ln.c(691): Parsing command line:
    \
    -alpha 0.97 \
    -dither yes \
    -doublebw no \
    -nfilt 40 \
    -ncep 13 \
    -lowerf 133.333334 \
    -upperf 6855.4976 \
    -nfft 512 \
    -wlen 0.025625 \
    -transform legacy \
    -feat 1s_c_d_dd \
    -agc none \
    -cmn current \
    -varnorm no

    Current configuration:

    -agc none none
    -agcthresh 2.0 2.000000e+00
    -alpha 0.97 9.700000e-01
    -ceplen 13 13
    -cmn current current
    -cmninit 8.0 8.0
    -dither no yes
    -doublebw no no
    -feat 1s_c_d_dd 1s_c_d_dd
    -frate 100 100
    -input_endian little little
    -lda /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//feature_transform
    -ldadim 0 0
    -lifter 0 0
    -logspec no no
    -lowerf 133.33334 1.333333e+02
    -ncep 13 13
    -nfft 512 512
    -nfilt 40 40
    -remove_dc no no
    -round_filters yes yes
    -samprate 16000 1.600000e+04
    -seed -1 -1
    -smoothspec no no
    -svspec
    -transform legacy legacy
    -unit_area yes yes
    -upperf 6855.4976 6.855498e+03
    -varnorm no no
    -verbose no no
    -warp_params
    -warp_type inverse_linear inverse_linear
    -wlen 0.025625 2.562500e-02

    INFO: acmod.c(246): Parsed model-specific feature parameters from
    /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//feat.params
    INFO: fe_interface.c(289): You are using the internal mechanism to generate
    the seed.
    INFO: feat.c(684): Initializing feature stream to type: '1s_c_d_dd',
    ceplen=13, CMN='current', VARNORM='no', AGC='none'
    INFO: cmn.c(142): mean= 12.00, mean= 0.0
    INFO: acmod.c(157): Reading linear feature transformation from
    /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//feature_transform
    INFO: mdef.c(520): Reading model definition:
    /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//mdef
    INFO: bin_mdef.c(179): Allocating 104810 * 8 bytes (818 KiB) for CD tree
    INFO: tmat.c(205): Reading HMM transition probability matrices: /home/sylvain/
    resources/am/voxforge_en_sphinx.cd_cont_5000//transition_matrices
    INFO: acmod.c(121): Attempting to use SCHMM computation module
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
    /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//means
    INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
    INFO: ms_gauden.c(294): 16x29
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
    /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//variances
    INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
    INFO: ms_gauden.c(294): 16x29
    INFO: ms_gauden.c(354): 175 variance values floored
    INFO: acmod.c(123): Attempting to use PTHMM computation module
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
    /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//means
    INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
    INFO: ms_gauden.c(294): 16x29
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
    /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//variances
    INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
    INFO: ms_gauden.c(294): 16x29
    INFO: ms_gauden.c(354): 175 variance values floored
    INFO: ptm_mgau.c(800): Number of codebooks exceeds 256: 5120
    INFO: acmod.c(125): Falling back to general multi-stream GMM computation
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
    /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//means
    INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
    INFO: ms_gauden.c(294): 16x29
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
    /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//variances
    INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
    INFO: ms_gauden.c(294): 16x29
    INFO: ms_gauden.c(354): 175 variance values floored
    ERROR: "ms_mgau.c", line 113: Dimension of stream 29 does not match: 39 !=
    -1260765632

    Many thanks again,

    Sylvain

     
  • Anonymous

    Anonymous - 2011-10-30

    Here is the output when I use the 0.7 release, initialization succeeds and
    recognition works:

    INFO: cmd_ln.c(691): Parsing command line:
    \
    -dict /home/sylvain/resources/loop.dic \
    -fsg /home/sylvain/resources/loop.fsg \
    -hmm /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000/ \
    -bestpath no

    Current configuration:

    -adcdev
    -agc none none
    -agcthresh 2.0 2.000000e+00
    -alpha 0.97 9.700000e-01
    -argfile
    -ascale 20.0 2.000000e+01
    -aw 1 1
    -backtrace no no
    -beam 1e-48 1.000000e-48
    -bestpath yes no
    -bestpathlw 9.5 9.500000e+00
    -bghist no no
    -ceplen 13 13
    -cmn current current
    -cmninit 8.0 8.0
    -compallsen no no
    -debug 0
    -dict /home/sylvain/resources/loop.dic
    -dictcase no no
    -dither no no
    -doublebw no no
    -ds 1 1
    -fdict
    -feat 1s_c_d_dd 1s_c_d_dd
    -featparams
    -fillprob 1e-8 1.000000e-08
    -frate 100 100
    -fsg /home/sylvain/resources/loop.fsg
    -fsgusealtpron yes yes
    -fsgusefiller yes yes
    -fwdflat yes yes
    -fwdflatbeam 1e-64 1.000000e-64
    -fwdflatefwid 4 4
    -fwdflatlw 8.5 8.500000e+00
    -fwdflatsfwin 25 25
    -fwdflatwbeam 7e-29 7.000000e-29
    -fwdtree yes yes
    -hmm /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000/
    -infile
    -input_endian little little
    -jsgf
    -kdmaxbbi -1 -1
    -kdmaxdepth 0 0
    -kdtree
    -latsize 5000 5000
    -lda
    -ldadim 0 0
    -lextreedump 0 0
    -lifter 0 0
    -lm
    -lmctl
    -lmname default default
    -logbase 1.0001 1.000100e+00
    -logfn
    -logspec no no
    -lowerf 133.33334 1.333333e+02
    -lpbeam 1e-40 1.000000e-40
    -lponlybeam 7e-29 7.000000e-29
    -lw 6.5 6.500000e+00
    -maxhmmpf -1 -1
    -maxnewoov 20 20
    -maxwpf -1 -1
    -mdef
    -mean
    -mfclogdir
    -min_endfr 0 0
    -mixw
    -mixwfloor 0.0000001 1.000000e-07
    -mllr
    -mmap yes yes
    -ncep 13 13
    -nfft 512 512
    -nfilt 40 40
    -nwpen 1.0 1.000000e+00
    -pbeam 1e-48 1.000000e-48
    -pip 1.0 1.000000e+00
    -pl_beam 1e-10 1.000000e-10
    -pl_pbeam 1e-5 1.000000e-05
    -pl_window 0 0
    -rawlogdir
    -remove_dc no no
    -round_filters yes yes
    -samprate 16000 1.600000e+04
    -seed -1 -1
    -sendump
    -senlogdir
    -senmgau
    -silprob 0.005 5.000000e-03
    -smoothspec no no
    -svspec
    -time no no
    -tmat
    -tmatfloor 0.0001 1.000000e-04
    -topn 4 4
    -topn_beam 0 0
    -toprule
    -transform legacy legacy
    -unit_area yes yes
    -upperf 6855.4976 6.855498e+03
    -usewdphones no no
    -uw 1.0 1.000000e+00
    -var
    -varfloor 0.0001 1.000000e-04
    -varnorm no no
    -verbose no no
    -warp_params
    -warp_type inverse_linear inverse_linear
    -wbeam 7e-29 7.000000e-29
    -wip 0.65 6.500000e-01
    -wlen 0.025625 2.562500e-02

    INFO: cmd_ln.c(691): Parsing command line:
    \
    -alpha 0.97 \
    -dither yes \
    -doublebw no \
    -nfilt 40 \
    -ncep 13 \
    -transform legacy \
    -lowerf 133.333334 \
    -upperf 6855.4976 \
    -nfft 512 \
    -wlen 0.025625 \
    -feat 1s_c_d_dd \
    -agc none \
    -cmn current \
    -varnorm no

    Current configuration:

    -agc none none
    -agcthresh 2.0 2.000000e+00
    -alpha 0.97 9.700000e-01
    -ceplen 13 13
    -cmn current current
    -cmninit 8.0 8.0
    -dither no yes
    -doublebw no no
    -feat 1s_c_d_dd 1s_c_d_dd
    -frate 100 100
    -input_endian little little
    -lda /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//feature_transform
    -ldadim 0 0
    -lifter 0 0
    -logspec no no
    -lowerf 133.33334 1.333333e+02
    -ncep 13 13
    -nfft 512 512
    -nfilt 40 40
    -remove_dc no no
    -round_filters yes yes
    -samprate 16000 1.600000e+04
    -seed -1 -1
    -smoothspec no no
    -svspec
    -transform legacy legacy
    -unit_area yes yes
    -upperf 6855.4976 6.855498e+03
    -varnorm no no
    -verbose no no
    -warp_params
    -warp_type inverse_linear inverse_linear
    -wlen 0.025625 2.562500e-02

    INFO: acmod.c(242): Parsed model-specific feature parameters from
    /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//feat.params
    INFO: fe_interface.c(289): You are using the internal mechanism to generate
    the seed.
    INFO: feat.c(684): Initializing feature stream to type: '1s_c_d_dd',
    ceplen=13, CMN='current', VARNORM='no', AGC='none'
    INFO: cmn.c(142): mean= 12.00, mean= 0.0
    INFO: acmod.c(153): Reading linear feature transformation from
    /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//feature_transform
    INFO: mdef.c(520): Reading model definition:
    /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//mdef
    INFO: bin_mdef.c(173): Allocating 104810 * 8 bytes (818 KiB) for CD tree
    INFO: tmat.c(205): Reading HMM transition probability matrices: /home/sylvain/
    resources/am/voxforge_en_sphinx.cd_cont_5000//transition_matrices
    INFO: acmod.c(117): Attempting to use SCHMM computation module
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
    /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//means
    INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
    INFO: ms_gauden.c(294): 16x29
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
    /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//variances
    INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
    INFO: ms_gauden.c(294): 16x29
    INFO: ms_gauden.c(354): 175 variance values floored
    INFO: acmod.c(119): Attempting to use PTHMM computation module
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
    /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//means
    INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
    INFO: ms_gauden.c(294): 16x29
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
    /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//variances
    INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
    INFO: ms_gauden.c(294): 16x29
    INFO: ms_gauden.c(354): 175 variance values floored
    INFO: ptm_mgau.c(800): Number of codebooks exceeds 256: 5120
    INFO: acmod.c(121): Falling back to general multi-stream GMM computation
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
    /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//means
    INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
    INFO: ms_gauden.c(294): 16x29
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
    /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//variances
    INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
    INFO: ms_gauden.c(294): 16x29
    INFO: ms_gauden.c(354): 175 variance values floored
    INFO: ms_senone.c(160): Reading senone mixture weights:
    /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//mixture_weights
    INFO: ms_senone.c(211): Truncating senone logs3(pdf) values by 10 bits
    INFO: ms_senone.c(218): Not transposing mixture weights in memory
    INFO: ms_senone.c(277): Read mixture weights for 5120 senones: 1 features x 16
    codewords
    INFO: ms_senone.c(331): Mapping senones to individual codebooks
    INFO: ms_mgau.c(122): The value of topn: 4
    INFO: dict.c(306): Allocating 4139 * 20 bytes (80 KiB) for word entries
    INFO: dict.c(321): Reading main dictionary: /home/sylvain/resources/loop.dic
    INFO: dict.c(212): Allocated 0 KiB for strings, 0 KiB for phones
    INFO: dict.c(324): 40 words read
    INFO: dict.c(330): Reading filler dictionary:
    /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//noisedict
    INFO: dict.c(212): Allocated 0 KiB for strings, 0 KiB for phones
    INFO: dict.c(333): 3 words read
    INFO: dict2pid.c(396): Building PID tables for dictionary
    INFO: dict2pid.c(404): Allocating 40^3 * 2 bytes (125 KiB) for word-initial
    triphones
    INFO: dict2pid.c(131): Allocated 19360 bytes (18 KiB) for word-final triphones
    INFO: dict2pid.c(195): Allocated 19360 bytes (18 KiB) for single-phone word
    triphones
    INFO: fsg_search.c(145): FSG(beam: -1080, pbeam: -1080, wbeam: -634; wip: -26,
    pip: 0)
    INFO: fsg_model.c(678): FSG: 1 states, 40 unique words, 40 transitions (0
    null)
    INFO: fsg_model.c(213): Computing transitive closure for null transitions
    INFO: fsg_model.c(264): 0 null transitions added
    INFO: fsg_model.c(411): Adding silence transitions for <sil> to FSG
    INFO: fsg_model.c(431): Added 1 silence word transitions
    INFO: fsg_search.c(364): Added 0 alternate word transitions
    INFO: fsg_lextree.c(108): Allocated 82 bytes (0 KiB) for left and right
    context phones
    INFO: fsg_lextree.c(251): 41 HMM nodes in lextree (41 leaves)
    INFO: fsg_lextree.c(253): Allocated 4428 bytes (4 KiB) for all lextree nodes
    INFO: fsg_lextree.c(256): Allocated 4428 bytes (4 KiB) for lextree leafnodes
    INFO: fsg_lextree.c(108): Allocated 82 bytes (0 KiB) for left and right
    context phones
    INFO: fsg_lextree.c(251): 41 HMM nodes in lextree (41 leaves)
    INFO: fsg_lextree.c(253): Allocated 4428 bytes (4 KiB) for all lextree nodes
    INFO: fsg_lextree.c(256): Allocated 4428 bytes (4 KiB) for lextree leafnodes </sil>

     
  • Anonymous

    Anonymous - 2011-10-30

    I tried a few more acoustic models, each time only changing what -hmm points
    to.

    These ones won't load, with the same error:

    . voxforge on voxforge.org
    . voxforge-en-0.4 on cmusphinx.org
    . voxforge-en-0.3 on cmusphinx.org

    This one loads successfully:
    . hub4opensrc.cd_continuous_8gau

    Here is the output when loading hub4 model:

    INFO: cmd_ln.c(691): Parsing command line:
    \
    -dict /home/sylvain/resources/loop.dic \
    -fsg /home/sylvain/resources/loop.fsg \
    -hmm /home/sylvain/resources/am/hub4opensrc.cd_continuous_8gau/ \
    -bestpath no

    Current configuration:

    -adcdev
    -agc none none
    -agcthresh 2.0 2.000000e+00
    -alpha 0.97 9.700000e-01
    -argfile
    -ascale 20.0 2.000000e+01
    -aw 1 1
    -backtrace no no
    -beam 1e-48 1.000000e-48
    -bestpath yes no
    -bestpathlw 9.5 9.500000e+00
    -bghist no no
    -ceplen 13 13
    -cmn current current
    -cmninit 8.0 8.0
    -compallsen no no
    -debug 0
    -dict /home/sylvain/resources/loop.dic
    -dictcase no no
    -dither no no
    -doublebw no no
    -ds 1 1
    -fdict
    -feat 1s_c_d_dd 1s_c_d_dd
    -featparams
    -fillprob 1e-8 1.000000e-08
    -frate 100 100
    -fsg /home/sylvain/resources/loop.fsg
    -fsgusealtpron yes yes
    -fsgusefiller yes yes
    -fwdflat yes yes
    -fwdflatbeam 1e-64 1.000000e-64
    -fwdflatefwid 4 4
    -fwdflatlw 8.5 8.500000e+00
    -fwdflatsfwin 25 25
    -fwdflatwbeam 7e-29 7.000000e-29
    -fwdtree yes yes
    -hmm /home/sylvain/resources/am/hub4opensrc.cd_continuous_8gau/
    -infile
    -input_endian little little
    -jsgf
    -kdmaxbbi -1 -1
    -kdmaxdepth 0 0
    -kdtree
    -latsize 5000 5000
    -lda
    -ldadim 0 0
    -lextreedump 0 0
    -lifter 0 0
    -lm
    -lmctl
    -lmname default default
    -logbase 1.0001 1.000100e+00
    -logfn
    -logspec no no
    -lowerf 133.33334 1.333333e+02
    -lpbeam 1e-40 1.000000e-40
    -lponlybeam 7e-29 7.000000e-29
    -lw 6.5 6.500000e+00
    -maxhmmpf -1 -1
    -maxnewoov 20 20
    -maxwpf -1 -1
    -mdef
    -mean
    -mfclogdir
    -min_endfr 0 0
    -mixw
    -mixwfloor 0.0000001 1.000000e-07
    -mllr
    -mmap yes yes
    -ncep 13 13
    -nfft 512 512
    -nfilt 40 40
    -nwpen 1.0 1.000000e+00
    -pbeam 1e-48 1.000000e-48
    -pip 1.0 1.000000e+00
    -pl_beam 1e-10 1.000000e-10
    -pl_pbeam 1e-5 1.000000e-05
    -pl_window 0 0
    -rawlogdir
    -remove_dc no no
    -round_filters yes yes
    -samprate 16000 1.600000e+04
    -seed -1 -1
    -sendump
    -senlogdir
    -senmgau
    -silprob 0.005 5.000000e-03
    -smoothspec no no
    -svspec
    -time no no
    -tmat
    -tmatfloor 0.0001 1.000000e-04
    -topn 4 4
    -topn_beam 0 0
    -toprule
    -transform legacy legacy
    -unit_area yes yes
    -upperf 6855.4976 6.855498e+03
    -usewdphones no no
    -uw 1.0 1.000000e+00
    -var
    -varfloor 0.0001 1.000000e-04
    -varnorm no no
    -verbose no no
    -warp_params
    -warp_type inverse_linear inverse_linear
    -wbeam 7e-29 7.000000e-29
    -wip 0.65 6.500000e-01
    -wlen 0.025625 2.562500e-02

    INFO: cmd_ln.c(691): Parsing command line:
    \
    -alpha 0.97 \
    -dither yes \
    -doublebw no \
    -nfilt 31 \
    -ncep 13 \
    -lowerf 200 \
    -upperf 3500 \
    -nfft 256 \
    -wlen 0.025625 \
    -samprate 8000 \
    -transform legacy \
    -feat 1s_c_d_dd \
    -agc none \
    -cmn current \
    -varnorm no

    Current configuration:

    -agc none none
    -agcthresh 2.0 2.000000e+00
    -alpha 0.97 9.700000e-01
    -ceplen 13 13
    -cmn current current
    -cmninit 8.0 8.0
    -dither no yes
    -doublebw no no
    -feat 1s_c_d_dd 1s_c_d_dd
    -frate 100 100
    -input_endian little little
    -lda /home/sylvain/resources/am/hub4opensrc.cd_continuous_8gau//feature_transform
    -ldadim 0 0
    -lifter 0 0
    -logspec no no
    -lowerf 133.33334 2.000000e+02
    -ncep 13 13
    -nfft 512 256
    -nfilt 40 31
    -remove_dc no no
    -round_filters yes yes
    -samprate 16000 8.000000e+03
    -seed -1 -1
    -smoothspec no no
    -svspec
    -transform legacy legacy
    -unit_area yes yes
    -upperf 6855.4976 3.500000e+03
    -varnorm no no
    -verbose no no
    -warp_params
    -warp_type inverse_linear inverse_linear
    -wlen 0.025625 2.562500e-02

    INFO: acmod.c(246): Parsed model-specific feature parameters from
    /home/sylvain/resources/am/hub4opensrc.cd_continuous_8gau//feat.params
    INFO: fe_interface.c(289): You are using the internal mechanism to generate
    the seed.
    INFO: feat.c(684): Initializing feature stream to type: '1s_c_d_dd',
    ceplen=13, CMN='current', VARNORM='no', AGC='none'
    INFO: cmn.c(142): mean= 12.00, mean= 0.0
    INFO: acmod.c(157): Reading linear feature transformation from
    /home/sylvain/resources/am/hub4opensrc.cd_continuous_8gau//feature_transform
    INFO: mdef.c(520): Reading model definition:
    /home/sylvain/resources/am/hub4opensrc.cd_continuous_8gau//mdef
    INFO: bin_mdef.c(179): Allocating 104810 * 8 bytes (818 KiB) for CD tree
    INFO: tmat.c(205): Reading HMM transition probability matrices:
    /home/sylvain/resources/am/hub4opensrc.cd_continuous_8gau//transition_matrices
    INFO: acmod.c(121): Attempting to use SCHMM computation module
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
    /home/sylvain/resources/am/hub4opensrc.cd_continuous_8gau//means
    INFO: ms_gauden.c(292): 3120 codebook, 1 feature, size:
    INFO: ms_gauden.c(294): 16x29
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
    /home/sylvain/resources/am/hub4opensrc.cd_continuous_8gau//variances
    INFO: ms_gauden.c(292): 3120 codebook, 1 feature, size:
    INFO: ms_gauden.c(294): 16x29
    INFO: ms_gauden.c(354): 0 variance values floored
    INFO: acmod.c(123): Attempting to use PTHMM computation module
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
    /home/sylvain/resources/am/hub4opensrc.cd_continuous_8gau//means
    INFO: ms_gauden.c(292): 3120 codebook, 1 feature, size:
    INFO: ms_gauden.c(294): 16x29
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
    /home/sylvain/resources/am/hub4opensrc.cd_continuous_8gau//variances
    INFO: ms_gauden.c(292): 3120 codebook, 1 feature, size:
    INFO: ms_gauden.c(294): 16x29
    INFO: ms_gauden.c(354): 0 variance values floored
    INFO: ptm_mgau.c(800): Number of codebooks exceeds 256: 3120
    INFO: acmod.c(125): Falling back to general multi-stream GMM computation
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
    /home/sylvain/resources/am/hub4opensrc.cd_continuous_8gau//means
    INFO: ms_gauden.c(292): 3120 codebook, 1 feature, size:
    INFO: ms_gauden.c(294): 16x29
    INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
    /home/sylvain/resources/am/hub4opensrc.cd_continuous_8gau//variances
    INFO: ms_gauden.c(292): 3120 codebook, 1 feature, size:
    INFO: ms_gauden.c(294): 16x29
    INFO: ms_gauden.c(354): 0 variance values floored
    ERROR: "ms_mgau.c", line 112: Dimension of stream 29 does not match: 39 !=
    -1260765504

     
  • Nickolay V. Shmyrev

    Hello

    This bug has been fixed in sphinxbase trunk just now. Please update sphinxbase
    and pocketsphinx and recompile both.

    Thanks for the report.

     
  • Anonymous

    Anonymous - 2011-10-30

    Wow, it works, many thanks for the super-quick fix.

    Sylvain

     

Log in to post a comment.