I am using pocketsphinx with the voxforge acoustic model (available on
voxforge.org), and it was working fine with the 0.7 release. When I try to
load the model with the latest revision of sphinxbase/pocketsphinx on svn, I
get an error and pocketsphinx wont initialize. Here is what I get:
INFO: acmod.c(246): Parsed model-specific feature parameters from
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//feat.params
INFO: fe_interface.c(289): You are using the internal mechanism to generate
the seed.
INFO: feat.c(684): Initializing feature stream to type: '1s_c_d_dd',
ceplen=13, CMN='current', VARNORM='no', AGC='none'
INFO: cmn.c(142): mean= 12.00, mean= 0.0
INFO: acmod.c(157): Reading linear feature transformation from
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//feature_transform
INFO: mdef.c(520): Reading model definition:
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//mdef
INFO: bin_mdef.c(179): Allocating 104810 * 8 bytes (818 KiB) for CD tree
INFO: tmat.c(205): Reading HMM transition probability matrices: /home/sylvain/
resources/am/voxforge_en_sphinx.cd_cont_5000//transition_matrices
INFO: acmod.c(121): Attempting to use SCHMM computation module
INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//means
INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 16x29
INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//variances
INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 16x29
INFO: ms_gauden.c(354): 175 variance values floored
INFO: acmod.c(123): Attempting to use PTHMM computation module
INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//means
INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 16x29
INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//variances
INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 16x29
INFO: ms_gauden.c(354): 175 variance values floored
INFO: ptm_mgau.c(800): Number of codebooks exceeds 256: 5120
INFO: acmod.c(125): Falling back to general multi-stream GMM computation
INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//means
INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 16x29
INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//variances
INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 16x29
INFO: ms_gauden.c(354): 175 variance values floored
ERROR: "ms_mgau.c", line 112: Dimension of stream 29 does not match: 39 !=
-1261814208
Is there a problem with the voxforge model? Is there a way to fix it?
Many thanks in advance,
Sylvain
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Please provide the full log including the command line parameters
Please provide the information about your operating system, installation
details, hardware
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Anonymous
-
2011-10-30
Thanks for the quick reply.
I am using Linux, Windows, and Mac OSX and see the same thing on all 3 OSes
(it was working well on all 3 OSes with 0.7 release, using the same acoustic
model). What I do not understand is that the model is a continuous model,
features are 1s_c_d_dd, and it says "Falling back to general multi-stream GMM
computation", but I thought 1s_c_d_dd means that there is only one stream.
Here is the complete output, including the command line parameters:
Hello,
I am using pocketsphinx with the voxforge acoustic model (available on
voxforge.org), and it was working fine with the 0.7 release. When I try to
load the model with the latest revision of sphinxbase/pocketsphinx on svn, I
get an error and pocketsphinx wont initialize. Here is what I get:
INFO: acmod.c(246): Parsed model-specific feature parameters from
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//feat.params
INFO: fe_interface.c(289): You are using the internal mechanism to generate
the seed.
INFO: feat.c(684): Initializing feature stream to type: '1s_c_d_dd',
ceplen=13, CMN='current', VARNORM='no', AGC='none'
INFO: cmn.c(142): mean= 12.00, mean= 0.0
INFO: acmod.c(157): Reading linear feature transformation from
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//feature_transform
INFO: mdef.c(520): Reading model definition:
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//mdef
INFO: bin_mdef.c(179): Allocating 104810 * 8 bytes (818 KiB) for CD tree
INFO: tmat.c(205): Reading HMM transition probability matrices: /home/sylvain/
resources/am/voxforge_en_sphinx.cd_cont_5000//transition_matrices
INFO: acmod.c(121): Attempting to use SCHMM computation module
INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//means
INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 16x29
INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//variances
INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 16x29
INFO: ms_gauden.c(354): 175 variance values floored
INFO: acmod.c(123): Attempting to use PTHMM computation module
INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//means
INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 16x29
INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//variances
INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 16x29
INFO: ms_gauden.c(354): 175 variance values floored
INFO: ptm_mgau.c(800): Number of codebooks exceeds 256: 5120
INFO: acmod.c(125): Falling back to general multi-stream GMM computation
INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//means
INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 16x29
INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//variances
INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 16x29
INFO: ms_gauden.c(354): 175 variance values floored
ERROR: "ms_mgau.c", line 112: Dimension of stream 29 does not match: 39 !=
-1261814208
Is there a problem with the voxforge model? Is there a way to fix it?
Many thanks in advance,
Sylvain
Please provide the full log including the command line parameters
Please provide the information about your operating system, installation
details, hardware
Thanks for the quick reply.
I am using Linux, Windows, and Mac OSX and see the same thing on all 3 OSes
(it was working well on all 3 OSes with 0.7 release, using the same acoustic
model). What I do not understand is that the model is a continuous model,
features are 1s_c_d_dd, and it says "Falling back to general multi-stream GMM
computation", but I thought 1s_c_d_dd means that there is only one stream.
Here is the complete output, including the command line parameters:
INFO: cmd_ln.c(691): Parsing command line:
\
-dict /home/sylvain/resources/loop.dic \
-fsg /home/sylvain/resources/loop.fsg \
-hmm /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000/ \
-bestpath no
Current configuration:
-adcdev
-agc none none
-agcthresh 2.0 2.000000e+00
-alpha 0.97 9.700000e-01
-argfile
-ascale 20.0 2.000000e+01
-aw 1 1
-backtrace no no
-beam 1e-48 1.000000e-48
-bestpath yes no
-bestpathlw 9.5 9.500000e+00
-bghist no no
-ceplen 13 13
-cmn current current
-cmninit 8.0 8.0
-compallsen no no
-debug 0
-dict /home/sylvain/resources/loop.dic
-dictcase no no
-dither no no
-doublebw no no
-ds 1 1
-fdict
-feat 1s_c_d_dd 1s_c_d_dd
-featparams
-fillprob 1e-8 1.000000e-08
-frate 100 100
-fsg /home/sylvain/resources/loop.fsg
-fsgusealtpron yes yes
-fsgusefiller yes yes
-fwdflat yes yes
-fwdflatbeam 1e-64 1.000000e-64
-fwdflatefwid 4 4
-fwdflatlw 8.5 8.500000e+00
-fwdflatsfwin 25 25
-fwdflatwbeam 7e-29 7.000000e-29
-fwdtree yes yes
-hmm /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000/
-infile
-input_endian little little
-jsgf
-kdmaxbbi -1 -1
-kdmaxdepth 0 0
-kdtree
-latsize 5000 5000
-lda
-ldadim 0 0
-lextreedump 0 0
-lifter 0 0
-lm
-lmctl
-lmname default default
-logbase 1.0001 1.000100e+00
-logfn
-logspec no no
-lowerf 133.33334 1.333333e+02
-lpbeam 1e-40 1.000000e-40
-lponlybeam 7e-29 7.000000e-29
-lw 6.5 6.500000e+00
-maxhmmpf -1 -1
-maxnewoov 20 20
-maxwpf -1 -1
-mdef
-mean
-mfclogdir
-min_endfr 0 0
-mixw
-mixwfloor 0.0000001 1.000000e-07
-mllr
-mmap yes yes
-ncep 13 13
-nfft 512 512
-nfilt 40 40
-nwpen 1.0 1.000000e+00
-pbeam 1e-48 1.000000e-48
-pip 1.0 1.000000e+00
-pl_beam 1e-10 1.000000e-10
-pl_pbeam 1e-5 1.000000e-05
-pl_window 0 0
-rawlogdir
-remove_dc no no
-round_filters yes yes
-samprate 16000 1.600000e+04
-seed -1 -1
-sendump
-senlogdir
-senmgau
-silprob 0.005 5.000000e-03
-smoothspec no no
-svspec
-time no no
-tmat
-tmatfloor 0.0001 1.000000e-04
-topn 4 4
-topn_beam 0 0
-toprule
-transform legacy legacy
-unit_area yes yes
-upperf 6855.4976 6.855498e+03
-usewdphones no no
-uw 1.0 1.000000e+00
-var
-varfloor 0.0001 1.000000e-04
-varnorm no no
-verbose no no
-warp_params
-warp_type inverse_linear inverse_linear
-wbeam 7e-29 7.000000e-29
-wip 0.65 6.500000e-01
-wlen 0.025625 2.562500e-02
INFO: cmd_ln.c(691): Parsing command line:
\
-alpha 0.97 \
-dither yes \
-doublebw no \
-nfilt 40 \
-ncep 13 \
-lowerf 133.333334 \
-upperf 6855.4976 \
-nfft 512 \
-wlen 0.025625 \
-transform legacy \
-feat 1s_c_d_dd \
-agc none \
-cmn current \
-varnorm no
Current configuration:
-agc none none
-agcthresh 2.0 2.000000e+00
-alpha 0.97 9.700000e-01
-ceplen 13 13
-cmn current current
-cmninit 8.0 8.0
-dither no yes
-doublebw no no
-feat 1s_c_d_dd 1s_c_d_dd
-frate 100 100
-input_endian little little
-lda /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//feature_transform
-ldadim 0 0
-lifter 0 0
-logspec no no
-lowerf 133.33334 1.333333e+02
-ncep 13 13
-nfft 512 512
-nfilt 40 40
-remove_dc no no
-round_filters yes yes
-samprate 16000 1.600000e+04
-seed -1 -1
-smoothspec no no
-svspec
-transform legacy legacy
-unit_area yes yes
-upperf 6855.4976 6.855498e+03
-varnorm no no
-verbose no no
-warp_params
-warp_type inverse_linear inverse_linear
-wlen 0.025625 2.562500e-02
INFO: acmod.c(246): Parsed model-specific feature parameters from
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//feat.params
INFO: fe_interface.c(289): You are using the internal mechanism to generate
the seed.
INFO: feat.c(684): Initializing feature stream to type: '1s_c_d_dd',
ceplen=13, CMN='current', VARNORM='no', AGC='none'
INFO: cmn.c(142): mean= 12.00, mean= 0.0
INFO: acmod.c(157): Reading linear feature transformation from
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//feature_transform
INFO: mdef.c(520): Reading model definition:
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//mdef
INFO: bin_mdef.c(179): Allocating 104810 * 8 bytes (818 KiB) for CD tree
INFO: tmat.c(205): Reading HMM transition probability matrices: /home/sylvain/
resources/am/voxforge_en_sphinx.cd_cont_5000//transition_matrices
INFO: acmod.c(121): Attempting to use SCHMM computation module
INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//means
INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 16x29
INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//variances
INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 16x29
INFO: ms_gauden.c(354): 175 variance values floored
INFO: acmod.c(123): Attempting to use PTHMM computation module
INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//means
INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 16x29
INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//variances
INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 16x29
INFO: ms_gauden.c(354): 175 variance values floored
INFO: ptm_mgau.c(800): Number of codebooks exceeds 256: 5120
INFO: acmod.c(125): Falling back to general multi-stream GMM computation
INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//means
INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 16x29
INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//variances
INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 16x29
INFO: ms_gauden.c(354): 175 variance values floored
ERROR: "ms_mgau.c", line 113: Dimension of stream 29 does not match: 39 !=
-1260765632
Many thanks again,
Sylvain
Here is the output when I use the 0.7 release, initialization succeeds and
recognition works:
INFO: cmd_ln.c(691): Parsing command line:
\
-dict /home/sylvain/resources/loop.dic \
-fsg /home/sylvain/resources/loop.fsg \
-hmm /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000/ \
-bestpath no
Current configuration:
-adcdev
-agc none none
-agcthresh 2.0 2.000000e+00
-alpha 0.97 9.700000e-01
-argfile
-ascale 20.0 2.000000e+01
-aw 1 1
-backtrace no no
-beam 1e-48 1.000000e-48
-bestpath yes no
-bestpathlw 9.5 9.500000e+00
-bghist no no
-ceplen 13 13
-cmn current current
-cmninit 8.0 8.0
-compallsen no no
-debug 0
-dict /home/sylvain/resources/loop.dic
-dictcase no no
-dither no no
-doublebw no no
-ds 1 1
-fdict
-feat 1s_c_d_dd 1s_c_d_dd
-featparams
-fillprob 1e-8 1.000000e-08
-frate 100 100
-fsg /home/sylvain/resources/loop.fsg
-fsgusealtpron yes yes
-fsgusefiller yes yes
-fwdflat yes yes
-fwdflatbeam 1e-64 1.000000e-64
-fwdflatefwid 4 4
-fwdflatlw 8.5 8.500000e+00
-fwdflatsfwin 25 25
-fwdflatwbeam 7e-29 7.000000e-29
-fwdtree yes yes
-hmm /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000/
-infile
-input_endian little little
-jsgf
-kdmaxbbi -1 -1
-kdmaxdepth 0 0
-kdtree
-latsize 5000 5000
-lda
-ldadim 0 0
-lextreedump 0 0
-lifter 0 0
-lm
-lmctl
-lmname default default
-logbase 1.0001 1.000100e+00
-logfn
-logspec no no
-lowerf 133.33334 1.333333e+02
-lpbeam 1e-40 1.000000e-40
-lponlybeam 7e-29 7.000000e-29
-lw 6.5 6.500000e+00
-maxhmmpf -1 -1
-maxnewoov 20 20
-maxwpf -1 -1
-mdef
-mean
-mfclogdir
-min_endfr 0 0
-mixw
-mixwfloor 0.0000001 1.000000e-07
-mllr
-mmap yes yes
-ncep 13 13
-nfft 512 512
-nfilt 40 40
-nwpen 1.0 1.000000e+00
-pbeam 1e-48 1.000000e-48
-pip 1.0 1.000000e+00
-pl_beam 1e-10 1.000000e-10
-pl_pbeam 1e-5 1.000000e-05
-pl_window 0 0
-rawlogdir
-remove_dc no no
-round_filters yes yes
-samprate 16000 1.600000e+04
-seed -1 -1
-sendump
-senlogdir
-senmgau
-silprob 0.005 5.000000e-03
-smoothspec no no
-svspec
-time no no
-tmat
-tmatfloor 0.0001 1.000000e-04
-topn 4 4
-topn_beam 0 0
-toprule
-transform legacy legacy
-unit_area yes yes
-upperf 6855.4976 6.855498e+03
-usewdphones no no
-uw 1.0 1.000000e+00
-var
-varfloor 0.0001 1.000000e-04
-varnorm no no
-verbose no no
-warp_params
-warp_type inverse_linear inverse_linear
-wbeam 7e-29 7.000000e-29
-wip 0.65 6.500000e-01
-wlen 0.025625 2.562500e-02
INFO: cmd_ln.c(691): Parsing command line:
\
-alpha 0.97 \
-dither yes \
-doublebw no \
-nfilt 40 \
-ncep 13 \
-transform legacy \
-lowerf 133.333334 \
-upperf 6855.4976 \
-nfft 512 \
-wlen 0.025625 \
-feat 1s_c_d_dd \
-agc none \
-cmn current \
-varnorm no
Current configuration:
-agc none none
-agcthresh 2.0 2.000000e+00
-alpha 0.97 9.700000e-01
-ceplen 13 13
-cmn current current
-cmninit 8.0 8.0
-dither no yes
-doublebw no no
-feat 1s_c_d_dd 1s_c_d_dd
-frate 100 100
-input_endian little little
-lda /home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//feature_transform
-ldadim 0 0
-lifter 0 0
-logspec no no
-lowerf 133.33334 1.333333e+02
-ncep 13 13
-nfft 512 512
-nfilt 40 40
-remove_dc no no
-round_filters yes yes
-samprate 16000 1.600000e+04
-seed -1 -1
-smoothspec no no
-svspec
-transform legacy legacy
-unit_area yes yes
-upperf 6855.4976 6.855498e+03
-varnorm no no
-verbose no no
-warp_params
-warp_type inverse_linear inverse_linear
-wlen 0.025625 2.562500e-02
INFO: acmod.c(242): Parsed model-specific feature parameters from
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//feat.params
INFO: fe_interface.c(289): You are using the internal mechanism to generate
the seed.
INFO: feat.c(684): Initializing feature stream to type: '1s_c_d_dd',
ceplen=13, CMN='current', VARNORM='no', AGC='none'
INFO: cmn.c(142): mean= 12.00, mean= 0.0
INFO: acmod.c(153): Reading linear feature transformation from
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//feature_transform
INFO: mdef.c(520): Reading model definition:
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//mdef
INFO: bin_mdef.c(173): Allocating 104810 * 8 bytes (818 KiB) for CD tree
INFO: tmat.c(205): Reading HMM transition probability matrices: /home/sylvain/
resources/am/voxforge_en_sphinx.cd_cont_5000//transition_matrices
INFO: acmod.c(117): Attempting to use SCHMM computation module
INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//means
INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 16x29
INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//variances
INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 16x29
INFO: ms_gauden.c(354): 175 variance values floored
INFO: acmod.c(119): Attempting to use PTHMM computation module
INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//means
INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 16x29
INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//variances
INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 16x29
INFO: ms_gauden.c(354): 175 variance values floored
INFO: ptm_mgau.c(800): Number of codebooks exceeds 256: 5120
INFO: acmod.c(121): Falling back to general multi-stream GMM computation
INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//means
INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 16x29
INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//variances
INFO: ms_gauden.c(292): 5120 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 16x29
INFO: ms_gauden.c(354): 175 variance values floored
INFO: ms_senone.c(160): Reading senone mixture weights:
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//mixture_weights
INFO: ms_senone.c(211): Truncating senone logs3(pdf) values by 10 bits
INFO: ms_senone.c(218): Not transposing mixture weights in memory
INFO: ms_senone.c(277): Read mixture weights for 5120 senones: 1 features x 16
codewords
INFO: ms_senone.c(331): Mapping senones to individual codebooks
INFO: ms_mgau.c(122): The value of topn: 4
INFO: dict.c(306): Allocating 4139 * 20 bytes (80 KiB) for word entries
INFO: dict.c(321): Reading main dictionary: /home/sylvain/resources/loop.dic
INFO: dict.c(212): Allocated 0 KiB for strings, 0 KiB for phones
INFO: dict.c(324): 40 words read
INFO: dict.c(330): Reading filler dictionary:
/home/sylvain/resources/am/voxforge_en_sphinx.cd_cont_5000//noisedict
INFO: dict.c(212): Allocated 0 KiB for strings, 0 KiB for phones
INFO: dict.c(333): 3 words read
INFO: dict2pid.c(396): Building PID tables for dictionary
INFO: dict2pid.c(404): Allocating 40^3 * 2 bytes (125 KiB) for word-initial
triphones
INFO: dict2pid.c(131): Allocated 19360 bytes (18 KiB) for word-final triphones
INFO: dict2pid.c(195): Allocated 19360 bytes (18 KiB) for single-phone word
triphones
INFO: fsg_search.c(145): FSG(beam: -1080, pbeam: -1080, wbeam: -634; wip: -26,
pip: 0)
INFO: fsg_model.c(678): FSG: 1 states, 40 unique words, 40 transitions (0
null)
INFO: fsg_model.c(213): Computing transitive closure for null transitions
INFO: fsg_model.c(264): 0 null transitions added
INFO: fsg_model.c(411): Adding silence transitions for <sil> to FSG
INFO: fsg_model.c(431): Added 1 silence word transitions
INFO: fsg_search.c(364): Added 0 alternate word transitions
INFO: fsg_lextree.c(108): Allocated 82 bytes (0 KiB) for left and right
context phones
INFO: fsg_lextree.c(251): 41 HMM nodes in lextree (41 leaves)
INFO: fsg_lextree.c(253): Allocated 4428 bytes (4 KiB) for all lextree nodes
INFO: fsg_lextree.c(256): Allocated 4428 bytes (4 KiB) for lextree leafnodes
INFO: fsg_lextree.c(108): Allocated 82 bytes (0 KiB) for left and right
context phones
INFO: fsg_lextree.c(251): 41 HMM nodes in lextree (41 leaves)
INFO: fsg_lextree.c(253): Allocated 4428 bytes (4 KiB) for all lextree nodes
INFO: fsg_lextree.c(256): Allocated 4428 bytes (4 KiB) for lextree leafnodes </sil>
I tried a few more acoustic models, each time only changing what -hmm points
to.
These ones won't load, with the same error:
. voxforge on voxforge.org
. voxforge-en-0.4 on cmusphinx.org
. voxforge-en-0.3 on cmusphinx.org
This one loads successfully:
. hub4opensrc.cd_continuous_8gau
Here is the output when loading hub4 model:
INFO: cmd_ln.c(691): Parsing command line:
\
-dict /home/sylvain/resources/loop.dic \
-fsg /home/sylvain/resources/loop.fsg \
-hmm /home/sylvain/resources/am/hub4opensrc.cd_continuous_8gau/ \
-bestpath no
Current configuration:
-adcdev
-agc none none
-agcthresh 2.0 2.000000e+00
-alpha 0.97 9.700000e-01
-argfile
-ascale 20.0 2.000000e+01
-aw 1 1
-backtrace no no
-beam 1e-48 1.000000e-48
-bestpath yes no
-bestpathlw 9.5 9.500000e+00
-bghist no no
-ceplen 13 13
-cmn current current
-cmninit 8.0 8.0
-compallsen no no
-debug 0
-dict /home/sylvain/resources/loop.dic
-dictcase no no
-dither no no
-doublebw no no
-ds 1 1
-fdict
-feat 1s_c_d_dd 1s_c_d_dd
-featparams
-fillprob 1e-8 1.000000e-08
-frate 100 100
-fsg /home/sylvain/resources/loop.fsg
-fsgusealtpron yes yes
-fsgusefiller yes yes
-fwdflat yes yes
-fwdflatbeam 1e-64 1.000000e-64
-fwdflatefwid 4 4
-fwdflatlw 8.5 8.500000e+00
-fwdflatsfwin 25 25
-fwdflatwbeam 7e-29 7.000000e-29
-fwdtree yes yes
-hmm /home/sylvain/resources/am/hub4opensrc.cd_continuous_8gau/
-infile
-input_endian little little
-jsgf
-kdmaxbbi -1 -1
-kdmaxdepth 0 0
-kdtree
-latsize 5000 5000
-lda
-ldadim 0 0
-lextreedump 0 0
-lifter 0 0
-lm
-lmctl
-lmname default default
-logbase 1.0001 1.000100e+00
-logfn
-logspec no no
-lowerf 133.33334 1.333333e+02
-lpbeam 1e-40 1.000000e-40
-lponlybeam 7e-29 7.000000e-29
-lw 6.5 6.500000e+00
-maxhmmpf -1 -1
-maxnewoov 20 20
-maxwpf -1 -1
-mdef
-mean
-mfclogdir
-min_endfr 0 0
-mixw
-mixwfloor 0.0000001 1.000000e-07
-mllr
-mmap yes yes
-ncep 13 13
-nfft 512 512
-nfilt 40 40
-nwpen 1.0 1.000000e+00
-pbeam 1e-48 1.000000e-48
-pip 1.0 1.000000e+00
-pl_beam 1e-10 1.000000e-10
-pl_pbeam 1e-5 1.000000e-05
-pl_window 0 0
-rawlogdir
-remove_dc no no
-round_filters yes yes
-samprate 16000 1.600000e+04
-seed -1 -1
-sendump
-senlogdir
-senmgau
-silprob 0.005 5.000000e-03
-smoothspec no no
-svspec
-time no no
-tmat
-tmatfloor 0.0001 1.000000e-04
-topn 4 4
-topn_beam 0 0
-toprule
-transform legacy legacy
-unit_area yes yes
-upperf 6855.4976 6.855498e+03
-usewdphones no no
-uw 1.0 1.000000e+00
-var
-varfloor 0.0001 1.000000e-04
-varnorm no no
-verbose no no
-warp_params
-warp_type inverse_linear inverse_linear
-wbeam 7e-29 7.000000e-29
-wip 0.65 6.500000e-01
-wlen 0.025625 2.562500e-02
INFO: cmd_ln.c(691): Parsing command line:
\
-alpha 0.97 \
-dither yes \
-doublebw no \
-nfilt 31 \
-ncep 13 \
-lowerf 200 \
-upperf 3500 \
-nfft 256 \
-wlen 0.025625 \
-samprate 8000 \
-transform legacy \
-feat 1s_c_d_dd \
-agc none \
-cmn current \
-varnorm no
Current configuration:
-agc none none
-agcthresh 2.0 2.000000e+00
-alpha 0.97 9.700000e-01
-ceplen 13 13
-cmn current current
-cmninit 8.0 8.0
-dither no yes
-doublebw no no
-feat 1s_c_d_dd 1s_c_d_dd
-frate 100 100
-input_endian little little
-lda /home/sylvain/resources/am/hub4opensrc.cd_continuous_8gau//feature_transform
-ldadim 0 0
-lifter 0 0
-logspec no no
-lowerf 133.33334 2.000000e+02
-ncep 13 13
-nfft 512 256
-nfilt 40 31
-remove_dc no no
-round_filters yes yes
-samprate 16000 8.000000e+03
-seed -1 -1
-smoothspec no no
-svspec
-transform legacy legacy
-unit_area yes yes
-upperf 6855.4976 3.500000e+03
-varnorm no no
-verbose no no
-warp_params
-warp_type inverse_linear inverse_linear
-wlen 0.025625 2.562500e-02
INFO: acmod.c(246): Parsed model-specific feature parameters from
/home/sylvain/resources/am/hub4opensrc.cd_continuous_8gau//feat.params
INFO: fe_interface.c(289): You are using the internal mechanism to generate
the seed.
INFO: feat.c(684): Initializing feature stream to type: '1s_c_d_dd',
ceplen=13, CMN='current', VARNORM='no', AGC='none'
INFO: cmn.c(142): mean= 12.00, mean= 0.0
INFO: acmod.c(157): Reading linear feature transformation from
/home/sylvain/resources/am/hub4opensrc.cd_continuous_8gau//feature_transform
INFO: mdef.c(520): Reading model definition:
/home/sylvain/resources/am/hub4opensrc.cd_continuous_8gau//mdef
INFO: bin_mdef.c(179): Allocating 104810 * 8 bytes (818 KiB) for CD tree
INFO: tmat.c(205): Reading HMM transition probability matrices:
/home/sylvain/resources/am/hub4opensrc.cd_continuous_8gau//transition_matrices
INFO: acmod.c(121): Attempting to use SCHMM computation module
INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
/home/sylvain/resources/am/hub4opensrc.cd_continuous_8gau//means
INFO: ms_gauden.c(292): 3120 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 16x29
INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
/home/sylvain/resources/am/hub4opensrc.cd_continuous_8gau//variances
INFO: ms_gauden.c(292): 3120 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 16x29
INFO: ms_gauden.c(354): 0 variance values floored
INFO: acmod.c(123): Attempting to use PTHMM computation module
INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
/home/sylvain/resources/am/hub4opensrc.cd_continuous_8gau//means
INFO: ms_gauden.c(292): 3120 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 16x29
INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
/home/sylvain/resources/am/hub4opensrc.cd_continuous_8gau//variances
INFO: ms_gauden.c(292): 3120 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 16x29
INFO: ms_gauden.c(354): 0 variance values floored
INFO: ptm_mgau.c(800): Number of codebooks exceeds 256: 3120
INFO: acmod.c(125): Falling back to general multi-stream GMM computation
INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
/home/sylvain/resources/am/hub4opensrc.cd_continuous_8gau//means
INFO: ms_gauden.c(292): 3120 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 16x29
INFO: ms_gauden.c(198): Reading mixture gaussian parameter:
/home/sylvain/resources/am/hub4opensrc.cd_continuous_8gau//variances
INFO: ms_gauden.c(292): 3120 codebook, 1 feature, size:
INFO: ms_gauden.c(294): 16x29
INFO: ms_gauden.c(354): 0 variance values floored
ERROR: "ms_mgau.c", line 112: Dimension of stream 29 does not match: 39 !=
-1260765504
Hello
This bug has been fixed in sphinxbase trunk just now. Please update sphinxbase
and pocketsphinx and recompile both.
Thanks for the report.
Wow, it works, many thanks for the super-quick fix.
Sylvain