Anonymous - 2003-02-28

I've trained a Sphinx2 acoustic model using 14 hours of speech data, and I've been testing it with a set of utterances that were held out of the training set. The results are OK to good.

Then I noticed that the script I'd been using to run Sphinx (which was adapted from scripts/sphinx-train) had -agcmax TRUE, whereas I'd run the SphinxTrain scripts with the default CFG_AGC = 'none'. Oops! The Sphinx docs on the CMU website stress in several places that the decoder's AGC setting must match the one used to train the acoustic model. So I ran a new trial in which I changed only -agcmax to FALSE, and I was surprised to find that the matchfn file is IDENTICAL to the earlier one -- even the reported scores are the same. I did verify from the log files that AGC_MAX was running in the first case and that no AGC was applied in the second.
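For anyone who wants to repeat the comparison, a check along these lines should be enough. It is only a sketch: the helper name same_matches and the file names test.match_agcmax and test.match_noagc are hypothetical stand-ins for the -matchfn outputs of the two runs.

```shell
#!/bin/sh
# Compare the match files written by two decoder runs.
# same_matches and the example file names below are hypothetical.
same_matches() {
    # cmp -s prints nothing and exits 0 only if the files are byte-identical
    if cmp -s "$1" "$2"; then
        echo "identical"
    else
        echo "different"
        # Show the first differing hypotheses or scores
        diff "$1" "$2" | head -n 20
    fi
}

# Example: same_matches test.match_agcmax test.match_noagc
```

Because each hypothesis line also carries its score, a byte-for-byte match means the scores agree as well as the words.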

So what could cause IDENTICAL results? Was AGC_MAX actually doing nothing in the first case? Here are the Sphinx parameters:

sphinx2-continuous -verbose 9 \
    -adcin FALSE \
    -cepext feat \
    -ctlfn ${CTLFILE} \
    -ctloffset 0 -ctlcount 100000000 \
    -datadir ${FEATS} \
    -agcmax TRUE \
    -langwt 6.5 \
    -fwdflatlw 8.5 \
    -rescorelw 9.5 \
    -ugwt 0.5 \
    -fillpen 1e-10 \
    -silpen 0.005 \
    -inspen 0.65 \
    -top 1 \
    -topsenfrm 3 \
    -topsenthresh -70000 \
    -beam 2e-06 \
    -npbeam 2e-06 \
    -lpbeam 2e-05 \
    -lponlybeam 0.0005 \
    -nwbeam 0.0005 \
    -fwdflat FALSE \
    -fwdflatbeam 1e-08 \
    -fwdflatnwbeam 0.0003 \
    -bestpath FALSE \
    -kbdumpdir ${LMDIR} \
    -lmfn ${LMDIR}/7568.lm \
    -dictfn $youth/etc/youth.dic \
    -noisedict $youth/etc/youth.filler \
    -phnfn ${HMM}/phone -mapfn ${HMM}/map -hmmdir ${HMM} -hmmdirlist ${HMM} \
    -8bsen TRUE -sendumpfn ${HMM}/sendump \
    -cbdir ${HMM} \
    -matchfn $youth/test/test.match_7000 \
    -logfn $youth/test/test.log