When I try to try it with pocketsphinx this is what I get:
/beta#pocketsphinx_continuous-hmmmodel_parameters/beta.cd_cont_200-lmamdigits.lm-dictetc/beta.dicINFO:cmd_ln.c(691):Parsingcommandline:pocketsphinx_continuous\-hmmmodel_parameters/beta.cd_cont_200\-lmamdigits.lm\-dictetc/beta.dicCurrentconfiguration:[NAME][DEFLT][VALUE]-adcdev-agcnonenone-agcthresh2.02.000000e+00-alpha0.979.700000e-01-argfile-ascale20.02.000000e+01-aw11-backtracenono-beam1e-481.000000e-48-bestpathyesyes-bestpathlw9.59.500000e+00-bghistnono-ceplen1313-cmncurrentcurrent-cmninit8.08.0-compallsennono-debug0-dictetc/beta.dic-dictcasenono-dithernono-doublebwnono-ds11-fdict-feat1s_c_d_dd1s_c_d_dd-featparams-fillprob1e-81.000000e-08-frate100100-fsg-fsgusealtpronyesyes-fsgusefilleryesyes-fwdflatyesyes-fwdflatbeam1e-641.000000e-64-fwdflatefwid44-fwdflatlw8.58.500000e+00-fwdflatsfwin2525-fwdflatwbeam7e-297.000000e-29-fwdtreeyesyes-hmmmodel_parameters/beta.cd_cont_200-infile-input_endianlittlelittle-jsgf-kdmaxbbi-1-1-kdmaxdepth00-kdtree-latsize50005000-lda-ldadim00-lextreedump00-lifter00-lmamdigits.lm-lmctl-lmnamedefaultdefault-logbase1.00011.000100e+00-logfn-logspecnono-lowerf133.333341.333333e+02-lpbeam1e-401.000000e-40-lponlybeam7e-297.000000e-29-lw6.56.500000e+00-maxhmmpf-1-1-maxnewoov2020-maxwpf-1-1-mdef-mean-mfclogdir-min_endfr00-mixw-mixwfloor0.00000011.000000e-07-mllr-mmapyesyes-ncep1313-nfft512512-nfilt4040-nwpen1.01.000000e+00-pbeam1e-481.000000e-48-pip1.01.000000e+00-pl_beam1e-101.000000e-10-pl_pbeam1e-51.000000e-05-pl_window00-rawlogdir-remove_dcnono-round_filtersyesyes-samprate160001.600000e+04-seed-1-1-sendump-senlogdir-senmgau-silprob0.0055.000000e-03-smoothspecnono-svspec-timenono-tmat-tmatfloor0.00011.000000e-04-topn44-topn_beam00-toprule-transformlegacylegacy-unit_areayesyes-upperf6855.49766.855498e+03-usewdphonesnono-uw1.01.000000e+00-var-varfloor0.00011.000000e-04-varnormnono-verbosenono-warp_params-warp_typeinverse_linearinverse_linear-wbeam7e-297.000000e-29-wip0.656.500000e-01-wlen0.0256252.562500e-02INFO:cmd_ln.c(691):Parsingcommandline:\-nfilt40\-lowerf133.3334\-upperf6855.4976\-feat1s_c_d_dd\-agcnone\-cmncurrent\-varnormnoCurrentconfiguration:[NAME][DEFLT][VALUE]-agcnonenone-agcthresh2.02.000000e+00-alpha0.979.700000e-01-ceplen1313-cmncurrentcurrent-cmninit8.08.0-dithernono-doublebwnono-feat1s_c_d_dd1s_c_d_dd-frate100100-input_endianlittlelittle-lda-ldadim00-lifter00-logspecnono-lowerf133.333341.333334e+02-ncep1313-nfft512512-nfilt4040-remove_dcnono-round_filtersyesyes-samprate160001.600000e+04-seed-1-1-smoothspecnono-svspec-transformlegacylegacy-unit_areayesyes-upperf6855.49766.855498e+03-varnormnono-verbosenono-warp_params-warp_typeinverse_linearinverse_linear-wlen0.0256252.562500e-02INFO:acmod.c(246):Parsedmodel-specificfeatureparametersfrommodel_parameters/beta.cd_cont_200/feat.paramsINFO:feat.c(713):Initializingfeaturestreamtotype:'1s_c_d_dd',ceplen=13,CMN='current',VARNORM='no',AGC='none'INFO:cmn.c(142):mean[0]=12.00,mean[1..12]=0.0INFO:mdef.c(517):Readingmodeldefinition:model_parameters/beta.cd_cont_200/mdefINFO:bin_mdef.c(179):Allocating305*8bytes(2KiB)forCDtreeINFO:tmat.c(205):ReadingHMMtransitionprobabilitymatrices:model_parameters/beta.cd_cont_200/transition_matricesINFO:acmod.c(121):AttemptingtouseSCHMMcomputationmoduleINFO:ms_gauden.c(198):Readingmixturegaussianparameter:model_parameters/beta.cd_cont_200/meansINFO:ms_gauden.c(292):150codebook,1feature,size:INFO:ms_gauden.c(294):8x39INFO:ms_gauden.c(198):Readingmixturegaussianparameter:model_parameters/beta.cd_cont_200/variancesINFO:ms_gauden.c(292):150codebook,1feature,size:INFO:ms_gauden.c(294):8x39INFO:ms_gauden.c(354):5variancevaluesflooredINFO:acmod.c(123):AttemptingtousePTHMMcomputationmoduleINFO:ms_gauden.c(198):Readingmixturegaussianparameter:model_parameters/beta.cd_cont_200/meansINFO:ms_gauden.c(292):150codebook,1feature,size:INFO:ms_gauden.c(294):8x39INFO:ms_gauden.c(198):Readingmixturegaussianparameter:model_parameters/beta.cd_cont_200/variancesINFO:ms_gauden.c(292):150codebook,1feature,size:INFO:ms_gauden.c(294):8x39INFO:ms_gauden.c(354):5variancevaluesflooredINFO:ptm_mgau.c(804):Numberofcodebooksdoesn't match number of ciphones, doesn'tlooklikePTM:150!=17INFO:acmod.c(125):Fallingbacktogeneralmulti-streamGMMcomputationINFO:ms_gauden.c(198):Readingmixturegaussianparameter:model_parameters/beta.cd_cont_200/meansINFO:ms_gauden.c(292):150codebook,1feature,size:INFO:ms_gauden.c(294):8x39INFO:ms_gauden.c(198):Readingmixturegaussianparameter:model_parameters/beta.cd_cont_200/variancesINFO:ms_gauden.c(292):150codebook,1feature,size:INFO:ms_gauden.c(294):8x39INFO:ms_gauden.c(354):5variancevaluesflooredINFO:ms_senone.c(160):Readingsenonemixtureweights:model_parameters/beta.cd_cont_200/mixture_weightsINFO:ms_senone.c(211):Truncatingsenonelogs3(pdf)valuesby10bitsINFO:ms_senone.c(218):NottransposingmixtureweightsinmemoryINFO:ms_senone.c(277):Readmixtureweightsfor150senones:1featuresx8codewordsINFO:ms_senone.c(331):MappingsenonestoindividualcodebooksINFO:ms_mgau.c(141):Thevalueoftopn:4INFO:dict.c(317):Allocating4110*20bytes(80KiB)forwordentriesINFO:dict.c(332):Readingmaindictionary:etc/beta.dicINFO:dict.c(211):Allocated0KiBforstrings,0KiBforphonesINFO:dict.c(335):11wordsreadINFO:dict.c(341):Readingfillerdictionary:model_parameters/beta.cd_cont_200/noisedictINFO:dict.c(211):Allocated0KiBforstrings,0KiBforphonesINFO:dict.c(344):3wordsreadINFO:dict2pid.c(396):BuildingPIDtablesfordictionaryINFO:dict2pid.c(404):Allocating17^3*2bytes(9KiB)forword-initialtriphonesINFO:dict2pid.c(131):Allocated3536bytes(3KiB)forword-finaltriphonesINFO:dict2pid.c(195):Allocated3536bytes(3KiB)forsingle-phonewordtriphonesINFO:ngram_model_arpa.c(477):ngrams1=12,2=20,3=10INFO:ngram_model_arpa.c(135):ReadingunigramsINFO:ngram_model_arpa.c(516):12=#unigramscreatedINFO:ngram_model_arpa.c(195):ReadingbigramsINFO:ngram_model_arpa.c(533):20=#bigramscreatedINFO:ngram_model_arpa.c(534):3=#prob2entriesINFO:ngram_model_arpa.c(542):3=#bo_wt2entriesINFO:ngram_model_arpa.c(292):ReadingtrigramsINFO:ngram_model_arpa.c(555):10=#trigramscreatedINFO:ngram_model_arpa.c(556):2=#prob3entriesINFO:ngram_search_fwdtree.c(99):11uniqueinitialdiphonesINFO:ngram_search_fwdtree.c(147):0root,0non-rootchannels,4single-phonewordsINFO:ngram_search_fwdtree.c(186):CreatingsearchtreeINFO:ngram_search_fwdtree.c(191):before:0root,0non-rootchannels,4single-phonewordsINFO:ngram_search_fwdtree.c(326):after:maxnonrootchanincreasedto142INFO:ngram_search_fwdtree.c(338):after:11root,14non-rootchannels,3single-phonewordsINFO:ngram_search_fwdflat.c(156):fwdflat:min_ef_width=4,max_sf_win=25INFO:continuous.c(371):pocketsphinx_continuousCOMPILEDON:Feb282012,AT:10:20:20Warning:CouldnotfindMicelementREADY....Listening...Recordingisstopped,startrecordingwithad_start_recStoppedlistening,pleasewait...INFO:cmn_prior.c(121):cmn_prior_update:from<8.000.000.000.000.000.000.000.000.000.000.000.000.00>INFO:cmn_prior.c(139):cmn_prior_update:to<12.58-1.37-0.26-0.32-0.34-0.09-0.12-0.23-0.02-0.05-0.21-0.01-0.17>INFO:ngram_search_fwdtree.c(1549):582wordsrecognized(2/fr)INFO:ngram_search_fwdtree.c(1551):8751senonesevaluated(36/fr)INFO:ngram_search_fwdtree.c(1553):3514channelssearched(14/fr),26511st,589lastINFO:ngram_search_fwdtree.c(1557):589wordsforwhichlastchannelsevaluated(2/fr)INFO:ngram_search_fwdtree.c(1560):0candidatewordsforenteringlastphone(0/fr)INFO:ngram_search_fwdtree.c(1562):fwdtree0.04CPU0.018xRTINFO:ngram_search_fwdtree.c(1565):fwdtree2.82wall1.150xRTINFO:ngram_search_fwdflat.c(305):Utterancevocabularycontains1wordsINFO:ngram_search_fwdflat.c(940):283wordsrecognized(1/fr)INFO:ngram_search_fwdflat.c(942):732senonesevaluated(3/fr)INFO:ngram_search_fwdflat.c(944):411channelssearched(1/fr)INFO:ngram_search_fwdflat.c(946):411wordssearched(1/fr)INFO:ngram_search_fwdflat.c(948):26wordtransitions(0/fr)INFO:ngram_search_fwdflat.c(951):fwdflat0.00CPU0.002xRTINFO:ngram_search_fwdflat.c(954):fwdflat0.00wall0.001xRTINFO:ngram_search.c(1214):</s>notfoundinlastframe,using<sil>.243insteadINFO:ngram_search.c(1266):latticestartnode<s>.0endnode<sil>.236INFO:ngram_search.c(1294):Eliminated0nodesbeforeendnodeINFO:ngram_search.c(1399):Latticehas17nodes,20linksINFO:ps_lattice.c(1365):NormalizerP(O)=alpha(<sil>:236:243)=-193566INFO:ps_lattice.c(1403):JointP(O,S)=-194077P(S|O)=-511INFO:ngram_search.c(888):bestpath0.00CPU0.000xRTINFO:ngram_search.c(891):bestpath0.00wall0.000xRT000000000:READY....
And it keeps runing this without recognizing any word :s :s
did I miss something?!!!
Please answer me this times
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Hi,
I just finished training acoustic model using my data, those are the ruselts:
Insertions: 0 Deletions: 1 Substitutions: 0
TOTAL Words: 100 Correct: 34 Errors: 66
TOTAL Percent correct = 34.00% Error = 66.00% Accuracy = 34.00%
TOTAL Insertions: 0 Deletions: 61 Substitutions: 5
When I try to try it with pocketsphinx this is what I get:
And it keeps runing this without recognizing any word :s :s
did I miss something?!!!
Please answer me this times
Most likely input audio had wrong format and features were not extracted
correctly. See tutorial for details
http://cmusphinx.sourceforge.net/wiki/tutorialam