INFO:cmd_ln.c(691):Parsingcommandline:gst-pocketsphinx\-samprate8000\-cmnprior\-nfft256\-fwdflatno\-bestpathno\-maxhmmpf1000\-maxwpf10Currentconfiguration:[NAME][DEFLT][VALUE]-agcnonenone-agcthresh2.02.000000e+00-alpha0.979.700000e-01-ascale20.02.000000e+01-aw11-backtracenono-beam1e-481.000000e-48-bestpathyesno-bestpathlw9.59.500000e+00-bghistnono-ceplen1313-cmncurrentprior-cmninit8.08.0-compallsennono-debug0-dict-dictcasenono-dithernono-doublebwnono-ds11-fdict-feat1s_c_d_dd1s_c_d_dd-featparams-fillprob1e-81.000000e-08-frate100100-fsg-fsgusealtpronyesyes-fsgusefilleryesyes-fwdflatyesno-fwdflatbeam1e-641.000000e-64-fwdflatefwid44-fwdflatlw8.58.500000e+00-fwdflatsfwin2525-fwdflatwbeam7e-297.000000e-29-fwdtreeyesyes-hmm-input_endianlittlelittle-jsgf-kdmaxbbi-1-1-kdmaxdepth00-kdtree-latsize50005000-lda-ldadim00-lextreedump00-lifter00-lm-lmctl-lmnamedefaultdefault-logbase1.00011.000100e+00-logfn-logspecnono-lowerf133.333341.333333e+02-lpbeam1e-401.000000e-40-lponlybeam7e-297.000000e-29-lw6.56.500000e+00-maxhmmpf-11000-maxnewoov2020-maxwpf-110-mdef-mean-mfclogdir-min_endfr00-mixw-mixwfloor0.00000011.000000e-07-mllr-mmapyesyes-ncep1313-nfft512256-nfilt4040-nwpen1.01.000000e+00-pbeam1e-481.000000e-48-pip1.01.000000e+00-pl_beam1e-101.000000e-10-pl_pbeam1e-51.000000e-05-pl_window00-rawlogdir-remove_dcnono-round_filtersyesyes-samprate160008.000000e+03-seed-1-1-sendump-senlogdir-senmgau-silprob0.0055.000000e-03-smoothspecnono-svspec-tmat-tmatfloor0.00011.000000e-04-topn44-topn_beam00-toprule-transformlegacylegacy-unit_areayesyes-upperf6855.49766.855498e+03-usewdphonesnono-uw1.01.000000e+00-var-varfloor0.00011.000000e-04-varnormnono-verbosenono-warp_params-warp_typeinverse_linearinverse_linear-wbeam7e-297.000000e-29-wip0.656.500000e-01-wlen0.0256252.562500e-02INFO:cmd_ln.c(691):Parsingcommandline:\-nfilt20\-lowerf1\-upperf4000\-wlen0.025\-transformdct\-round_filtersno\-remove_dcyes\-svspec0-12/13-25/26-38\-feat1s_c_d_dd\-agcnone\-cmncurrent\-cmninit56,-3,1\-varnormnoCurrentconfi
guration:[NAME][DEFLT][VALUE]-agcnonenone-agcthresh2.02.000000e+00-alpha0.979.700000e-01-ceplen1313-cmncurrentcurrent-cmninit8.056,-3,1-dithernono-doublebwnono-feat1s_c_d_dd1s_c_d_dd-frate100100-input_endianlittlelittle-lda-ldadim00-lifter00-logspecnono-lowerf133.333341.000000e+00-ncep1313-nfft512256-nfilt4020-remove_dcnoyes-round_filtersyesno-samprate160008.000000e+03-seed-1-1-smoothspecnono-svspec0-12/13-25/26-38-transformlegacydct-unit_areayesyes-upperf6855.49764.000000e+03-varnormnono-verbosenono-warp_params-warp_typeinverse_linearinverse_linear-wlen0.0256252.500000e-02INFO:acmod.c(242):Parsedmodel-specificfeatureparametersfrom/usr/local/share/pocketsphinx/model/hmm/en_US/hub4wsj_sc_8k/feat.paramsINFO:feat.c(684):Initializingfeaturestreamtotype:'1s_c_d_dd',ceplen=13,CMN='current',VARNORM='no',AGC='none'INFO:cmn.c(142):mean[0]=12.00,mean[1..12]=0.0INFO:acmod.c(163):Usingsubvectorspecification0-12/13-25/26-38INFO:mdef.c(520):Readingmodeldefinition:/usr/local/share/pocketsphinx/model/hmm/en_US/hub4wsj_sc_8k/mdefINFO:mdef.c(531):Foundbyte-ordermarkBMDF,assumingthisisabinarymdeffileINFO:bin_mdef.c(330):Readingbinarymodeldefinition:/usr/local/share/pocketsphinx/model/hmm/en_US/hub4wsj_sc_8k/mdefINFO:bin_mdef.c(507):50CI-phone,143047CD-phone,3emitstate/phone,150CI-sen,5150Sen,27135Sen-SeqINFO:tmat.c(205):ReadingHMMtransitionprobabilitymatrices:/usr/local/share/pocketsphinx/model/hmm/en_US/hub4wsj_sc_8k/transition_matricesINFO:acmod.c(117):AttemptingtouseSCHMMcomputationmoduleINFO:ms_gauden.c(198):Readingmixturegaussianparameter:/usr/local/share/pocketsphinx/model/hmm/en_US/hub4wsj_sc_8k/meansINFO:ms_gauden.c(292):1codebook,3feature,size:INFO:ms_gauden.c(294):256x13INFO:ms_gauden.c(294):256x13INFO:ms_gauden.c(294):256x13INFO:ms_gauden.c(198):Readingmixturegaussianparameter:/usr/local/share/pocketsphinx/model/hmm/en_US/hub4wsj_sc_8k/variancesINFO:ms_gauden.c(292):1codebook,3feature,size:INFO:ms_gauden.c(294):256x13INFO:ms_gauden.c(294):256x13INFO:ms_gauden.c(294):256x13I
NFO:ms_gauden.c(354):0variancevaluesflooredINFO:s2_semi_mgau.c(908):Loadingsenonesfromdumpfile/usr/local/share/pocketsphinx/model/hmm/en_US/hub4wsj_sc_8k/sendumpINFO:s2_semi_mgau.c(932):BEGINFILEFORMATDESCRIPTIONINFO:s2_semi_mgau.c(1027):Usingmemory-mappedI/OforsenonesINFO:s2_semi_mgau.c(1304):Maximumtop-N:4Top-Nbeams:000INFO:dict.c(306):Allocating137542*32bytes(4298KiB)forwordentriesINFO:dict.c(321):Readingmaindictionary:/usr/local/share/pocketsphinx/model/lm/en_US/cmu07a.dicINFO:dict.c(212):Allocated1010KiBforstrings,1664KiBforphonesINFO:dict.c(324):133436wordsreadINFO:dict.c(330):Readingfillerdictionary:/usr/local/share/pocketsphinx/model/hmm/en_US/hub4wsj_sc_8k/noisedictINFO:dict.c(212):Allocated0KiBforstrings,0KiBforphonesINFO:dict.c(333):11wordsreadINFO:dict2pid.c(396):BuildingPIDtablesfordictionaryINFO:dict2pid.c(404):Allocating50^3*2bytes(244KiB)forword-initialtriphonesINFO:dict2pid.c(131):Allocated60400bytes(58KiB)forword-finaltriphonesINFO:dict2pid.c(195):Allocated60400bytes(58KiB)forsingle-phonewordtriphonesINFO:ngram_model_arpa.c(77):No\data\markinLMfileINFO:ngram_model_dmp.c(142):Willusememory-mappedI/OforLMfileINFO:ngram_model_dmp.c(196):ngrams1=5001,2=436879,3=418286INFO:ngram_model_dmp.c(242):5001=LM.unigrams(+trailer)readINFO:ngram_model_dmp.c(291):436879=LM.bigrams(+trailer)readINFO:ngram_model_dmp.c(317):418286=LM.trigramsreadINFO:ngram_model_dmp.c(342):37293=LM.prob2entriesreadINFO:ngram_model_dmp.c(362):14370=LM.bo_wt2entriesreadINFO:ngram_model_dmp.c(382):36094=LM.prob3entriesreadINFO:ngram_model_dmp.c(410):854=LM.tseg_baseentriesreadINFO:ngram_model_dmp.c(466):5001=asciiwordstringsreadINFO:ngram_search_fwdtree.c(99):788uniqueinitialdiphonesINFO:ngram_search_fwdtree.c(147):0root,0non-rootchannels,60single-phonewordsINFO:ngram_search_fwdtree.c(186):CreatingsearchtreeINFO:ngram_search_fwdtree.c(191):before:0root,0non-rootchannels,60single-phonewordsINFO:ngram_search_fwdtree.c(326):after:maxnonrootchanincreasedto13428INFO:ngram_search_fwdtree.c(33
8):after:457root,13300non-rootchannels,26single-phonewordsINFO:ngram_search_fwdtree.c(430):TOTALfwdtree0.00CPU-nanxRTINFO:ngram_search_fwdtree.c(433):TOTALfwdtree0.00wall-nanxRTINFO:cmd_ln.c(691):Parsingcommandline:\-nfilt20\-lowerf1\-upperf4000\-wlen0.025\-transformdct\-round_filtersno\-remove_dcyes\-svspec0-12/13-25/26-38\-feat1s_c_d_dd\-agcnone\-cmncurrent\-cmninit56,-3,1\-varnormnoCurrentconfiguration:[NAME][DEFLT][VALUE]-agcnonenone-agcthresh2.02.000000e+00-alpha0.979.700000e-01-ceplen1313-cmncurrentcurrent-cmninit8.056,-3,1-dithernono-doublebwnono-feat1s_c_d_dd1s_c_d_dd-frate100100-input_endianlittlelittle-lda-ldadim00-lifter00-logspecnono-lowerf133.333341.000000e+00-ncep1313-nfft512256-nfilt4020-remove_dcnoyes-round_filtersyesno-samprate160008.000000e+03-seed-1-1-smoothspecnono-svspec0-12/13-25/26-38-transformlegacydct-unit_areayesyes-upperf6855.49764.000000e+03-varnormnono-verbosenono-warp_params-warp_typeinverse_linearinverse_linear-wlen0.0256252.500000e-02INFO:acmod.c(242):Parsedmodel-specificfeatureparametersfrom/usr/local/share/pocketsphinx/model/hmm/en_US/hub4wsj_sc_8k/feat.paramsINFO:feat.c(684):Initializingfeaturestreamtotype:'1s_c_d_dd',ceplen=13,CMN='current',VARNORM='no',AGC='none'INFO:cmn.c(142):mean[0]=12.00,mean[1..12]=0.0INFO:acmod.c(163):Usingsubvectorspecification0-12/13-25/26-38INFO:mdef.c(520):Readingmodeldefinition:/usr/local/share/pocketsphinx/model/hmm/en_US/hub4wsj_sc_8k/mdefINFO:mdef.c(531):Foundbyte-ordermarkBMDF,assumingthisisabinarymdeffileINFO:bin_mdef.c(330):Readingbinarymodeldefinition:/usr/local/share/pocketsphinx/model/hmm/en_US/hub4wsj_sc_8k/mdefINFO:bin_mdef.c(507):50CI-phone,143047CD-phone,3emitstate/phone,150CI-sen,5150Sen,27135Sen-SeqINFO:tmat.c(205):ReadingHMMtransitionprobabilitymatrices:/usr/local/share/pocketsphinx/model/hmm/en_US/hub4wsj_sc_8k/transition_matricesINFO:acmod.c(117):AttemptingtouseSCHMMcomputationmoduleINFO:ms_gauden.c(198):Readingmixturegaussianparameter:/usr/local/share/pocketsphinx/model/hmm/en_US/hub
4wsj_sc_8k/meansINFO:ms_gauden.c(292):1codebook,3feature,size:INFO:ms_gauden.c(294):256x13INFO:ms_gauden.c(294):256x13INFO:ms_gauden.c(294):256x13INFO:ms_gauden.c(198):Readingmixturegaussianparameter:/usr/local/share/pocketsphinx/model/hmm/en_US/hub4wsj_sc_8k/variancesINFO:ms_gauden.c(292):1codebook,3feature,size:INFO:ms_gauden.c(294):256x13INFO:ms_gauden.c(294):256x13INFO:ms_gauden.c(294):256x13INFO:ms_gauden.c(354):0variancevaluesflooredINFO:s2_semi_mgau.c(908):Loadingsenonesfromdumpfile/usr/local/share/pocketsphinx/model/hmm/en_US/hub4wsj_sc_8k/sendumpINFO:s2_semi_mgau.c(932):BEGINFILEFORMATDESCRIPTIONINFO:s2_semi_mgau.c(1027):Usingmemory-mappedI/OforsenonesINFO:s2_semi_mgau.c(1304):Maximumtop-N:4Top-Nbeams:000INFO:dict.c(306):Allocating137542*32bytes(4298KiB)forwordentriesINFO:dict.c(321):Readingmaindictionary:/usr/local/share/pocketsphinx/model/lm/en_US/cmu07a.dicINFO:dict.c(212):Allocated1010KiBforstrings,1664KiBforphonesINFO:dict.c(324):133436wordsreadINFO:dict.c(330):Readingfillerdictionary:/usr/local/share/pocketsphinx/model/hmm/en_US/hub4wsj_sc_8k/noisedictINFO:dict.c(212):Allocated0KiBforstrings,0KiBforphonesINFO:dict.c(333):11wordsreadINFO:dict2pid.c(396):BuildingPIDtablesfordictionaryINFO:dict2pid.c(404):Allocating50^3*2bytes(244KiB)forword-initialtriphonesINFO:dict2pid.c(131):Allocated60400bytes(58KiB)forword-finaltriphonesINFO:dict2pid.c(195):Allocated60400bytes(58KiB)forsingle-phonewordtriphonesINFO:ngram_model_arpa.c(77):No\data\markinLMfileINFO:ngram_model_dmp.c(142):Willusememory-mappedI/OforLMfileINFO:ngram_model_dmp.c(196):ngrams1=5001,2=436879,3=418286INFO:ngram_model_dmp.c(242):5001=LM.unigrams(+trailer)readINFO:ngram_model_dmp.c(291):436879=LM.bigrams(+trailer)readINFO:ngram_model_dmp.c(317):418286=LM.trigramsreadINFO:ngram_model_dmp.c(342):37293=LM.prob2entriesreadINFO:ngram_model_dmp.c(362):14370=LM.bo_wt2entriesreadINFO:ngram_model_dmp.c(382):36094=LM.prob3entriesreadINFO:ngram_model_dmp.c(410):854=LM.tseg_baseentriesreadINFO:ngram_model_d
mp.c(466):5001=asciiwordstringsreadINFO:ngram_search_fwdtree.c(99):788uniqueinitialdiphonesINFO:ngram_search_fwdtree.c(147):0root,0non-rootchannels,60single-phonewordsINFO:ngram_search_fwdtree.c(186):CreatingsearchtreeINFO:ngram_search_fwdtree.c(191):before:0root,0non-rootchannels,60single-phonewordsINFO:ngram_search_fwdtree.c(326):after:maxnonrootchanincreasedto13428INFO:ngram_search_fwdtree.c(338):after:457root,13300non-rootchannels,26single-phonewordsINFO:ngram_model_arpa.c(77):No\data\markinLMfileINFO:ngram_model_dmp.c(142):Willusememory-mappedI/OforLMfileINFO:ngram_model_dmp.c(196):ngrams1=6685,2=10803,3=11727INFO:ngram_model_dmp.c(242):6685=LM.unigrams(+trailer)readINFO:ngram_model_dmp.c(291):10803=LM.bigrams(+trailer)readINFO:ngram_model_dmp.c(317):11727=LM.trigramsreadINFO:ngram_model_dmp.c(342):2265=LM.prob2entriesreadINFO:ngram_model_dmp.c(362):1943=LM.bo_wt2entriesreadINFO:ngram_model_dmp.c(382):492=LM.prob3entriesreadINFO:ngram_model_dmp.c(410):22=LM.tseg_baseentriesreadINFO:ngram_model_dmp.c(466):6685=asciiwordstringsreadINFO:ngram_search_fwdtree.c(99):788uniqueinitialdiphonesINFO:ngram_search_fwdtree.c(147):0root,0non-rootchannels,60single-phonewordsINFO:ngram_search_fwdtree.c(186):CreatingsearchtreeINFO:ngram_search_fwdtree.c(191):before:0root,0non-rootchannels,60single-phonewordsINFO:ngram_search_fwdtree.c(338):after:457root,13300non-rootchannels,26single-phonewordsINFO:ngram_search_fwdtree.c(430):TOTALfwdtree0.00CPU-nanxRTINFO:ngram_search_fwdtree.c(433):TOTALfwdtree0.00wall-nanxRTINFO:cmd_ln.c(691):Parsingcommandline:\-nfilt20\-lowerf1\-upperf4000\-wlen0.025\-transformdct\-round_filtersno\-remove_dcyes\-svspec0-12/13-25/26-38\-feat1s_c_d_dd\-agcnone\-cmncurrent\-cmninit56,-3,1\-varnormnoCurrentconfiguration:[NAME][DEFLT][VALUE]-agcnonenone-agcthresh2.02.000000e+00-alpha0.979.700000e-01-ceplen1313-cmncurrentcurrent-cmninit8.056,-3,1-dithernono-doublebwnono-feat1s_c_d_dd1s_c_d_dd-frate100100-input_endianlittlelittle-lda-ldadim00-lifter00-logspecnono-
lowerf133.333341.000000e+00-ncep1313-nfft512256-nfilt4020-remove_dcnoyes-round_filtersyesno-samprate160008.000000e+03-seed-1-1-smoothspecnono-svspec0-12/13-25/26-38-transformlegacydct-unit_areayesyes-upperf6855.49764.000000e+03-varnormnono-verbosenono-warp_params-warp_typeinverse_linearinverse_linear-wlen0.0256252.500000e-02INFO:acmod.c(242):Parsedmodel-specificfeatureparametersfrom/usr/local/share/pocketsphinx/model/hmm/en_US/hub4wsj_sc_8k/feat.paramsINFO:feat.c(684):Initializingfeaturestreamtotype:'1s_c_d_dd',ceplen=13,CMN='current',VARNORM='no',AGC='none'INFO:cmn.c(142):mean[0]=12.00,mean[1..12]=0.0INFO:acmod.c(163):Usingsubvectorspecification0-12/13-25/26-38INFO:mdef.c(520):Readingmodeldefinition:/usr/local/share/pocketsphinx/model/hmm/en_US/hub4wsj_sc_8k/mdefINFO:mdef.c(531):Foundbyte-ordermarkBMDF,assumingthisisabinarymdeffileINFO:bin_mdef.c(330):Readingbinarymodeldefinition:/usr/local/share/pocketsphinx/model/hmm/en_US/hub4wsj_sc_8k/mdefINFO:bin_mdef.c(507):50CI-phone,143047CD-phone,3emitstate/phone,150CI-sen,5150Sen,27135Sen-SeqINFO:tmat.c(205):ReadingHMMtransitionprobabilitymatrices:/usr/local/share/pocketsphinx/model/hmm/en_US/hub4wsj_sc_8k/transition_matricesINFO:acmod.c(117):AttemptingtouseSCHMMcomputationmoduleINFO:ms_gauden.c(198):Readingmixturegaussianparameter:/usr/local/share/pocketsphinx/model/hmm/en_US/hub4wsj_sc_8k/meansINFO:ms_gauden.c(292):1codebook,3feature,size:INFO:ms_gauden.c(294):256x13INFO:ms_gauden.c(294):256x13INFO:ms_gauden.c(294):256x13INFO:ms_gauden.c(198):Readingmixturegaussianparameter:/usr/local/share/pocketsphinx/model/hmm/en_US/hub4wsj_sc_8k/variancesINFO:ms_gauden.c(292):1codebook,3feature,size:INFO:ms_gauden.c(294):256x13INFO:ms_gauden.c(294):256x13INFO:ms_gauden.c(294):256x13INFO:ms_gauden.c(354):0variancevaluesflooredINFO:s2_semi_mgau.c(908):Loadingsenonesfromdumpfile/usr/local/share/pocketsphinx/model/hmm/en_US/hub4wsj_sc_8k/sendumpINFO:s2_semi_mgau.c(932):BEGINFILEFORMATDESCRIPTIONINFO:s2_semi_mgau.c(1027):Usingmemory-mapp
edI/OforsenonesINFO:s2_semi_mgau.c(1304):Maximumtop-N:4Top-Nbeams:000INFO:dict.c(306):Allocating20902*32bytes(653KiB)forwordentriesINFO:dict.c(321):Readingmaindictionary:/usr/local/share/pocketsphinx/model/lm/news_SMS_models/mobileasr_test.dicERROR:"dict.c",line194:Line1:Phone'IX'ismisingintheacousticmodel;word''di' ignoredERROR: "dict.c", line 194: Line 3: Phone 'OX' is mising in the acoustic model; word ''ko'ignoredERROR:"dict.c",line194:Line4:Phone'OX'ismisingintheacousticmodel;word''kong' ignoredERROR: "dict.c", line 194: Line 6: Phone 'OX' is mising in the acoustic model; word ''no'ignoredERROR:"dict.c",line194:Line7:Phone'AX'ismisingintheacousticmodel;word''pag' ignoredERROR: "dict.c", line 194: Line 8: Phone 'IX' is mising in the acoustic model; word ''pinas'ignoredERROR:"dict.c",line194:Line16794:Phone'H'ismisingintheacousticmodel;word'zucchini'ignoredINFO:dict.c(212):Allocated12KiBforstrings,20KiBforphonesINFO:dict.c(324):2327wordsreadINFO:dict.c(330):Readingfillerdictionary:/usr/local/share/pocketsphinx/model/hmm/en_US/hub4wsj_sc_8k/noisedictINFO:dict.c(212):Allocated0KiBforstrings,0KiBforphonesINFO:dict.c(333):11wordsreadINFO:dict2pid.c(396):BuildingPIDtablesfordictionaryINFO:dict2pid.c(404):Allocating50^3*2bytes(244KiB)forword-initialtriphonesINFO:dict2pid.c(131):Allocated60400bytes(58KiB)forword-finaltriphonesINFO:dict2pid.c(195):Allocated60400bytes(58KiB)forsingle-phonewordtriphonesINFO:ngram_model_arpa.c(77):No\data\markinLMfileINFO:ngram_model_dmp.c(142):Willusememory-mappedI/OforLMfileINFO:ngram_model_dmp.c(196):ngrams1=6685,2=10803,3=11727INFO:ngram_model_dmp.c(242):6685=LM.unigrams(+trailer)readINFO:ngram_model_dmp.c(291):10803=LM.bigrams(+trailer)readINFO:ngram_model_dmp.c(317):11727=LM.trigramsreadINFO:ngram_model_dmp.c(342):2265=LM.prob2entriesreadINFO:ngram_model_dmp.c(362):1943=LM.bo_wt2entriesreadINFO:ngram_model_dmp.c(382):492=LM.prob3entriesreadINFO:ngram_model_dmp.c(410):22=LM.tseg_baseentriesreadINFO:ngram_model_dmp.c(466):6685=asciiword
stringsreadINFO:ngram_search_fwdtree.c(99):374uniqueinitialdiphonesINFO:ngram_search_fwdtree.c(147):0root,0non-rootchannels,26single-phonewordsINFO:ngram_search_fwdtree.c(186):CreatingsearchtreeINFO:ngram_search_fwdtree.c(191):before:0root,0non-rootchannels,26single-phonewordsINFO:ngram_search_fwdtree.c(326):after:maxnonrootchanincreasedto2738INFO:ngram_search_fwdtree.c(338):after:336root,2610non-rootchannels,23single-phonewordsNowplaying:05-111101-001.flvRunning...
E_FATAL("Failed to create filterbank, frequency range does not match. ""Sample rate %f, FFT size %d, lowerf %f < freq %f > upperf %f.\n",mel_fb->s
Good day!
I'm trying to use pocketsphinx to decode audio from video files in another
language using the gstreamer pocketsphinx plugin.
However, pocketsphinx does not seem to use the files I specified, except for
the dictionary. As a result, I get many errors saying a phone is "missing in
the acoustic model", and many words in the dictionary are ignored.
The program works fine except for this. The decoder is able to produce partial
and final results per utterance, as expected, and I am able to access and
output these strings.
I will be posting my code and the log.
Any thoughts? Thanks! ^-^
The relevant part:
The log file:
You need to set
property as a last step. You need to set
property before
property.
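For what it's worth, one way to confirm that the dictionary is being matched against the wrong acoustic model is to compare the phones used in the dictionary with the model's phone set. Below is a minimal sketch, assuming a plain-text dictionary with one "word PH1 PH2 ..." entry per line. The phone set and the pronunciations in it are hypothetical, made up for illustration; they are not taken from hub4wsj_sc_8k or from the poster's dictionary.

```python
def find_missing_phones(dict_lines, model_phones):
    """Map each phone absent from the acoustic model to the words that use it."""
    missing = {}
    for line in dict_lines:
        parts = line.split()
        if len(parts) < 2:
            continue  # skip blank or malformed lines
        word, phones = parts[0], parts[1:]
        for phone in phones:
            if phone not in model_phones:
                missing.setdefault(phone, []).append(word)
    return missing


# Hypothetical phone subset of an English model (assumption, for illustration).
model_phones = {"D", "IY", "K", "OW", "N", "SIL"}

# Hypothetical dictionary entries; the real pronunciations are not in the thread.
dict_lines = [
    "'di D IX",
    "'ko K OX",
    "no N OW",
]

report = find_missing_phones(dict_lines, model_phones)
for phone, words in sorted(report.items()):
    print(f"Phone {phone!r} is not in the model; affected words: {words}")
```

If every reported phone belongs to the other language's phone set, that suggests the decoder is still running with the default English acoustic model rather than the one you specified.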
I see, thank you very much!
I'm not sure if I can ask this in the same thread, but..
Now I'm encountering a problem I do not understand.
Sometimes, when I run the program, the decoder initialization stops at this
error:
Sometimes this happens; other times, everything works fine. I do not see a
pattern in when it fails, and I have not experienced this when using the
default models.
Is this a problem with the models I'm using?
Again, thanks for the help! ^-^
This means that the sample rate is sometimes not set properly or not
negotiated properly in the pipeline. It might be that the other model doesn't
properly configure the frontend through its feat.params options, or something else.
I recommend using the latest version; its messages are better.
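To illustrate the kind of mismatch involved, here is a rough sketch of a sanity check on the filterbank frequency range. It is not the check sphinxbase itself performs, just a simplified version of the idea that the upper filter frequency must not exceed half the sample rate. The function name is made up; the numbers are the -samprate, -lowerf and -upperf values that appear in the posted log.

```python
def check_filterbank(samprate, lowerf, upperf):
    """Return a list of problems with the mel filterbank frequency range."""
    problems = []
    nyquist = samprate / 2.0  # highest frequency representable at this rate
    if lowerf < 0:
        problems.append(f"lowerf {lowerf} is negative")
    if lowerf >= upperf:
        problems.append(f"lowerf {lowerf} is not below upperf {upperf}")
    if upperf > nyquist:
        problems.append(f"upperf {upperf} exceeds half the sample rate ({nyquist})")
    return problems


# Values the 8 kHz model's feat.params sets in the log: consistent.
print(check_filterbank(8000, 1.0, 4000.0))

# Default lowerf/upperf (meant for 16 kHz) combined with an 8 kHz sample rate:
# the upper edge lies above the Nyquist frequency.
print(check_filterbank(8000, 133.33334, 6855.4976))
```

If the pipeline sometimes negotiates a sample rate other than the one feat.params was written for, a check like this would pass on some runs and fail on others, which would match the intermittent failures.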