Hi, I need to build a speech recognizer for about 400 keywords. I have created my own acoustic model for Italian. As a first test I set up two keywords, each with its own 16-bit mono audio files, and I use a JSGF grammar. The model builds without errors, but when I test it with -inmic or with its own WAV files I do not get the expected result :(
my project: https://github.com/McNamara10/acoustic-italian
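In case the audio format is part of the problem, here is a minimal sketch of how the recordings could be checked for 16-bit mono at 16000 Hz (the -samprate value that appears in the configuration dump). It uses only Python's standard wave module; the function names are hypothetical and are not part of my project.

```python
import wave


def describe_wav(path):
    """Return (channels, bits_per_sample, sample_rate_hz) for a PCM WAV file."""
    with wave.open(path, "rb") as wav:
        return wav.getnchannels(), wav.getsampwidth() * 8, wav.getframerate()


def matches_decoder_format(path, expected_rate=16000):
    """True if the file is mono, 16-bit, and sampled at expected_rate.

    expected_rate defaults to 16000 as an assumption taken from the
    -samprate line of the decoder configuration.
    """
    channels, bits, rate = describe_wav(path)
    return channels == 1 and bits == 16 and rate == expected_rate
```

It could be called on each training or test file, for example matches_decoder_format("/usr/local/share/pocketsphinx/model/ara/wav/speaker1/file_1.wav"). A False result would only mean the file differs from the assumed format, not that this is the cause of the error.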
my complete error:
synapse@mcnamara:/usr/local/bin$ pocketsphinx_continuous -infile /usr/local/share/pocketsphinx/model/ara/wav/speaker1/file_1.wav -hmm /usr/local/share/pocketsphinx/model/ara/model_parameters/ara.ci_cont -jsgf /usr/local/share/pocketsphinx/model/ara/etc/ara.jsgf -dict /usr/local/share/pocketsphinx/model/ara/etc/ara.dic
INFO: pocketsphinx.c(152): Parsed model-specific feature parameters from /usr/local/share/pocketsphinx/model/ara/model_parameters/ara.ci_cont/feat.params
Current configuration:
[NAME]              [DEFLT]         [VALUE]
-agc                none            none
-agcthresh          2.0             2.000000e+00
-allphone
-allphone_ci        no              no
-alpha              0.97            9.700000e-01
-ascale             20.0            2.000000e+01
-aw                 1               1
-backtrace          no              no
-beam               1e-48           1.000000e-48
-bestpath           yes             yes
-bestpathlw         9.5             9.500000e+00
-ceplen             13              13
-cmn                live            batch
-cmninit            40,3,-1         40,3,-1
-compallsen         no              no
-debug                              0
-dict                               /usr/local/share/pocketsphinx/model/ara/etc/ara.dic
-dictcase           no              no
-dither             no              no
-doublebw           no              no
-ds                 1               1
-fdict
-feat               1s_c_d_dd       1s_c_d_dd
-featparams
-fillprob           1e-8            1.000000e-08
-frate              100             100
-fsg
-fsgusealtpron      yes             yes
-fsgusefiller       yes             yes
-fwdflat            yes             yes
-fwdflatbeam        1e-64           1.000000e-64
-fwdflatefwid       4               4
-fwdflatlw          8.5             8.500000e+00
-fwdflatsfwin       25              25
-fwdflatwbeam       7e-29           7.000000e-29
-fwdtree            yes             yes
-hmm                                /usr/local/share/pocketsphinx/model/ara/model_parameters/ara.ci_cont
-input_endian       little          little
-jsgf                               /usr/local/share/pocketsphinx/model/ara/etc/ara.jsgf
-keyphrase
-kws
-kws_delay          10              10
-kws_plp            1e-1            1.000000e-01
-kws_threshold      1               1.000000e+00
-latsize            5000            5000
-lda
-ldadim             0               0
-lifter             0               22
-lm
-lmctl
-lmname
-logbase            1.0001          1.000100e+00
-logfn
-logspec            no              no
-lowerf             133.33334       1.300000e+02
-lpbeam             1e-40           1.000000e-40
-lponlybeam         7e-29           7.000000e-29
-lw                 6.5             6.500000e+00
-maxhmmpf           30000           30000
-maxwpf             -1              -1
-mdef
-mean
-mfclogdir
-min_endfr          0               0
-mixw
-mixwfloor          0.0000001       1.000000e-07
-mllr
-mmap               yes             yes
-ncep               13              13
-nfft               512             512
-nfilt              40              25
-nwpen              1.0             1.000000e+00
-pbeam              1e-48           1.000000e-48
-pip                1.0             1.000000e+00
-pl_beam            1e-10           1.000000e-10
-pl_pbeam           1e-10           1.000000e-10
-pl_pip             1.0             1.000000e+00
-pl_weight          3.0             3.000000e+00
-pl_window          5               5
-rawlogdir
-remove_dc          no              no
-remove_noise       yes             yes
-remove_silence     yes             yes
-round_filters      yes             yes
-samprate           16000           1.600000e+04
-seed               -1              -1
-sendump
-senlogdir
-senmgau
-silprob            0.005           5.000000e-03
-smoothspec         no              no
-svspec
-tmat
-tmatfloor          0.0001          1.000000e-04
-topn               4               4
-topn_beam          0               0
-toprule
-transform          legacy          dct
-unit_area          yes             yes
-upperf             6855.4976       3.500000e+03
-uw                 1.0             1.000000e+00
-vad_postspeech     50              50
-vad_prespeech      20              20
-vad_startspeech    10              10
-vad_threshold      2.0             2.000000e+00
-var
-varfloor           0.0001          1.000000e-04
-varnorm            no              no
-verbose            no              no
-warp_params
-warp_type          inverse_linear  inverse_linear
-wbeam              7e-29           7.000000e-29
-wip                0.65            6.500000e-01
-wlen               0.025625        2.562500e-02
INFO: feat.c(715): Initializing feature stream to type: '1s_c_d_dd', ceplen=13, CMN='batch', VARNORM='no', AGC='none'
INFO: mdef.c(518): Reading model definition: /usr/local/share/pocketsphinx/model/ara/model_parameters/ara.ci_cont/mdef
INFO: bin_mdef.c(181): Allocating 52 * 8 bytes (0 KiB) for CD tree
INFO: tmat.c(149): Reading HMM transition probability matrices: /usr/local/share/pocketsphinx/model/ara/model_parameters/ara.ci_cont/transition_matrices
INFO: acmod.c(113): Attempting to use PTM computation module
INFO: ms_gauden.c(127): Reading mixture gaussian parameter: /usr/local/share/pocketsphinx/model/ara/model_parameters/ara.ci_cont/means
INFO: ms_gauden.c(242): 36 codebook, 1 feature, size:
INFO: ms_gauden.c(244): 1x39
INFO: ms_gauden.c(127): Reading mixture gaussian parameter: /usr/local/share/pocketsphinx/model/ara/model_parameters/ara.ci_cont/variances
INFO: ms_gauden.c(242): 36 codebook, 1 feature, size:
INFO: ms_gauden.c(244): 1x39
INFO: ms_gauden.c(304): 0 variance values floored
INFO: ptm_mgau.c(808): Number of codebooks doesn't match number of ciphones, doesn't look like PTM: 36 != 12
INFO: acmod.c(115): Attempting to use semi-continuous computation module
INFO: ms_gauden.c(127): Reading mixture gaussian parameter: /usr/local/share/pocketsphinx/model/ara/model_parameters/ara.ci_cont/means
INFO: ms_gauden.c(242): 36 codebook, 1 feature, size:
INFO: ms_gauden.c(244): 1x39
INFO: ms_gauden.c(127): Reading mixture gaussian parameter: /usr/local/share/pocketsphinx/model/ara/model_parameters/ara.ci_cont/variances
INFO: ms_gauden.c(242): 36 codebook, 1 feature, size:
INFO: ms_gauden.c(244): 1x39
INFO: ms_gauden.c(304): 0 variance values floored
INFO: acmod.c(117): Falling back to general multi-stream GMM computation
INFO: ms_gauden.c(127): Reading mixture gaussian parameter: /usr/local/share/pocketsphinx/model/ara/model_parameters/ara.ci_cont/means
INFO: ms_gauden.c(242): 36 codebook, 1 feature, size:
INFO: ms_gauden.c(244): 1x39
INFO: ms_gauden.c(127): Reading mixture gaussian parameter: /usr/local/share/pocketsphinx/model/ara/model_parameters/ara.ci_cont/variances
INFO: ms_gauden.c(242): 36 codebook, 1 feature, size:
INFO: ms_gauden.c(244): 1x39
INFO: ms_gauden.c(304): 0 variance values floored
INFO: ms_senone.c(149): Reading senone mixture weights: /usr/local/share/pocketsphinx/model/ara/model_parameters/ara.ci_cont/mixture_weights
INFO: ms_senone.c(200): Truncating senone logs3(pdf) values by 10 bits
INFO: ms_senone.c(207): Not transposing mixture weights in memory
INFO: ms_senone.c(268): Read mixture weights for 36 senones: 1 features x 1 codewords
INFO: ms_senone.c(320): Mapping senones to individual codebooks
INFO: ms_mgau.c(144): The value of topn: 4
WARN: "ms_mgau.c", line 148: -topn argument (4) invalid or > #density codewords (1); set to latter
INFO: phone_loop_search.c(114): State beam -225 Phone exit beam -225 Insertion penalty 0
INFO: dict.c(320): Allocating 4101 * 20 bytes (80 KiB) for word entries
INFO: dict.c(333): Reading main dictionary: /usr/local/share/pocketsphinx/model/ara/etc/ara.dic
INFO: dict.c(213): Dictionary size 2, allocated 0 KiB for strings, 0 KiB for phones
INFO: dict.c(336): 2 words read
INFO: dict.c(358): Reading filler dictionary: /usr/local/share/pocketsphinx/model/ara/model_parameters/ara.ci_cont/noisedict
INFO: dict.c(213): Dictionary size 5, allocated 0 KiB for strings, 0 KiB for phones
INFO: dict.c(361): 3 words read
INFO: dict2pid.c(396): Building PID tables for dictionary
INFO: dict2pid.c(406): Allocating 12^3 * 2 bytes (3 KiB) for word-initial triphones
INFO: dict2pid.c(132): Allocated 1776 bytes (1 KiB) for word-final triphones
INFO: dict2pid.c(196): Allocated 1776 bytes (1 KiB) for single-phone word triphones
INFO: jsgf.c(706): Defined rule: PUBLIC <hello.greet>
INFO: fsg_model.c(208): Computing transitive closure for null transitions
INFO: fsg_model.c(270): 0 null transitions added
INFO: fsg_search.c(227): FSG(beam: -1080, pbeam: -1080, wbeam: -634; wip: -26, pip: 0)
INFO: fsg_model.c(423): Adding silence transitions for <sil> to FSG
INFO: fsg_model.c(443): Added 2 silence word transitions
INFO: fsg_search.c(173): Added 0 alternate word transitions
INFO: fsg_lextree.c(110): Allocated 52 bytes (0 KiB) for left and right context phones
INFO: fsg_lextree.c(256): 20 HMM nodes in lextree (4 leaves)
INFO: fsg_lextree.c(259): Allocated 2400 bytes (2 KiB) for all lextree nodes
INFO: fsg_lextree.c(262): Allocated 480 bytes (0 KiB) for lextree leaf nodes
INFO: continuous.c(307): pocketsphinx_continuous COMPILED ON: Oct 19 2017, AT: 18:51:22
INFO: cmn_live.c(120): Update from < 40.00 3.00 -1.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 >
INFO: cmn_live.c(138): Update to < 31.12 2.26 -3.02 -9.04 -5.50 1.91 0.67 9.94 -7.86 -1.07 -5.21 -2.40 0.22 >
INFO: fsg_search.c(859): 201 frames, 379 HMMs (1/fr), 1137 senones (5/fr), 62 history entries (0/fr)
INFO: fsg_search.c(869): fsg 0.00 CPU 0.002 xRT
INFO: fsg_search.c(871): fsg 0.00 wall 0.002 xRT
ERROR: "fsg_search.c", line 940: Final result does not match the grammar in frame 201
INFO: fsg_search.c(265): TOTAL fsg 0.00 CPU 0.002 xRT
INFO: fsg_search.c(268): TOTAL fsg 0.00 wall 0.002 xRT