Menu

No transcription using pocketsphinx_continuous on WAV

Help
Max Lay
2016-08-30
2016-09-02
  • Max Lay

    Max Lay - 2016-08-30

    I am trying to transcribe a wave file using the command:


    pocketsphinx_continuous -infile infile.wav -hmm /root/sphinx-source/pt4/model_parameters/your_db.ci_cont -samprate 8000 -dict etc/your_db.dic


    But get no transcription, only the following output:


    INFO: pocketsphinx.c(152): Parsed model-specific feature parameters from /root/sphinx-source/pt4/model_parameters/your_db.ci_cont/feat.params
    Current configuration:
    [NAME] [DEFLT] [VALUE]
    -agc none none
    -agcthresh 2.0 2.000000e+00
    -allphone
    -allphone_ci no no
    -alpha 0.97 9.700000e-01
    -ascale 20.0 2.000000e+01
    -aw 1 1
    -backtrace no no
    -beam 1e-48 1.000000e-48
    -bestpath yes yes
    -bestpathlw 9.5 9.500000e+00
    -ceplen 13 13
    -cmn live batch
    -cmninit 40,3,-1 32.83,0.32,-0.61
    -compallsen no no
    -debug 0
    -dict etc/your_db.dic
    -dictcase no no
    -dither no no
    -doublebw no no
    -ds 1 1
    -fdict
    -feat 1s_c_d_dd 1s_c_d_dd
    -featparams
    -fillprob 1e-8 1.000000e-08
    -frate 100 100
    -fsg
    -fsgusealtpron yes yes
    -fsgusefiller yes yes
    -fwdflat yes yes
    -fwdflatbeam 1e-64 1.000000e-64
    -fwdflatefwid 4 4
    -fwdflatlw 8.5 8.500000e+00
    -fwdflatsfwin 25 25
    -fwdflatwbeam 7e-29 7.000000e-29
    -fwdtree yes yes
    -hmm /root/sphinx-source/pt4/model_parameters/your_db.ci_cont
    -input_endian little little
    -jsgf
    -keyphrase
    -kws
    -kws_delay 10 10
    -kws_plp 1e-1 1.000000e-01
    -kws_threshold 1 1.000000e+00
    -latsize 5000 5000
    -lda
    -ldadim 0 0
    -lifter 0 22
    -lm /usr/local/share/pocketsphinx/model/en-us/en-us.lm.bin
    -lmctl
    -lmname
    -logbase 1.0001 1.000100e+00
    -logfn
    -logspec no no
    -lowerf 133.33334 2.000000e+02
    -lpbeam 1e-40 1.000000e-40
    -lponlybeam 7e-29 7.000000e-29
    -lw 6.5 6.500000e+00
    -maxhmmpf 30000 30000
    -maxwpf -1 -1
    -mdef
    -mean
    -mfclogdir
    -min_endfr 0 0
    -mixw
    -mixwfloor 0.0000001 1.000000e-07
    -mllr
    -mmap yes yes
    -ncep 13 13
    -nfft 512 512
    -nfilt 40 15
    -nwpen 1.0 1.000000e+00
    -pbeam 1e-48 1.000000e-48
    -pip 1.0 1.000000e+00
    -pl_beam 1e-10 1.000000e-10
    -pl_pbeam 1e-10 1.000000e-10
    -pl_pip 1.0 1.000000e+00
    -pl_weight 3.0 3.000000e+00
    -pl_window 5 5
    -rawlogdir
    -remove_dc no no
    -remove_noise yes yes
    -remove_silence yes yes
    -round_filters yes yes
    -samprate 16000 8.000000e+03
    -seed -1 -1
    -sendump
    -senlogdir
    -senmgau
    -silprob 0.005 5.000000e-03
    -smoothspec no no
    -svspec
    -tmat
    -tmatfloor 0.0001 1.000000e-04
    -topn 4 4
    -topn_beam 0 0
    -toprule
    -transform legacy dct
    -unit_area yes yes
    -upperf 6855.4976 3.500000e+03
    -uw 1.0 1.000000e+00
    -vad_postspeech 50 50
    -vad_prespeech 20 20
    -vad_startspeech 10 10
    -vad_threshold 2.0 2.000000e+00
    -var
    -varfloor 0.0001 1.000000e-04
    -varnorm no no
    -verbose no no
    -warp_params
    -warp_type inverse_linear inverse_linear
    -wbeam 7e-29 7.000000e-29
    -wip 0.65 6.500000e-01
    -wlen 0.025625 2.562500e-02

    INFO: feat.c(715): Initializing feature stream to type: '1s_c_d_dd', ceplen=13, CMN='batch', VARNORM='no', AGC='none'
    INFO: mdef.c(518): Reading model definition: /root/sphinx-source/pt4/model_parameters/your_db.ci_cont/mdef
    INFO: bin_mdef.c(181): Allocating 200 * 8 bytes (1 KiB) for CD tree
    INFO: tmat.c(149): Reading HMM transition probability matrices: /root/sphinx-source/pt4/model_parameters/your_db.ci_cont/transition_matrices
    INFO: acmod.c(117): Attempting to use PTM computation module
    INFO: ms_gauden.c(127): Reading mixture gaussian parameter: /root/sphinx-source/pt4/model_parameters/your_db.ci_cont/means
    INFO: ms_gauden.c(242): 147 codebook, 1 feature, size:
    INFO: ms_gauden.c(244): 1x39
    INFO: ms_gauden.c(127): Reading mixture gaussian parameter: /root/sphinx-source/pt4/model_parameters/your_db.ci_cont/variances
    INFO: ms_gauden.c(242): 147 codebook, 1 feature, size:
    INFO: ms_gauden.c(244): 1x39
    INFO: ms_gauden.c(304): 0 variance values floored
    INFO: ptm_mgau.c(808): Number of codebooks doesn't match number of ciphones, doesn't look like PTM: 147 != 49
    INFO: acmod.c(119): Attempting to use semi-continuous computation module
    INFO: ms_gauden.c(127): Reading mixture gaussian parameter: /root/sphinx-source/pt4/model_parameters/your_db.ci_cont/means
    INFO: ms_gauden.c(242): 147 codebook, 1 feature, size:
    INFO: ms_gauden.c(244): 1x39
    INFO: ms_gauden.c(127): Reading mixture gaussian parameter: /root/sphinx-source/pt4/model_parameters/your_db.ci_cont/variances
    INFO: ms_gauden.c(242): 147 codebook, 1 feature, size:
    INFO: ms_gauden.c(244): 1x39
    INFO: ms_gauden.c(304): 0 variance values floored
    INFO: acmod.c(121): Falling back to general multi-stream GMM computation
    INFO: ms_gauden.c(127): Reading mixture gaussian parameter: /root/sphinx-source/pt4/model_parameters/your_db.ci_cont/means
    INFO: ms_gauden.c(242): 147 codebook, 1 feature, size:
    INFO: ms_gauden.c(244): 1x39
    INFO: ms_gauden.c(127): Reading mixture gaussian parameter: /root/sphinx-source/pt4/model_parameters/your_db.ci_cont/variances
    INFO: ms_gauden.c(242): 147 codebook, 1 feature, size:
    INFO: ms_gauden.c(244): 1x39
    INFO: ms_gauden.c(304): 0 variance values floored
    INFO: ms_senone.c(149): Reading senone mixture weights: /root/sphinx-source/pt4/model_parameters/your_db.ci_cont/mixture_weights
    INFO: ms_senone.c(200): Truncating senone logs3(pdf) values by 10 bits
    INFO: ms_senone.c(207): Not transposing mixture weights in memory
    INFO: ms_senone.c(268): Read mixture weights for 147 senones: 1 features x 1 codewords
    INFO: ms_senone.c(320): Mapping senones to individual codebooks
    INFO: ms_mgau.c(144): The value of topn: 4
    WARN: "ms_mgau.c", line 148: -topn argument (4) invalid or > #density codewords (1); set to latter
    INFO: phone_loop_search.c(114): State beam -225 Phone exit beam -225 Insertion penalty 0
    INFO: dict.c(320): Allocating 29650 * 32 bytes (926 KiB) for word entries
    INFO: dict.c(333): Reading main dictionary: etc/your_db.dic
    INFO: dict.c(213): Dictionary size 25551, allocated 199 KiB for strings, 458 KiB for phones
    INFO: dict.c(336): 25551 words read
    INFO: dict.c(358): Reading filler dictionary: /root/sphinx-source/pt4/model_parameters/your_db.ci_cont/noisedict
    INFO: dict.c(213): Dictionary size 25554, allocated 0 KiB for strings, 0 KiB for phones
    INFO: dict.c(361): 3 words read
    INFO: dict2pid.c(396): Building PID tables for dictionary
    INFO: dict2pid.c(406): Allocating 49^3 * 2 bytes (229 KiB) for word-initial triphones
    INFO: dict2pid.c(132): Allocated 58016 bytes (56 KiB) for word-final triphones
    INFO: dict2pid.c(196): Allocated 58016 bytes (56 KiB) for single-phone word triphones
    INFO: ngram_model_trie.c(354): Trying to read LM in trie binary format
    INFO: ngram_search_fwdtree.c(74): Initializing search tree
    INFO: ngram_search_fwdtree.c(101): 546 unique initial diphones
    INFO: ngram_search_fwdtree.c(186): Creating search channels
    INFO: ngram_search_fwdtree.c(323): Max nonroot chan increased to 128
    INFO: ngram_search_fwdtree.c(333): Created 0 root, 0 non-root channels, 3 single-phone words
    INFO: ngram_search_fwdflat.c(157): fwdflat: min_ef_width = 4, max_sf_win = 25
    INFO: continuous.c(307): pocketsphinx_continuous COMPILED ON: Aug 30 2016, AT: 06:52:46

    INFO: cmn_live.c(120): Update from < 32.83 0.32 -0.61 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 >
    INFO: cmn_live.c(138): Update to < 32.83 0.32 -0.61 -8.74 -2.64 -5.97 -5.55 -1.27 -2.80 4.42 0.80 2.10 0.52 >
    INFO: ngram_search_fwdtree.c(1550): 615 words recognized (2/fr)
    INFO: ngram_search_fwdtree.c(1552): 861 senones evaluated (3/fr)
    INFO: ngram_search_fwdtree.c(1556): 647 channels searched (2/fr), 0 1st, 647 last
    INFO: ngram_search_fwdtree.c(1559): 647 words for which last channels evaluated (2/fr)
    INFO: ngram_search_fwdtree.c(1561): 0 candidate words for entering last phone (0/fr)
    INFO: ngram_search_fwdtree.c(1564): fwdtree 0.00 CPU 0.000 xRT
    INFO: ngram_search_fwdtree.c(1567): fwdtree 0.00 wall 0.000 xRT
    INFO: ngram_search_fwdflat.c(302): Utterance vocabulary contains 2 words
    INFO: ngram_search_fwdflat.c(948): 814 words recognized (3/fr)
    INFO: ngram_search_fwdflat.c(950): 864 senones evaluated (3/fr)
    INFO: ngram_search_fwdflat.c(952): 856 channels searched (2/fr)
    INFO: ngram_search_fwdflat.c(954): 856 words searched (2/fr)
    INFO: ngram_search_fwdflat.c(957): 76 word transitions (0/fr)
    INFO: ngram_search_fwdflat.c(960): fwdflat 0.00 CPU 0.000 xRT
    INFO: ngram_search_fwdflat.c(963): fwdflat 0.00 wall 0.000 xRT
    INFO: ngram_search.c(1250): lattice start node .0 end node .195
    INFO: ngram_search.c(1276): Eliminated 0 nodes before end node
    INFO: ngram_search.c(1381): Lattice has 13 nodes, 17 links
    INFO: ps_lattice.c(1380): Bestpath score: -1586
    INFO: ps_lattice.c(1384): Normalizer P(O) = alpha(:195:287) = -105912
    INFO: ps_lattice.c(1441): Joint P(O,S) = -112665 P(S|O) = -6753
    INFO: ngram_search.c(872): bestpath 0.00 CPU 0.000 xRT
    INFO: ngram_search.c(875): bestpath 0.00 wall 0.000 xRT

    INFO: ngram_search_fwdtree.c(429): TOTAL fwdtree 0.00 CPU 0.000 xRT
    INFO: ngram_search_fwdtree.c(432): TOTAL fwdtree 0.00 wall 0.000 xRT
    INFO: ngram_search_fwdflat.c(176): TOTAL fwdflat 0.00 CPU 0.000 xRT
    INFO: ngram_search_fwdflat.c(179): TOTAL fwdflat 0.00 wall 0.000 xRT
    INFO: ngram_search.c(303): TOTAL bestpath 0.00 CPU 0.000 xRT
    INFO: ngram_search.c(306): TOTAL bestpath 0.00 wall 0.000 xRT


    I have already tried adding -cmninit 32.83,0.32,-0.61 to etc/feat.params as described here: https://sourceforge.net/p/cmusphinx/discussion/help/thread/399685e2/

    Any ideas?
    Thanks

     
    • Nickolay V. Shmyrev

      You need to specify the compatible language model, you are using english model by default and that probably does not have any words from your dictionary.

       

Log in to post a comment.

Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.