Menu

repetitions of syllabel are recognized as one syllable

Help
tfpeach
2016-10-13
2016-10-17
  • tfpeach

    tfpeach - 2016-10-13

    Hi, dear all,

    I am working on recognizing the repetitions of syllables, like "DA DA DA DA DA DA". However, when the speaker speak fast, it will only output one "DA". But the duration is the whole sequence. I am using adapted WSJ generic model, and the grammar like this:
    SEQ = (DA)*

    I am using the command:
    pocketsphinx_batch -hmm hub4wsj_sc_8k -feat 1s_c_d_dd -ceplen 13 -ncep 13 -lw 10 -fwdflatlw 10 -bestpathlw 10 -beam 1e-80 -wbeam 1e-40 -fwdflatbeam 1e-80 -fwdflatwbeam 1e-40 -pbeam 1e-80 -lpbeam 1e-80 -lponlybeam 1e-80 -jsgf hello.gram -dict hello.dic -wip 0.2 -ctl UTDallas-concussion_test.fileids -ctloffset 0 -ctlcount 1 -cepdir . -cepext .wav -hyp hello.hyp -agc none -varnorm no -cmn current -hypseg hello.hypseg -remove_noise no -remove_silence yes -transform dct -adcin yes

    It seems the recognizer skip the short pause after each syllable. Is there any way to insert the short pause after each syllable and make it recognize all the syllables?

    Thank you.

     
    • Nickolay V. Shmyrev

      Hi, tfpeach, you still working on the same.

      I believe you screwed something in your model, this behaviour does not seem correct for pocketsphinx_batch. Again, its better for you to provide an example

      -fwdflatlw 10 -fwdflatbeam 1e-80 -fwdflatwbeam 1e-40 -lpbeam 1e-80 -lponlybeam 1e-80 -transform dct -feat 1s_c_d_dd -ceplen 13 -ncep 13

      Those options should have no effect, you can remove them.

       
      • tfpeach

        tfpeach - 2016-10-17

        Thank you, Nickolay.

        I found a problem that I was testing 16000 hz audio with the model "hub4wsj_sc_8k". If I don't mistake, this model should be used with 8k hz audio. Is that right?

        Thank you once again.

         
        • Nickolay V. Shmyrev

          I found a problem that I was testing 16000 hz audio with the model "hub4wsj_sc_8k". If I don't mistake, this model should be used with 8k hz audio. Is that right?

          hub4wsj_sc_8k should work both for 16khz and 8khz, you just need to specify a sample rate properly.

           

Log in to post a comment.

Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.