Menu

Too many incorrect detections with pocketsphinx

Help
Hitesh
2016-12-08
2016-12-13
  • Hitesh

    Hitesh - 2016-12-08

    Hi,

    I built a custom JSGF grammar consisting of about 1000 words (common words such as egg, hat and a lot of the numbers. This is for a kids app). I also used the g2p model to generate pronunciations for all of these and am using that as the dictionary. I'm using the default English model that's part of the pocketsphinx-android demo, which I believe is the en-us-ptm-5.2 model. The application is built on top of the pocketsphinx demo removing the wake up and menu selection and only running the above specified grammar.
    When I run this on a mobile device, I see a lot of incorrect detections (currently being tested by an adult itself). Sometimes, the partial hypothesis seems to be predicting the target word but the final result is often incorrect. At other times, the hypothesis is not close to the word at all.
    Could anyone help me and let me know what I'm doing wrong and how I can fix this?

    Thanks,
    Hitesh

     
    • Nickolay V. Shmyrev

      Mobile device is too slow to recognize such a large grammar, you can see xRT details in the log. Speedups are discussed on our wiki.

       
  • Hitesh

    Hitesh - 2016-12-13

    Hi Nickolay,
    Thanks for that. Could you post the link to where the speedup is discussed? Is my approach for this right or should I be using some kind of language model (n-gram or RNNLM)?

    Also, does it make a difference if I use something like the cmudict with the extra words added as the lexicon, versus using only the words in the grammar as the lexicon?

     
    • Nickolay V. Shmyrev

      Thanks for that. Could you post the link to where the speedup is discussed?

      http://cmusphinx.sourceforge.net/wiki/pocketsphinxhandhelds

      Is my approach for this right or should I be using some kind of language model (n-gram or RNNLM)?

      You didn't provide enough information about application you want to implement to enable answer on this question. You also didn't provide enough information about the problem - logcat output, data files, etc.

      Also, does it make a difference if I use something like the cmudict with the extra words added as the lexicon, versus using only the words in the grammar as the lexicon?

      It does not matter.

       

Log in to post a comment.

Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.