Menu

Phoneme recognition in real time

Help
2020-01-14
2020-01-14
  • Daniel Natarelli

    Hi! I'm working right now with pocketsphinx and its amazing so far.
    I'm currently investigating if with this tool I can recognize phonemes in real time (as soon as the user speak get that input and later the phoneme) but the time between the input and the output is very high for what we are trying to do (3D models that move their lips when the user is talking on a microphone).
    So my question is, there is a way to improve this time and get the phoneme instantly when the user is speaking?
    I've tried to modify pocketsphinx_continuous in order to get the hyp as soon as posible when a counter reach a certain number of loops in the for(;;) but the ouput isn't instanlty.

    Thanks in advance,
    Kind regards!

     
    • Nickolay V. Shmyrev

      Phonemes have length, so "instantly" is certainly not possible. You should have a delay of 50ms at least.

       
      • Daniel Natarelli

        Thanks for your quick response! I'm gonna investigate more.

        Regards!

         
        • Daniel Natarelli

          Another question crossed my mind. Which speech recognition process is faster, word recognition or phoneme recognition? In all of my test is a lot faster when I get the word than the phoneme.

           
          • Nickolay V. Shmyrev

            For the fastest recognition you need to try kaldi models from kaldi-android-demo. You will have to compile phonetic graph though.

             

Log in to post a comment.

Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.