Menu

Name recognition

2014-09-26
2014-10-01
  • K R Srinidhi

    K R Srinidhi - 2014-09-26

    Hi,

    I have a list of movie names and song names (millions). The list keeps getting updated with new names (songs and movie names) every week. It is required to recognize movie/song names from user utterances. For such a requirement
    which is better model:
    1)large grammar file being updated regularly
    OR
    2)Hybrid language model to detect out of vocabulary words .

    Thanks
    Srinidhi

     
    • Nickolay V. Shmyrev

      What do you mean by "better"?

       
  • K R Srinidhi

    K R Srinidhi - 2014-09-26

    For the requirement of recognizing against millions of names (being updated regularly) will recognition work against large grammar (being appended regularly) or an hybrid language model needs to be built to recognize OOV names (for new songname/moviename)
    For user utterance the best possible match from the list of songnames/movienames
    must be recognized.

     
    • Nickolay V. Shmyrev

      will recognition work against large grammar (being appended regularly)

      Yes.

      or an hybrid language model needs to be built to recognize OOV names (for new songname/moviename)

      You can do that way too if you don't want to update the grammar.

       
  • K R Srinidhi

    K R Srinidhi - 2014-09-26

    If grammar file is used and updated regularly then is there any size limit on grammar file.

     
    • Nickolay V. Shmyrev

      Limits are pretty high (MAX_INT32).

       
  • K R Srinidhi

    K R Srinidhi - 2014-10-01

    If a language model is used for my requirement then an open vocabulary language model would be required to recognize OOV words. Does sphinx support open vocabulary language models ? How can I use sphinx to recognize OOV words using language model?

     
  • Nickolay V. Shmyrev

    If a language model is used for my requirement then an open vocabulary language model would be required to recognize OOV words.

    This statement is not quite correct. Speech recognizers can not work with open vocabulary language models. You have to modify speech recognizer itself to be able to recognize arbitrary words, not just replace the language model.

    Does sphinx support open vocabulary language models ?

    No

    How can I use sphinx to recognize OOV words using language model?

    You can not use it without modifications.

     
  • K R Srinidhi

    K R Srinidhi - 2014-10-01

    As we get new data (song and movie names) regularly what are the options we can consider for using sphinx with language model:
    1)With new data(song and movie names) is it required to retrain and build the new language model every time ?
    2)Is there any option where we can add new data without rebuilding language model?

     
    • Nickolay V. Shmyrev

      1)With new data(song and movie names) is it required to retrain and build the new language model every time ?

      Language model requires you to have proper name frequencies, not just the names themselves. If you don't have frequencies you should use grammars, not language models.

      2)Is there any option where we can add new data without rebuilding language model?

      It is usually not a problem to rebuild the language model given you have source data for it.

       

Log in to post a comment.