How to create a phonetic dictionary and phonetic language model for Mandarin?

Help
iridium
2015-08-29
  • iridium

    iridium - 2015-08-29

    Can anyone tell me how I can create a phonetic dictionary and a phonetic language model for Mandarin?
    The wiki here is not detailed enough:
    http://cmusphinx.sourceforge.net/wiki/phonemerecognition?s[]=allphone
    Many thanks

     
    • Nickolay V. Shmyrev

      Can anyone tell me how can I create a phonetic dictionary

      A phonetic dictionary is created with rules that map the words into pinyin. Pinyin is expanded as is. We have a Mandarin dictionary available for download.

      phonetic language model for mandarin?

      The process is described on the wiki:

      For other languages you need a phonetic language model for your phoneset; the steps are the following. You can take a text and convert it to phonetic strings using the phonetic dictionary for your language: just replace the words with their corresponding transcriptions. Since the number of phones is small, the text doesn't need to be big either; a single book will do. If you have training data, you can use forced alignment to get a transcription with dictionary variants. This way the phonetic transcription will be more precise. Then you can build a language model from the phonetic transcription using any language model building tool, such as cmuclmtk or SRILM.
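The word-to-transcription replacement step described above might look roughly like the following. This is only a sketch under some assumptions: the text is already segmented into space-separated words, the dictionary uses the "WORD PH1 PH2 ..." line layout, and the two entries and their phone labels are made up for illustration rather than taken from the real Mandarin dictionary. The function names are hypothetical as well.

```python
def parse_dictionary(lines):
    """Parse lines of the form 'WORD PH1 PH2 ...' into a word -> phones map."""
    lexicon = {}
    for line in lines:
        parts = line.split()
        if len(parts) < 2:
            continue  # skip blank or malformed lines
        word, phones = parts[0], parts[1:]
        # If a word is listed more than once, keep the first transcription.
        lexicon.setdefault(word, phones)
    return lexicon


def to_phonetic(sentence, lexicon):
    """Replace each word with its phones; return None if any word is unknown."""
    phones = []
    for word in sentence.split():
        if word not in lexicon:
            return None
        phones.extend(lexicon[word])
    return " ".join(phones)


# Toy dictionary: the phone labels here are illustrative assumptions only.
toy_dictionary = [
    "你好 n i h ao",
    "世界 sh i j ie",
]

toy_lexicon = parse_dictionary(toy_dictionary)
print(to_phonetic("你好 世界", toy_lexicon))
```

Sentences containing a word that is missing from the dictionary are dropped (returned as None) instead of being partially transcribed, so that the phone sequences fed to the language model tool stay clean. The resulting lines, one per sentence, would then be the input text for cmuclmtk or SRILM.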

       
  • iridium

    iridium - 2015-08-29

    OK, thank you very much.

     
