Menu

A problem in using Chinese language model on pocketshpinx

Help
midori
2013-04-03
2013-04-05
  • midori

    midori - 2013-04-03

    I want to use the zh_broadcastnews_64000_utf8.DMP language model in pocketsphinx, but I recognized the result could not be normally presented.I'm using it on windows .Should I convert the utf8 format to GBK foramt?
    If true, should I use the sphinx_lm_convert tool to change it to lm file and convert it? but when I try to convert,it would get stuck in the progress meanwhile the CPU keeping busy and the lm file doesn't increase any more.
    Beg for your help sincerely,thanks!

     
  • Nickolay V. Shmyrev

    hould I convert the utf8 format to GBK foramt?

    Yes

    If true, should I use the sphinx_lm_convert tool to change it to lm file and convert it?

    No, you can just convert the output

     
    • midori

      midori - 2013-04-03

      How to convert the DMP file directly?

       
    • midori

      midori - 2013-04-04

      No, you can just convert the output

      Could you give me more details? It look like I cannot convert the encode of DMP file directly in programs like notepad++

       
  • dovark

    dovark - 2013-04-03

    To convert DMP to ARPA, run
    sphinx_lm_convert -i /path/to/DMP -ifmt DMP -o /path/to/LM -ofmt ARPA

     
    • midori

      midori - 2013-04-04

      But using the command you given there will be a phenomenon like I have mentioned above ..the ARPA file cannot be generated completely

       
  • dovark

    dovark - 2013-04-04

    I just now did the same and an ARPA file was successfully generated. It took some time, but the file was created. Get more RAM maybe.

     
    • midori

      midori - 2013-04-05

      Thanks! I did it in sphinx_lm_convert the release version before .Now I have tried it in debug version and it works successfully now. Thanks for your help !

       

Log in to post a comment.

Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.