Menu

How to rebuild the corpus from a language model (Reverse Engineering)

Hemu
2014-08-12
2014-08-12
  • Hemu

    Hemu - 2014-08-12

    I had a Language Model (LM) from that I want to rebuild the corpus file which is used to built the LM. Is there is any tools or way to do this reverse engineering ?

     
    • bic-user

      bic-user - 2014-08-12

      That's impossible since language model contains only info on "how often certain word goes after certain word". Though you can use langauge model as probabilistic automata to generate sentences. You can try that with

      ngram -gen

      from SRILM. Check this man: http://www.speech.sri.com/projects/srilm/manpages/ngram.1.html

       

Log in to post a comment.