Dear Sir,
I want to produce 7 gram language model for my experiment. The toolkit produces wngram and idngram but failes to produce lm.out. I am sharing my files here. Please give me suggestion.
http://www.4shared.com/file/230161477/5e1065ad/cmulmmodeltar.html
Actually I added this topic in wrong forum then I was suggested to create here. Sorry for multiple posts.
Since you are using 7-grams, you need to point 7 discount ranges if you want to point them at all:
-disc_ranges 1 1 1 1 1 1 1 1
Also, for subword LM I'd recommend to add marker to the morpheme than to make it separate word. It's more reasonable hack I think.
Log in to post a comment.
Dear Sir,
I want to produce 7 gram language model for my experiment. The toolkit
produces wngram and idngram but failes to produce lm.out. I am sharing my
files here. Please give me suggestion.
http://www.4shared.com/file/230161477/5e1065ad/cmulmmodeltar.html
Actually I added this topic in wrong forum then I was suggested to create
here. Sorry for multiple posts.
Since you are using 7-grams, you need to point 7 discount ranges if you want
to point them at all:
-disc_ranges 1 1 1 1 1 1 1 1
Also, for subword LM I'd recommend to add marker to the morpheme than to make
it separate word. It's more reasonable hack I think.