Hi
I trained a 1 GB text file using with SRILM. when I try to convert it to DMP
format i have a problem because of large size of n-grams. I want to use LIUM
in order to convert my model to binary format. From where can I download LIUM
and how to use it?
What is a general approach to training large text corpus. Please help me. Tnx
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Hi
I trained a 1 GB text file using with SRILM. when I try to convert it to DMP
format i have a problem because of large size of n-grams. I want to use LIUM
in order to convert my model to binary format. From where can I download LIUM
and how to use it?
What is a general approach to training large text corpus. Please help me. Tnx
http://cmusphinx.svn.sourceforge.net/viewvc/cmusphinx/branches/LIUM/
Checkout, read README, compile, use
You can also try to restrict the vocabulary size to 64k words.