From: Neal S. <sn...@st...> - 2006-05-19 19:46:28
infomap gurus:

I am trying to build an LSA model with the Arabic Gigaword corpus, which is actually about 4GB of text. However, I keep getting a "wordlist File size limit exceeded" error about half way through. I tried increasing the ROWS and COLUMNS parameters:

    infomap-build -D ROWS=1000000 -D COLUMNS=1000000 -m ./list.txt aragiga_full2

but that doesn't help. Do you have any suggestions?

Thanks!
Neal

PS: Here's the exact output:

    make: *** [/user/snider/scr/aragiga_full2/wordlist] File size limit exceeded
    make: *** Deleting file `/user/snider/scr/aragiga_full2/wordlist'

------
Neal Snider
Ph.D. Student
Department of Linguistics
Stanford University
Margaret Jacks Hall, Bldg 460 - Room 118
Stanford CA 94305-2150
(650) 723-4284; Fax: (650) 723-5666
http://www.stanford.edu/~snider