From: Rahul J. <rj...@ya...> - 2005-10-18 00:12:30
|
Hi: Few days ago I saw a question on this list relating to large corpus (large number of words). I have different but relevant questions: Is there a known limit for number of files in a multi-document corpus, expecting optimal performance? Is there a known break-down point? Does the uniformity (or lack of it) in the sizes of individual documents in a corpus affect the quality model? Thanks! Rahul. __________________________________ Yahoo! Mail - PC Magazine Editors' Choice 2005 http://mail.yahoo.com |