From: Michael S. <sta...@us...> - 2005-10-06 01:07:39
|
Update of /cvsroot/archive-access/archive-access/projects/nutch/conf In directory sc8-pr-cvs1.sourceforge.net:/tmp/cvs-serv1231/conf Modified Files: nutch-site.xml.all Log Message: * conf/nutch-site.xml.all Change value for indexer.maxMergeDocs. * src/web/search.jsp Chnage comments. Also force encoding. response.encodeURL will noop if already some encoding. Index: nutch-site.xml.all =================================================================== RCS file: /cvsroot/archive-access/archive-access/projects/nutch/conf/nutch-site.xml.all,v retrieving revision 1.1 retrieving revision 1.2 diff -C2 -d -r1.1 -r1.2 *** nutch-site.xml.all 25 Jul 2005 20:35:00 -0000 1.1 --- nutch-site.xml.all 6 Oct 2005 01:07:31 -0000 1.2 *************** *** 63,66 **** --- 63,81 ---- </property> + <property> + <name>indexer.maxMergeDocs</name> + <value>2147483647</value> + <description>This number determines the maximum number of Lucene + Documents to be merged into a new Lucene segment. Larger values + increase indexing speed and reduce the number of Lucene segments, + which reduces the number of open file handles; however, this also + increases RAM usage during indexing. + + Doug says: "There was a bogus value for indexer.maxMergeDocs in + nutch-default.xml which made indexing really slow. The correct + value is something really big (like Integer.MAX_VALUE)." + </description> + </property> + <!-- make summaries a little longer than the default --> <property> |