Update of /cvsroot/archive-access/archive-access/projects/nutch/conf
In directory sc8-pr-cvs1.sourceforge.net:/tmp/cvs-serv1231/conf
Modified Files:
nutch-site.xml.all
Log Message:
* conf/nutch-site.xml.all
Change value for indexer.maxMergeDocs.
* src/web/search.jsp
Change comments. Also force encoding. response.encodeURL will noop if
already some encoding.
Index: nutch-site.xml.all
===================================================================
RCS file: /cvsroot/archive-access/archive-access/projects/nutch/conf/nutch-site.xml.all,v
retrieving revision 1.1
retrieving revision 1.2
diff -C2 -d -r1.1 -r1.2
*** nutch-site.xml.all 25 Jul 2005 20:35:00 -0000 1.1
--- nutch-site.xml.all 6 Oct 2005 01:07:31 -0000 1.2
***************
*** 63,66 ****
--- 63,81 ----
</property>
+ <property>
+ <name>indexer.maxMergeDocs</name>
+ <value>2147483647</value>
+ <description>This number determines the maximum number of Lucene
+ Documents to be merged into a new Lucene segment. Larger values
+ increase indexing speed and reduce the number of Lucene segments,
+ which reduces the number of open file handles; however, this also
+ increases RAM usage during indexing.
+
+ Doug says: "There was a bogus value for indexer.maxMergeDocs in
+ nutch-default.xml which made indexing really slow. The correct
+ value is something really big (like Integer.MAX_VALUE)."
+ </description>
+ </property>
+
<!-- make summaries a little longer than the default -->
<property>