From: Michael S. <sta...@us...> - 2005-10-18 19:38:47
Update of /cvsroot/archive-access/archive-access/projects/nutch/xdocs
In directory sc8-pr-cvs1.sourceforge.net:/tmp/cvs-serv8042/xdocs

Modified Files:
    srcbuild.xml
Log Message:
* project.properties
Point at nutch-0.7.1. Thats what we'll release against.
* xdocs/srcbuild.xml
More edits of src build notes.

Index: srcbuild.xml
===================================================================
RCS file: /cvsroot/archive-access/archive-access/projects/nutch/xdocs/srcbuild.xml,v
retrieving revision 1.6
retrieving revision 1.7
diff -C2 -d -r1.6 -r1.7
*** srcbuild.xml 18 Oct 2005 02:20:55 -0000 1.6
--- srcbuild.xml 18 Oct 2005 19:34:53 -0000 1.7
***************
*** 23,28 ****
  <a href="http://www.apache.org/dyn/closer.cgi/lucene/nutch/">nutch downloads</a>.
! The below has been tested working using nutch 0.7.0 and 0.7.1. Revert to
! this version of Nutch if problems building (Nutchwax will not work with
  release 0.6 of Nutch). Unbundle the nutch release It usually untars as
  nutch-0.?.?. The build scripts are looking for 'nutch' in
--- 23,29 ----
  <a href="http://www.apache.org/dyn/closer.cgi/lucene/nutch/">nutch downloads</a>.
! The below has been tested working using nutch 0.7.0 and 0.7.1. Revert to
! either of these versions of Nutch if problems building (Nutchwax will not
! work with
  release 0.6 of Nutch). Unbundle the nutch release It usually untars as
  nutch-0.?.?. The build scripts are looking for 'nutch' in
***************
*** 32,50 ****
  If building against 0.7.1, you'll need to create the directory
  <literal>${NUTCH_HOME}/src/plugins/nutch-extensionpoints/src/java</literal>
! else the nutch ant build fails. You'll also have to update
! ${NUTCHWAX}/project.properties to point at the nutch 0.7.1 jar rather
! than at the 0.7.0 jar.
  </p>
! <p>Symlink ${NUTCHWAX}/nutch/conf/nutch-site.xml to
  ${NUTCHWAX}/conf/nutch-site.xml. Doing this, there is only one nutch-site.xml
  shared by core Nutch and by NutchWAX extensions.
! <pre> % cd ${NUTCHWAX}/nutch/conf
  % mv nutch-site.xml nutch-site.xml.original
! % ln -s ${NUTCHWAX}/conf/nutch-site.xml</pre>
! The version of nutch-site.xml that is in ${NUTCHWAX} has NutchWAX specific
! configuration overrides as well as hardcodings of collection names and the
! name
! of the archive host that holds archived pages. Edit these to suit your
  environment particularly 'archive.host' and 'archive.collection'. The NutchWAX
  search.jsp compounds these values to come up with the WAC URL
--- 33,51 ----
  If building against 0.7.1, you'll need to create the directory
  <literal>${NUTCH_HOME}/src/plugins/nutch-extensionpoints/src/java</literal>
! else the nutch ant build fails. You'll may also have to update
! ${NUTCHWAX}/project.properties to rename the corenutch jar if building
! against a nutch that is other than that which nutchwax is currently working
! against.
  </p>
! <p>Symlink <literal>${NUTCHWAX}/nutch/conf/nutch-site.xml.all</literal> to
  ${NUTCHWAX}/conf/nutch-site.xml. Doing this, there is only one nutch-site.xml
  shared by core Nutch and by NutchWAX extensions.
! <pre> % cd ${NUTCH_HOME}/nutch/conf
  % mv nutch-site.xml nutch-site.xml.original
! % ln -s ${NUTCHWAX}/conf/nutch-site.xml.all nutch-site.xml</pre>
! The <literal>nutch-site.xml.all</literal> that is in ${NUTCHWAX} has NutchWAX
! specific configuration overrides as well as hardcodings of collection names and
! the name of the archive host that holds archived pages. Edit these to suit your
  environment particularly 'archive.host' and 'archive.collection'. The NutchWAX
  search.jsp compounds these values to come up with the WAC URL