From: Michael S. <sta...@us...> - 2005-09-01 20:58:32
|
Update of /cvsroot/archive-access/archive-access/projects/nutch/xdocs In directory sc8-pr-cvs1.sourceforge.net:/tmp/cvs-serv26986/xdocs Modified Files: gettingstarted.xml Log Message: * xdocs/gettingstarted.xml Edits from Sverre. Index: gettingstarted.xml =================================================================== RCS file: /cvsroot/archive-access/archive-access/projects/nutch/xdocs/gettingstarted.xml,v retrieving revision 1.8 retrieving revision 1.9 diff -C2 -d -r1.8 -r1.9 *** gettingstarted.xml 29 Jul 2005 22:12:23 -0000 1.8 --- gettingstarted.xml 1 Sep 2005 20:58:24 -0000 1.9 *************** *** 28,33 **** indexing step. It takes a bunch of options. To do the most basic indexing operation, point it a few ARC files and let it run: ! <pre>% ./bin/indexarcs -s ${HOME}/arcs/ -d ${HOME}/nutch-data</pre> ! This will build an index for you in <code>${HOME}/nutch-data</code>. </p> <p> --- 28,36 ---- indexing step. It takes a bunch of options. To do the most basic indexing operation, point it a few ARC files and let it run: ! <pre>% ./bin/indexarcs.sh -s ${HOME}/arcs/ -d ${HOME}/nutch-data -c COLLECTION_NAME</pre> ! This will build an index for you in <code>${HOME}/nutch-data</code> (-n says ! do not run the deduplication step -- necessary if you are using nutchwax ! with wera -- and the '-c' is the name of the collection the indexed ! content will belong to). </p> <p> |