Update of /cvsroot/archive-access/archive-access/projects/nutch/xdocs
In directory sc8-pr-cvs1.sourceforge.net:/tmp/cvs-serv26986/xdocs
Modified Files:
gettingstarted.xml
Log Message:
* xdocs/gettingstarted.xml
Edits from Sverre.
Index: gettingstarted.xml
===================================================================
RCS file: /cvsroot/archive-access/archive-access/projects/nutch/xdocs/gettingstarted.xml,v
retrieving revision 1.8
retrieving revision 1.9
diff -C2 -d -r1.8 -r1.9
*** gettingstarted.xml 29 Jul 2005 22:12:23 -0000 1.8
--- gettingstarted.xml 1 Sep 2005 20:58:24 -0000 1.9
***************
*** 28,33 ****
indexing step. It takes a bunch of options. To do the most basic indexing
operation, point it a few ARC files and let it run:
! <pre>% ./bin/indexarcs -s ${HOME}/arcs/ -d ${HOME}/nutch-data</pre>
! This will build an index for you in <code>${HOME}/nutch-data</code>.
</p>
<p>
--- 28,36 ----
indexing step. It takes a bunch of options. To do the most basic indexing
operation, point it a few ARC files and let it run:
! <pre>% ./bin/indexarcs.sh -s ${HOME}/arcs/ -d ${HOME}/nutch-data -c COLLECTION_NAME</pre>
! This will build an index for you in <code>${HOME}/nutch-data</code> (-n says
! do not run the deduplication step -- necessary if you are using nutchwax
! with wera -- and the '-c' is the name of the collection the indexed
! content will belong to).
</p>
<p>
|