Update of /cvsroot/archive-access/archive-access/projects/nutch/src/java/org/archive/access/nutch
In directory sc8-pr-cvs1.sourceforge.net:/tmp/cvs-serv7054/src/java/org/archive/access/nutch
Modified Files:
Arc2Segment.java
Log Message:
* project.properties
* src/articles/releasenotes.xml
* xdocs/srcbuild.xml
Revert to 0.7.0 nutch. 0.7.1 has problems.
* src/java/org/archive/access/nutch/Arc2Segment.java
If we fail parse, don't add to index (Should get rid of those
'no arcoffset', etc., messages we used to get when indexing).
* src/plugin/index-ia/src/java/org/archive/access/nutch/indexer/IaIndexingFilter.java
Don't warn if 'encoding' not present -- won't be present for many types.
Index: Arc2Segment.java
===================================================================
RCS file: /cvsroot/archive-access/archive-access/projects/nutch/src/java/org/archive/access/nutch/Arc2Segment.java,v
retrieving revision 1.29
retrieving revision 1.30
diff -C2 -d -r1.29 -r1.30
*** Arc2Segment.java 20 Aug 2005 00:09:36 -0000 1.29
--- Arc2Segment.java 18 Oct 2005 23:21:11 -0000 1.30
***************
*** 253,262 ****
LOG.info("Failed parse: " +
p.getData().getStatus().getMessage());
}
- // FetchList.append(fle);
- this.fetcher.append(fo);
- // Content.append(c);
- this.parseText.append(new ParseText(p.getText()));
- this.parseData.append(p.getData());
}
} catch (ParseException e) {
--- 253,264 ----
LOG.info("Failed parse: " +
p.getData().getStatus().getMessage());
+ // Don't add if failed parse.
+ } else {
+ // FetchList.append(fle);
+ this.fetcher.append(fo);
+ // Content.append(c);
+ this.parseText.append(new ParseText(p.getText()));
+ this.parseData.append(p.getData());
}
}
} catch (ParseException e) {