From: Michael S. <sta...@us...> - 2005-10-18 23:21:21
|
Update of /cvsroot/archive-access/archive-access/projects/nutch/src/java/org/archive/access/nutch In directory sc8-pr-cvs1.sourceforge.net:/tmp/cvs-serv7054/src/java/org/archive/access/nutch Modified Files: Arc2Segment.java Log Message: * project.properties * src/articles/releasenotes.xml * xdocs/srcbuild.xml Revert to 0.7.0 nutch. 0.7.1 has problems. * src/java/org/archive/access/nutch/Arc2Segment.java If we fail parse, don't add to index (Shouldd get rid of those no arcoffset, etc., messages we used get indexing). * src/plugin/index-ia/src/java/org/archive/access/nutch/indexer/IaIndexingFilter.java Don't warn if 'encoding' not present -- won't be present for many types. Index: Arc2Segment.java =================================================================== RCS file: /cvsroot/archive-access/archive-access/projects/nutch/src/java/org/archive/access/nutch/Arc2Segment.java,v retrieving revision 1.29 retrieving revision 1.30 diff -C2 -d -r1.29 -r1.30 *** Arc2Segment.java 20 Aug 2005 00:09:36 -0000 1.29 --- Arc2Segment.java 18 Oct 2005 23:21:11 -0000 1.30 *************** *** 253,262 **** LOG.info("Failed parse: " + p.getData().getStatus().getMessage()); } - // FetchList.append(fle); - this.fetcher.append(fo); - // Content.append(c); - this.parseText.append(new ParseText(p.getText())); - this.parseData.append(p.getData()); } } catch (ParseException e) { --- 253,264 ---- LOG.info("Failed parse: " + p.getData().getStatus().getMessage()); + // Don't add if failed parse. + } else { + // FetchList.append(fle); + this.fetcher.append(fo); + // Content.append(c); + this.parseText.append(new ParseText(p.getText())); + this.parseData.append(p.getData()); } } } catch (ParseException e) { |