Reported by Bjarne up on the list:
Dec 25, 2005 9:18:01 PM org.archive.crawler.extractor.Extractor innerProcess
WARNING: ExtractorJS: NullPointerException
java.lang.NullPointerException
    at org.archive.net.UURIFactory.create(UURIFactory.java:336)
    at org.archive.net.UURIFactory.getInstance(UURIFactory.java:285)
    at org.archive.crawler.datamodel.CrawlURI.createAndAddLinkRelativeToVia(CrawlURI.java:1183)
    at org.archive.crawler.extractor.ExtractorJS.considerStrings(ExtractorJS.java:152)
    at org.archive.crawler.extractor.ExtractorJS.extract(ExtractorJS.java:118)
    at org.archive.crawler.extractor.Extractor.innerProcess(Extractor.java:67)
    at org.archive.crawler.framework.Processor.process(Processor.java:103)
    at org.archive.crawler.framework.ToeThread.processCrawlUri(ToeThread.java:306)
    at org.archive.crawler.framework.ToeThread.run(ToeThread.java:153)
Any ideas?
Here is what I wrote back:
We're speculating that there's a link at this point in the
JavaScript. It looks like we're passing a null 'base'
into UURIFactory (see
http://crawler.archive.org/xref/org/archive/net/UURIFactory.html#336).
We should add a null check in UURIFactory, and probably in
ExtractorJS too, since it's in speculative mode (I opened an
issue). I suppose you have no idea how to reproduce it,
since we're not logging the page we found the NPE on?
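
For illustration only, here is a rough sketch of the kind of null check suggested above. It is not the Heritrix code: the class NullBaseGuard and its methods resolve and considerStrings are hypothetical stand-ins, and java.net.URI is used in place of UURI so the example is self-contained. The idea is that the factory-like method rejects a null base with a descriptive checked exception instead of an NPE, and the speculative caller skips quietly when it has no base or a candidate does not parse.

```java
import java.net.URI;
import java.net.URISyntaxException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-in for the UURIFactory / ExtractorJS pair; not Heritrix code.
class NullBaseGuard {

    // Resolve a candidate link against a base URI.
    // A null base or link is reported as a URISyntaxException rather than an NPE.
    public static URI resolve(URI base, String relative) throws URISyntaxException {
        if (base == null) {
            throw new URISyntaxException(String.valueOf(relative),
                    "no base URI to resolve against");
        }
        if (relative == null) {
            throw new URISyntaxException("null", "no link to resolve");
        }
        return base.resolve(new URI(relative));
    }

    // Speculative extraction: try every candidate string, keep what resolves.
    // Failures are expected here, so they are skipped rather than logged as warnings.
    public static List<URI> considerStrings(URI base, List<String> candidates) {
        List<URI> found = new ArrayList<>();
        if (base == null) {
            return found; // nothing to resolve against, so nothing to add
        }
        for (String candidate : candidates) {
            try {
                found.add(resolve(base, candidate));
            } catch (URISyntaxException e) {
                // Not a usable link; ignore it in speculative mode.
            }
        }
        return found;
    }

    public static void main(String[] args) {
        URI base = URI.create("http://example.com/a/page.html");
        List<String> candidates = Arrays.asList("b.js", "not a uri");
        System.out.println(considerStrings(base, candidates));
        System.out.println(considerStrings(null, candidates));
    }
}
```

Checking in both places is deliberate in this sketch: the factory-side check protects every caller, while the extractor-side check keeps a speculative miss from showing up as a warning in the logs.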
Karl Thiessen
Extraction
1.8.0
Public
Date: 2007-03-14 01:04
Date: 2006-05-05 00:17
Date: 2006-01-02 23:55
| Field | Old Value | Date | By |
|---|---|---|---|
| status_id | Open | 2006-05-05 00:17 | karl-ia |
| close_date | - | 2006-05-05 00:17 | karl-ia |
| resolution_id | None | 2006-01-02 23:55 | gojomo |
| assigned_to | gojomo | 2006-01-02 23:55 | gojomo |
| priority | 5 | 2006-01-02 23:46 | gojomo |
| assigned_to | nobody | 2006-01-02 23:46 | gojomo |
| artifact_group_id | None | 2006-01-02 23:46 | gojomo |