Share

Heritrix: Internet Archive Web Crawler

Tracker: Bugs

9 npe in extractorjs doing broad crawl w/ HEAD - ID: 1170562
Last Update: Comment added ( karl-ia )

Title: Problem occured processing
'http://www.cs.tcd.ie/Donal.OMahony/dublin.parms'
Time: Mar. 25, 2005 06:45:58 GMT
Level: SEVERE
Message:

Problem java.lang.NullPointerException occured when
trying to process
'http://www.cs.tcd.ie/Donal.OMahony/dublin.parms' at
step ABOUT_TO_BEGIN_PROCESSOR in ExtractorJS


Associated Throwable: java.lang.NullPointerException

Stacktrace:
java.lang.NullPointerException
at
org.archive.crawler.extractor.ExtractorJS.innerProcess(ExtractorJS.java:89)

at
org.archive.crawler.framework.Processor.process(Processor.java:103)
at
org.archive.crawler.framework.ToeThread.processCrawlUri(ToeThread.java:273)

at
org.archive.crawler.framework.ToeThread.run(ToeThread.java:143)


Michael Stack ( stack-sf ) - 2005-03-25 15:24

9

Closed

Fixed

Nobody/Anonymous

None

None

Public


Comments ( 4 )

Date: 2007-03-14 00:22
Sender: karl-ia


This issue is now discussed in the new JIRA tracker at
http://webteam.archive.org/jira/browse/HER-380 -- please add further
comments at that location.


Date: 2005-04-01 01:19
Sender: stack-sfProject Admin

Logged In: YES
user_id=924942

Fixed. Closing. Below is commit.
Fix for '[ 1170562 ] npe in extractorjs doing broad crawl w/
HEAD'
* src/java/org/archive/crawler/extractor/ExtractorJS.java
Allow that viaContext can be null (e.g. on seeds).



Date: 2005-03-31 20:00
Sender: ia_igorProject Admin

Logged In: YES
user_id=715474

To reproduce the problem use
http://www.army.mod.uk/linked_files/1pwo/career_anim.swf
as the seed + default profile.




Date: 2005-03-31 19:47
Sender: stack-sfProject Admin

Logged In: YES
user_id=924942

Stops seeds being crawled (Observed by Dan and Igor).


Attached File

No Files Currently Attached

Changes ( 4 )

Field Old Value Date By
status_id Open 2005-04-01 01:19 stack-sf
resolution_id None 2005-04-01 01:19 stack-sf
close_date - 2005-04-01 01:19 stack-sf
priority 5 2005-03-31 19:47 stack-sf