Share

Heritrix: Internet Archive Web Crawler

Tracker: Bugs

4 CrawlStateUpdater - NullPointerException - ID: 931565
Last Update: Comment added ( karl-ia )

Hello,

Im trying to use heritrix on a windows plattform.
Whenever i submit a job via the web interface i get an
error - here is the log (alert) message :



Title: RuntimeException occured
processing 'dns:www.db.com'
Time: Apr. 8, 2004 07:28:13 GMT
Level: SEVERE
Message: The following RuntimeException occure when
trying to process 'dns:www.db.com'


Associated Throwable: java.lang.NullPointerException

Stacktrace:
java.lang.NullPointerException
at
org.archive.crawler.basic.CrawlStateUpdater.innerProcess
(CrawlStateUpdater.java:57)
at
org.archive.crawler.framework.Processor.process
(Processor.java:101)
at
org.archive.crawler.framework.ToeThread.processCrawlUri
(ToeThread.java:169)
at org.archive.crawler.framework.ToeThread.run
(ToeThread.java:124)



I looked into the CrawlStateUpdater source code that
says:

if (!curi.getServer().getHost().hasBeenLookedUp())

it seems that getServer() returns a null-value.

When I start the selftest or even build a new job upon a
selftest the system works fine.
When is build a new job based on a selftest and change
the seed url (i tried both, text and ip-number) the job
crashes as soon as i try to submit it. meaning i dont
even get as far as to start the job.
(The same happens of course when i'm trying to create
my own job)


my email address is thomas.schmegg@gmx.de


Nobody/Anonymous ( nobody ) - 2004-04-08 07:47

4

Closed

Fixed

Michael Stack

General

0.6.0

Public


Comments ( 3 )

Date: 2007-03-14 00:09
Sender: karl-ia


This issue is now discussed in the new JIRA tracker at
http://webteam.archive.org/jira/browse/HER-106 -- please add further
comments at that location.


Date: 2004-10-08 19:06
Sender: stack-sfProject Admin

Logged In: YES
user_id=924942

Fixed. Below is commit. Closing.

Fix for [ 931565 ] CrawlStateUpdater - NullPointerException
*
src/java/org/archive/crawler/postprocessor/CrawlStateUpdater.java
Added test for null server. Since we now test for
scheme too before
we get the server, its even less likely we'll run across
this problem.



Date: 2004-07-07 21:30
Sender: gojomoProject Admin

Logged In: YES
user_id=144912

Can't reproduce here --- contact reporter, see if he still
has problem?


Attached File

No Files Currently Attached

Changes ( 6 )

Field Old Value Date By
status_id Open 2004-10-08 19:06 stack-sf
resolution_id None 2004-10-08 19:06 stack-sf
close_date - 2004-10-08 19:06 stack-sf
priority 5 2004-07-07 21:30 gojomo
assigned_to ia_igor 2004-07-07 21:30 gojomo
assigned_to nobody 2004-04-13 18:55 gojomo