(for post-1.4.0)
Bad URIs with trailing garbage can cause FetchDNS to
throw a NullPointerException:
Problem java.lang.NullPointerException occured when trying to process 'dns:www.treasurequest.com.%0A%3CBR' at step PROCESSING in DNS
Associated Throwable: java.lang.NullPointerException
Stacktrace:
java.lang.NullPointerException
    at java.util.regex.Matcher.getTextLength(Matcher.java:1127)
    at java.util.regex.Matcher.reset(Matcher.java:284)
    at java.util.regex.Matcher.<init>(Matcher.java:205)
    at java.util.regex.Pattern.matcher(Pattern.java:879)
    at org.archive.crawler.fetcher.FetchDNS.innerProcess(FetchDNS.java:114)
    at org.archive.crawler.framework.Processor.process(Processor.java:103)
    at org.archive.crawler.framework.ToeThread.processCrawlUri(ToeThread.java:282)
    at org.archive.crawler.framework.ToeThread.run(ToeThread.java:151)
The reason is that for the URI
'dns:www.treasurequest.com.%0A%3CBR',
UURI.getReferencedHost() returns null.
Therefore, either FetchDNS should check whether null is
returned, or UURI.getReferencedHost() should throw a
URIException in that case.
However, getReferencedHost's javadoc says nothing about
'null' return values, so the latter option should be
preferred, in my opinion.
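The two options can be sketched roughly as follows. This is only an illustration, not the actual Heritrix code: the class DnsHostGuard, the IPV4 pattern, the method names and MissingHostException (standing in for URIException) are all hypothetical, and the host string is passed in directly rather than obtained from a real UURI.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical stand-in for the relevant part of FetchDNS; not Heritrix code.
final class DnsHostGuard {

    // Assumed pattern for illustration; the real FetchDNS pattern is not shown in this report.
    private static final Pattern IPV4 =
        Pattern.compile("\\d{1,3}\\.\\d{1,3}\\.\\d{1,3}\\.\\d{1,3}");

    // Hypothetical checked exception playing the role of URIException.
    static final class MissingHostException extends Exception {
        MissingHostException(String message) {
            super(message);
        }
    }

    // Option 1: the caller checks for null before handing the host to Pattern.matcher().
    static String classifyHost(String host) {
        if (host == null) {
            // A bad URI such as 'dns:www.treasurequest.com.%0A%3CBR' would end up here.
            return "unresolvable";
        }
        Matcher m = IPV4.matcher(host);
        return m.matches() ? "ip-literal" : "hostname";
    }

    // Option 2: the accessor refuses to return null and throws instead.
    static String requireHost(String host) throws MissingHostException {
        if (host == null) {
            throw new MissingHostException("URI has no referenced host");
        }
        return host;
    }

    public static void main(String[] args) throws MissingHostException {
        System.out.println(classifyHost(null));
        System.out.println(classifyHost("www.treasurequest.com"));
        System.out.println(requireHost("www.treasurequest.com"));
    }
}
```

With option 2 the failure surfaces at the point where the host is requested, so callers that forget the check get a descriptive exception rather than a NullPointerException deep inside java.util.regex.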
Nobody/Anonymous
None
1.6.0
Public
Date: 2007-03-14 00:22
Date: 2005-09-22 19:41
Date: 2005-04-22 16:19
| Field | Old Value | Date | By |
|---|---|---|---|
| artifact_group_id | None | 2005-09-23 18:01 | gojomo |
| status_id | Open | 2005-09-22 19:41 | gojomo |
| resolution_id | None | 2005-09-22 19:41 | gojomo |
| close_date | - | 2005-09-22 19:41 | gojomo |