Heritrix: Internet Archive Web Crawler

Tracker: Bugs

DomainSensitiveFrontier broken by uri-included-structure - ID: 1226365
Last Update: Comment added ( karl-ia )

Bjarne notes the problem here:
http://groups.yahoo.com/group/archive-crawler/message/1988


Submitted: Michael Stack ( stack-sf ) - 2005-06-23 16:27
Priority: 5
Status: Closed
Resolution: Fixed
Assigned: Karl Thiessen
Category: None
Group: 1.6.0
Visibility: Public


Comments ( 2 )

Date: 2007-03-14 00:56
Sender: karl-ia


This issue is now discussed in the new JIRA tracker at
http://webteam.archive.org/jira/browse/HER-450 -- please add further
comments at that location.


Date: 2005-06-23 16:38
Sender: stack-sf (Project Admin)

Commit message below.

Assigning to Karl to verify.

HOW TO VERIFY FIX:

This fix is hard to test: to see the issue you need a build
made after the commit of the bloom filter changes but before
this commit. Here is such a build:
http://crawltools.archive.org:8080/cruisecontrol/artifacts/HEAD-heritrix/20050623090223/heritrix-1.5.0-200506230903.tar.gz

With that build, verify that attempts to set up a crawl with
DomainSensitiveFrontier (DSF) throw FatalConfig. exceptions.
Subsequent builds should not have this problem.

Fix for '[ 1226365 ] DomainSensitiveFrontier broken by
uri-included-structure'
* src/java/org/archive/crawler/frontier/BdbFrontier.java
Don't fail initialization if uri-included-structure
attribute not found.
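
For illustration only, here is a minimal sketch of the general pattern the
commit message describes: treat a missing attribute as "use the default"
rather than as a fatal initialization error. This is not the actual
BdbFrontier code. The Settings class, the AttributeNotFound exception, the
resolveIncludedStructure method and the default value are all hypothetical
stand-ins; only the attribute name uri-included-structure comes from this
report.

```java
import java.util.HashMap;
import java.util.Map;

public class OptionalAttributeSketch {

    /** Hypothetical stand-in for a "setting is not defined" error. */
    static class AttributeNotFound extends Exception {
        AttributeNotFound(String name) {
            super("attribute not found: " + name);
        }
    }

    /** Hypothetical settings holder; NOT the Heritrix settings API. */
    static class Settings {
        private final Map<String, Object> values = new HashMap<>();

        void set(String name, Object value) {
            values.put(name, value);
        }

        Object getAttribute(String name) throws AttributeNotFound {
            if (!values.containsKey(name)) {
                throw new AttributeNotFound(name);
            }
            return values.get(name);
        }
    }

    // Attribute name taken from the bug title.
    static final String ATTR_INCLUDED = "uri-included-structure";
    // Placeholder default, assumed for this sketch only.
    static final String DEFAULT_STRUCTURE = "default-structure";

    /**
     * Returns the configured structure, or the default when the attribute
     * is absent, so that initialization can continue instead of failing.
     */
    static String resolveIncludedStructure(Settings settings) {
        try {
            return (String) settings.getAttribute(ATTR_INCLUDED);
        } catch (AttributeNotFound e) {
            // A frontier that does not define the attribute is tolerated.
            return DEFAULT_STRUCTURE;
        }
    }

    public static void main(String[] args) {
        Settings withoutAttr = new Settings();
        System.out.println(resolveIncludedStructure(withoutAttr));

        Settings withAttr = new Settings();
        withAttr.set(ATTR_INCLUDED, "custom-structure");
        System.out.println(resolveIncludedStructure(withAttr));
    }
}
```

The point of the sketch is the catch block: a frontier whose settings lack
the attribute still initializes, which matches the behavior the commit
message asks for.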




Attached File

No Files Currently Attached

Changes ( 5 )

Field              Old Value   Date               By
artifact_group_id  None        2005-09-23 18:02   gojomo
resolution_id      None        2005-09-23 17:58   gojomo
status_id          Open        2005-06-23 16:38   stack-sf
assigned_to        nobody      2005-06-23 16:38   stack-sf
close_date         -           2005-06-23 16:38   stack-sf