
Heritrix: Internet Archive Web Crawler

Tracker: Bugs

Corrupt job.state files obstruct crawl resumption - ID: 1419272
Last Update: Comment added (karl-ia)

Crawlers that crash in archiveit can leave behind empty job.state
files. These empty files get in the way of a crawler restart.


Submitted: Michael Stack (stack-sf) - 2006-01-30 22:37
Priority: 5
Status: Closed
Resolution: Fixed
Assigned: Michael Stack
Category: configuration
Group: 1.8.0
Visibility: Public


Comments (2)

Date: 2007-03-14 01:04
Sender: karl-ia


This issue is now discussed in the new JIRA tracker at
http://webteam.archive.org/jira/browse/HER-539 -- please add further
comments at that location.


Date: 2006-01-30 22:41
Sender: stack-sf (Project Admin)


Fix for '[ 1419272 ] Corrupt job.state files obstruct crawl resumption'
* src/java/org/archive/crawler/admin/CrawlJob.java
  Check file length of job.state before using (can be empty if the
  crawler crashed).
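
The check described in the commit note can be illustrated roughly as follows. This is only a minimal sketch of the idea (test the file length before trusting job.state), not the actual change made to CrawlJob.java; the class name JobStateCheck, both method names, and the fallback-status parameter are hypothetical and chosen for illustration.

```java
import java.io.File;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;

// Hypothetical helper, not part of Heritrix: illustrates guarding against
// an empty job.state file left behind by a crashed crawler.
class JobStateCheck {

    // A state file is only worth reading if it exists, is a regular file,
    // and has a non-zero length.
    static boolean isUsableStateFile(File stateFile) {
        return stateFile != null && stateFile.isFile() && stateFile.length() > 0;
    }

    // Returns the trimmed content of the state file, or the caller-supplied
    // fallback when the file is missing, empty, or contains only whitespace.
    static String readStateOrDefault(File stateFile, String fallback) throws IOException {
        if (!isUsableStateFile(stateFile)) {
            return fallback;
        }
        byte[] raw = Files.readAllBytes(stateFile.toPath());
        String text = new String(raw, StandardCharsets.UTF_8).trim();
        return text.isEmpty() ? fallback : text;
    }
}
```

A caller resuming a job might use something like JobStateCheck.readStateOrDefault(new File(jobDir, "job.state"), "UNKNOWN"), where jobDir and the "UNKNOWN" status string are likewise assumptions, so that an empty file no longer blocks the restart.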





Changes (5)

Field              Old Value                Date              By
artifact_group_id  None                     2006-03-17 19:57  gojomo
status_id          Open                     2006-01-30 22:41  stack-sf
resolution_id      None                     2006-01-30 22:41  stack-sf
summary            Corrupt job.state files  2006-01-30 22:41  stack-sf
close_date         -                        2006-01-30 22:41  stack-sf