Share

Heritrix: Internet Archive Web Crawler

Tracker: Feature Requests

5 exponential backoff URI/host retries - ID: 925734
Last Update: Comment added ( karl-ia )

Especially when a URI connection fails, we may not want
to use a constant-length retry-delay, but rather one
that grows with each failure. This could be achived by
providing a factor which is raised to the power of the
number of retries, that is multiplied by the 'initial'
retry-delay.

That is, delay = base_delay *
(backoff_factor^consecutive_retries).

A typical value for base_delay might be 1 minute; a
typical factor for backoff_factor might be 2. Then
retries would occur after 1 minute, then another 2,
then another 4, then another 8, etc.

When a site is really down for many hours (or a day or
more), this sort of patience could be better than
considering a URI a failure after only an hour or two
of retrying.


Gordon Mohr ( gojomo ) - 2004-03-30 01:20

5

Closed

None

Gordon Mohr

Network/Protocols

None

Public


Comments ( 2 )

Date: 2007-03-14 01:28
Sender: karl-ia


This issue is now discussed in the new JIRA tracker at
http://webteam.archive.org/jira/browse/HER-745 -- please add further
comments at that location.


Date: 2004-06-03 22:08
Sender: gojomoProject Admin

Logged In: YES
user_id=144912

I have code to implement this, but it risks operator
frustration unless there's a way to force a faster retry
when the backoff has reached long periods (hours or days)
and the site is known to be back up.


Attached File

No Files Currently Attached

Changes ( 4 )

Field Old Value Date By
status_id Open 2004-06-03 22:08 gojomo
close_date - 2004-06-03 22:08 gojomo
assigned_to johnerik 2004-06-03 19:09 gojomo
assigned_to nobody 2004-03-30 23:34 gojomo