Heritrix: Internet Archive Web Crawler

Tracker: Feature Requests

5 [contrib] Show estimated remaining time - ID: 1196602
Last Update: Comment added ( karl-ia )

I am often curious how long the current crawl will
take. At the start of a crawl this is unforeseeable
in many cases, but once about 50% has been crawled
it is usually easy to estimate.

This patch shows the estimated remaining time directly
below the run time. The estimate is a simple calculation
from the run time and the download/queue ratio.


Christian
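
A minimal sketch of how such an estimate could be computed, assuming the "download/queue ratio" means the number of URIs already downloaded versus the number still queued. The attached patch is not reproduced here, so the class name, method names, and the exact formula below are hypothetical illustrations and not the code that went into StatisticsTracker.java.

```java
// Hypothetical helper; not taken from remainTime.patch or from Heritrix.
final class RemainingTimeEstimate {
    private RemainingTimeEstimate() {}

    /**
     * Estimates the remaining crawl time in milliseconds.
     * Assumption: the average time per downloaded URI so far also
     * holds for every URI that is still queued.
     * Returns -1 when no estimate is possible (nothing downloaded yet
     * or invalid input).
     */
    static long estimateRemainingMillis(long elapsedMillis, long downloaded, long queued) {
        if (elapsedMillis < 0 || downloaded <= 0 || queued < 0) {
            return -1L;
        }
        // remaining = elapsed * (queued / downloaded)
        double millisPerUri = (double) elapsedMillis / downloaded;
        return Math.round(millisPerUri * queued);
    }

    /** Formats a duration as e.g. "1h00m00s", or "n/a" for a negative value. */
    static String format(long millis) {
        if (millis < 0) {
            return "n/a";
        }
        long seconds = millis / 1000;
        return String.format("%dh%02dm%02ds",
                seconds / 3600, (seconds % 3600) / 60, seconds % 60);
    }
}
```

With one hour elapsed, 5,000 URIs downloaded and 5,000 still queued, `estimateRemainingMillis(3_600_000L, 5_000L, 5_000L)` returns 3,600,000, which `format` renders as `1h00m00s`. Because the queue keeps growing while new links are discovered, an estimate of this kind is only meaningful once the queue has stabilised, which matches the observation above that early estimates are unreliable.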


Submitted: Christian Kohlschütter ( ck-heritrix ) - 2005-05-06 14:19
Priority: 5
Status: Closed
Resolution: None
Assigned: Michael Stack
Category: None
Group: 1.6.0
Visibility: Public


Comments ( 2 )

Date: 2007-03-14 01:41
Sender: karl-ia


This issue is now discussed in the new JIRA tracker at
http://webteam.archive.org/jira/browse/HER-925 -- please add further
comments at that location.


Date: 2005-05-07 03:15
Sender: stack-sf (Project Admin)

Applied. Closing. Commit msg below.

Thanks for the patch Christian. Nice feature.

Applied patch '[ 1196602 ] [contrib] Show estimated
remaining time'
Contributed by Christian Kohlschuetter. Tested by St.Ack
Here is comment from Christian:

I am often curious how long the current crawl would
take. Of course, in the beginning, this is
unforeseeable in many cases. But after crawling about
50% it usually is easy to estimate.

This patch shows that estimated remaining time directly
below the run time. The estimate is based on a simple
calculation of runtime and download/queue ratio.

St.Ack actually moved the display to the right so the two
columns of stats are balanced.
* src/java/org/archive/crawler/admin/StatisticsTracker.java
Added 'Remaining (estimated)' statistic. Formatting.
(totalCount): Added.


Attached File ( 1 )

Filename          Description
remainTime.patch  Patch: show the remaining time of the current job

Changes ( 5 )

Field              Old Value                 Date              By
artifact_group_id  None                      2005-09-23 21:08  gojomo
close_date         -                         2005-05-07 03:15  stack-sf
status_id          Open                      2005-05-07 03:15  stack-sf
assigned_to        nobody                    2005-05-07 03:15  stack-sf
File Added         133267: remainTime.patch  2005-05-06 14:19  ck-heritrix