A crawl that had been progressing well suddenly
stopped making progress about 75 minutes in. There
were no alerts, and the runtime-errors and
local-errors logs did not contain any relevant errors.
Looking through heritrix_out.log, a flurry of the
following exceptions could be found at the time the
crawl stopped making progress:
----
java.util.NoSuchElementException
at org.archive.util.DiskQueue.dequeue()Ljava.lang.Object;(DiskQueue.java:181)
at org.archive.util.DiskBackedDeque.backingDequeue()Ljava.lang.Object;(DiskBackedDeque.java:114)
at org.archive.util.DiskBackedQueue.dequeue()Ljava.lang.Object;(DiskBackedQueue.java:104)
at org.archive.crawler.frontier.KeyedQueue.dequeue()Lorg.archive.crawler.datamodel.CrawlURI;(Optimized Method)
at org.archive.crawler.frontier.Frontier.dequeueFromReady()Lorg.archive.crawler.datamodel.CrawlURI;(Optimized Method)
at org.archive.crawler.frontier.Frontier.next()Lorg.archive.crawler.datamodel.CrawlURI;(Optimized Method)
at org.archive.crawler.frontier.Frontier.next(I)Lorg.archive.crawler.datamodel.CrawlURI;(Frontier.java:573)
at org.archive.crawler.framework.ToeThread.run()V(ToeThread.java:135)
at java.lang.Thread.startThreadFromVM(Ljava.lang.Thread;)V(Unknown Source)
----
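For anyone trying to confirm a similar flurry in their own heritrix_out.log, a small tally of exception header lines can show how many were thrown and of which type. The following is only a rough sketch and not part of Heritrix: the class name ExceptionTally, its tally method, and the pattern used to recognise an exception header line are all assumptions made for illustration.

```java
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of Heritrix: counts exception header lines in a log.
class ExceptionTally {
    // Assumption: an exception header line starts with a fully qualified class
    // name ending in "Exception" or "Error", optionally followed by a message.
    private static final Pattern EXCEPTION_LINE = Pattern.compile(
        "^\\s*((?:[a-zA-Z_][\\w$]*\\.)+[A-Z][\\w$]*(?:Exception|Error))\\b.*$");

    // Returns exception class name -> number of header lines seen,
    // in order of first appearance.
    static Map<String, Integer> tally(List<String> lines) {
        Map<String, Integer> counts = new LinkedHashMap<>();
        for (String line : lines) {
            Matcher m = EXCEPTION_LINE.matcher(line);
            if (m.matches()) {
                counts.merge(m.group(1), 1, Integer::sum);
            }
        }
        return counts;
    }

    public static void main(String[] args) throws Exception {
        // args[0] is the path to the log file, e.g. heritrix_out.log
        List<String> lines = Files.readAllLines(Paths.get(args[0]));
        tally(lines).forEach((name, n) -> System.out.println(n + "\t" + name));
    }
}
```

Stack frame lines such as those above are skipped because they begin with "at" or with a method descriptor rather than an exception class name, so only the header line of each trace is counted.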
The crawl was a 'wide' crawl of 1000 .is domains using
a custom scope based on the Domain scope (I cannot
see any connection between the scope and this error).
It had covered approx. 230,000 documents (with 310,000
waiting) when this occurred. The Frontier report did
not contain any hint that anything was amiss.
Gordon Mohr
Disk I/O
None
Public
Date: 2007-03-14 00:13
Date: 2004-10-04 18:55
Date: 2004-09-30 23:37
Date: 2004-09-30 00:46
Date: 2004-09-29 20:45
Date: 2004-09-28 18:34
Date: 2004-06-30 19:24
Date: 2004-06-16 14:45
Date: 2004-06-16 14:17
Date: 2004-06-16 13:16
Date: 2004-06-15 17:32
Date: 2004-06-15 17:14
Date: 2004-06-15 15:03
| Field | Old Value | Date | By |
|---|---|---|---|
| close_date | 2004-06-30 19:24 | 2004-10-04 18:55 | stack-sf |
| status_id | Open | 2004-10-04 18:55 | stack-sf |
| resolution_id | Accepted | 2004-10-04 18:55 | stack-sf |
| resolution_id | Fixed | 2004-09-28 18:34 | stack-sf |
| status_id | Closed | 2004-09-28 18:34 | stack-sf |
| status_id | Open | 2004-06-30 19:24 | gojomo |
| close_date | - | 2004-06-30 19:24 | gojomo |
| resolution_id | None | 2004-06-30 19:24 | gojomo |