In 1.0.x, the thread killing process creates serious
problems; it assumes threads are always at their
serialnumber position in the toes list, which isn't
true after any kills. Also, the setSize() method, which
creates new threads if the current size is less than
the newly set size, assumes it can start numbering the
new threads just above the size of the current toes.
This can lead to (the last few) numbers being reused,
which is disasterous for the use of
recording-scratch-storage (the tt## files) -- more than
one thread is trying to use the same files.
The result is error stacks in heritrix_out like the
following two examples:
10/21/2004 13:19:43 -0700 WARNING
org.archive.util.DevUtils warnHandle Gap between
expected and actual: 681404
#198
http://www.dau.mil/conferences/presentations/2003/presentations/T1-FiscalLa
w-RexBragaw.pdf
(0 attempts)
XXRELLXLLL
Current processor: Archiver
ACTIVE for 10s760ms
Where: ABOUT_TO_BEGIN_PROCESSOR
java.lang.Throwable: Gap between expected and actual:
681404
#198
http://www.dau.mil/conferences/presentations/2003/presentations/T1-FiscalLa
w-RexBragaw.pdf
(0 attempts)
XXRELLXLLL
Current processor: Archiver
ACTIVE for 10s760ms
Where: ABOUT_TO_BEGIN_PROCESSOR
at
org.archive.io.arc.ARCWriter.write(ARCWriter.java(Compiled
Code))
at
org.archive.crawler.writer.ARCWriterProcessor.writeHttp(ARCWriterProcessor.
java(Compiled
Code))
at
org.archive.crawler.writer.ARCWriterProcessor.innerProcess(ARCWriterProcess
or.java(Compiled
Code))
at
org.archive.crawler.framework.Processor.process(Processor.java(Compiled
Code))
at
org.archive.crawler.framework.ToeThread.processCrawlUri(ToeThread.java(Comp
iled
Code))
at
org.archive.crawler.framework.ToeThread.run(ToeThread.java(Compiled
Code))
---- or -----
10/21/2004 15:29:01 -0700 SEVERE
org.archive.io.ReplayCharSequenceFactory$ByteReplayCharSequence
loadBuffer
raFile.seekraFile.readFully(wraparoundBuffer,0,65536)
raFile.length()0
#197
https://www.caps.navsea.navy.mil/caps/caps.nsf/f1bf587d9f0a49c7852568a10057
8c56/d3c34484c393a5d98525679f0053
LLLRLLRLRELLL
Current processor: ExtractorHTML
ACTIVE for 24s144ms
Where: ABOUT_TO_BEGIN_PROCESSOR
java.io.EOFException
at
java.io.RandomAccessFile.readFully(RandomAccessFile.java(Compiled
Code))
at
org.archive.io.ReplayCharSequenceFactory$ByteReplayCharSequence.loadBuffer(
ReplayCharSequenceFactory.java(Co
at
org.archive.io.ReplayCharSequenceFactory$ByteReplayCharSequence.recenterBuf
fer(ReplayCharSequenceFactory.jav
at
org.archive.io.ReplayCharSequenceFactory$ByteReplayCharSequence.faultCharAt
(ReplayCharSequenceFactory.java(C
at
org.archive.io.ReplayCharSequenceFactory$ByteReplayCharSequence.charAt(Repl
ayCharSequenceFactory.java(Compil
at
java.util.regex.Pattern$Ctype.match(Pattern.java(Compiled
Code))
at
java.util.regex.Pattern$Curly.match(Pattern.java(Compiled
Code))
at
java.util.regex.Pattern$GroupTail.match(Pattern.java(Compiled
Code))
at
java.util.regex.Pattern$GroupTail.match(Pattern.java(Compiled
Code))
at
java.util.regex.Pattern$Curly.match0(Pattern.java(Compiled
Code))
at
java.util.regex.Pattern$Curly.match(Pattern.java(Compiled
Code))
at
java.util.regex.Pattern$GroupHead.match(Pattern.java(Compiled
Code))
at
java.util.regex.Pattern$Branch.match(Pattern.java(Compiled
Code))
at
java.util.regex.Pattern$GroupHead.match(Pattern.java(Compiled
Code))
at
java.util.regex.Pattern$GroupHead.match(Pattern.java(Compiled
Code))
at
java.util.regex.Pattern$Branch.match(Pattern.java(Compiled
Code))
at
java.util.regex.Pattern$Branch.match(Pattern.java(Compiled
Code))
at
java.util.regex.Pattern$GroupHead.match(Pattern.java(Compiled
Code))
at
java.util.regex.Pattern$SingleA.match(Pattern.java(Compiled
Code))
at
java.util.regex.Pattern$Start.match(Pattern.java(Compiled
Code))
at
java.util.regex.Matcher.find(Matcher.java(Inlined
Compiled Code))
at
java.util.regex.Matcher.find(Matcher.java(Inlined
Compiled Code))
at
org.archive.crawler.extractor.ExtractorHTML.extract(ExtractorHTML.java(Comp
iled
Code))
at
org.archive.crawler.extractor.ExtractorHTML.innerProcess(ExtractorHTML.java
(Compiled
Code))
at
org.archive.crawler.framework.Processor.process(Processor.java(Compiled
Code))
at
org.archive.crawler.framework.ToeThread.processCrawlUri(ToeThread.java(Comp
iled
Code))
at
org.archive.crawler.framework.ToeThread.run(ToeThread.java(Compiled
Code))
---
Gordon Mohr
Disk I/O
None
Public
|
Date: 2007-03-14 00:17
|
|
Date: 2004-10-28 00:03 Logged In: YES |
|
Date: 2004-10-23 02:44 Logged In: YES |
|
Date: 2004-10-23 01:12 Logged In: YES |
| Field | Old Value | Date | By |
|---|---|---|---|
| status_id | Open | 2004-10-28 00:03 | gojomo |
| resolution_id | None | 2004-10-28 00:03 | gojomo |
| close_date | - | 2004-10-28 00:03 | gojomo |
| priority | 9 | 2004-10-23 02:44 | gojomo |
Copyright © 2010 Geeknet, Inc. All rights reserved. Terms of Use