Doug, the hard drives are IBM DPSS-309170M, revision 03. They're u160
drives running on a "LSI Logic / Symbios Logic 53c1010 Ultra3 SCSI Adapter
(rev 01)" using the sym53c8xx kernel driver on a vanilla 2.6.6 kernel.
Thanks for decoding the [SK ASC ASQ] sequence. I couldn't find this in the
draft I looked at. Do you have any idea what failed in segment 3 means?
What's a segment?
Thanks for your help,
On Thu, 15 Jul 2004, Douglas Gilbert wrote:
> Bruce Allen wrote:
> > Doug, another one for you...
> > Bruce
> > ---------- Forwarded message ----------
> > Date: Wed, 14 Jul 2004 22:49:20 -0400 (EDT)
> > From: Ross Skaliotis <mross@...>
> > To: smartmontools-support@...
> > Subject: [smartmontools-support]SCSI error log
> > I've just recently started performing short and long self-tests on my 4
> > identical SCSI drives. Quite strangely, I've been getting identical errors
> > written to the error logs of these drives. Even stranger is that these
> > errors come and go... Below is a sample smartctl output. There are no
> > errors in the error counter log. The "failed in segment..." errors you see
> > also appear sporadically and identically in the selftest logs of my other
> > three drives. Can anyone tell me what the problem (if any) actually is?
> > Thanks so much for your help.
> > -Ross Skaliotis
> > ---------------------------------
> > #smartctl -H -l selftest /dev/sdd
> > smartctl version 5.26 Copyright (C) 2002-3 Bruce Allen
> > Home page is http://smartmontools.sourceforge.net/
> > SMART Health Status: OK
> > Current Drive Temperature: 30 C
> > Drive Trip Temperature: 85 C
> > Manufactured in week 43 of year 2000
> > Current start stop count: 29 times
> > Recommended maximum start stop count: 10000 times
> > SMART Self-test log
> > Num Test Status segment LifeTime LBA_first_err [SK ASC ASQ]
> > Description number (hours)
> > # 1 Background long Failed in segment --> 3 192 - [0x4 0x3e 0x3]
> > # 2 Background short Completed - 192 - [- - -]
> > # 3 Background short Completed - 192 - [- - -]
> > # 4 Background short Failed in segment --> 3 192 - [0x4 0x3e 0x3]
> > # 5 Background short Failed in segment --> 3 192 - [0x4 0x3e 0x3]
> > # 6 Background short Failed in segment --> 3 192 - [0x4 0x3e 0x3]
> The [SK ASC ASQ] sequence means: "hardware error: logical unit
> failed self test". That doesn't look like a random error;
> however it could be a little more precise.
> As to why it appears sometimes and not others I cannot tell.
> Why 4 identical SCSI drives would exhibit the same
> characteristics is even more surprising. What model disks
> are they?
> Doug Gilbert