I've been playing with smartmontools, and have had an error diagnosed by
one of the selftest, and I'm hoping someone can provide some pointers as
to what to do next...
Quick summary - system is my main desktop, Debian Linux on x86, 4 SCSI
disks set up using kernel software RAID-5. All the md devices are
formatted with XFS. After running "smartctl -t long /dev/sdb", I get the
following output from "smartctl -a /dev/sdb":-
smartctl version 5.32 Copyright (C) 2002-4 Bruce Allen
Home page is http://smartmontools.sourceforge.net/
Device: IBM DDYS-T18350N Version: S96H
Serial number: 5EG73154
Device type: disk
Transport protocol: Fibre channel (FCP-2)
Local Time is: Thu Dec 23 12:16:29 2004 GMT
Device supports SMART and is Enabled
Temperature Warning Disabled or Not Supported
SMART Health Status: OK
Current Drive Temperature: 49 C
Drive Trip Temperature: 85 C
Manufactured in week 07 of year 2001
Current start stop count: 436 times
Recommended maximum start stop count: 10000 times
Error counter log:
Errors Corrected Total Total Correction Gigabytes Total
delay: [rereads/ errors algorithm processed uncorrected
minor | major rewrites] corrected invocations [10^9 bytes] errors
read: 0 0 4 8 48 24.088 40
write: 0 0 0 0 0 83.515 0
Non-medium error count: 0
SMART Self-test log
Num Test Status segment LifeTime LBA_first_err [SK ASC ASQ]
Description number (hours)
# 1 Background long Failed in segment --> 5 3010 0x 2168629 [0x4 0x3e 0x3]
My questions are:-
1/ How serious is this? Disk on way out now, or just a bad area?
2/ Can I map it out?
3/ How can I map the LBA address to a file?
Things I've read make it clear how to answer 3/ for ext2 filesystems,
but not XFS on RAID, and 5 minutes with the XFS debugger has not
provided enough enlightenment yet...
P.S. I'm not a list subscriber at the moment, so I'd really appreciate a
CC on any replies. Thanks.