Thread: [smartmontools-support]Bad block reallocation not triggered automatically ?

Disk Inspection and Monitoring

Brought to you by: ballen4705, chrfranke, dipohl

smartmontools-support

[smartmontools-support]Bad block reallocation not triggered automatically ?

From: Delian K. <sma...@kr...> - 2005-02-03 14:23:36

Please do not read the whole message if You don't have the time. Just
the sections between the stars are important.

Hello,

I have a failing 60Gigs Maxtor hard drive. When I've tried to diagnose
what was wrong with it, I've found it had 266 already reallocated blocks,
and 260 more pending for reallocation. I've found the howto on the
smartmontools site, and tried to write some data on the failing sectors.
This was no success. I've actually tried zeroing the complete drive
previously, with the same result.

I've decided the drive has phisical problems at this points, and that it'=
s
"spare sectors pool" has been exausted. At this point comes my first majo=
r
question:

******
* Is there a way to determine how many "spare sectors" a drive offers ?
* If not at run time, shouldn't it be specified by the manufacturer
* somewhere ?
* If not, are You aware of such a data for some of the modern IDE drives =
?
******

I've googled around to see how to force the reallocation, since I just
had the feeling this drive was not yet gone. I wasn't quite successfull
though, and found nothing, but the method your HOWTO offers - writing som=
e
data on the failing sector.

AFAIKnew, the "HDD low level format", was doing nothing but writing the
complete drive with some pattern, usually zeroes. I've read about it
on the maxtor site, and their "powermax" tool was really documented to
do it this way. I didn't have too much hope in it, but I've decided to
give it a try. I've also hoped there might be some feature, not yet
known by me, which this software might offer. Maxtor provides nothing
but a windows executable to create the boot disquette with powermax.
I've tried "apt-get install wine", and was quite dissapointed when the
"./wine ./powermax.exe" didn't found the floppy drive. A quick look at
the wine's config options showed me that everything "should" be ok.
Anyway, it obviously wasn't. So I've left this tool until several
days later when I was in the office and onother HDD with the same
simptoms was given to me in the office. Since I've got a win machine
there I was able to create this stupid disquette. I've run powermax,
and a quick look at it's features showed me It had nothing to offer
that I haven't tried yet. It just runs the smart tests, and fills
the drive with zeroes. Anyway, that feeling of mine made me run the
low level format. I was quite amazed when it finished successfully,
more amazed when the smart test were successfull later, and even
more amazed when I've seen this low lever format was able to force
the reallocation event, and the number of reallocated sectors had
lowered to just 6, and the pending ones to 0. But hey, I've already
tried filling the drive
with zeroes. Why did I fail previously ? I've remembered I've read
s.t. about the bufferening, raw devices, etc .. in the smartmoontoos
ML archives, and concluded this should be the reason why my previous
tries weren't successfull. So

*******
* Is there a way to force the bad block reallocation event under linux ?
* Does the success of writing to a hdd dependend on the driver's
* buffering settings ?
*******

I just have the feeling thish should work transparently, not to be
forced by "unbuffered low level format". The reason I'm asking this
is because I have another drive, which did not showed any such simptoms,
until I've run the long smart test on it. The test failed, and I've
mentioned there is one pending sector for reallocation. The only way I
see it could be done currently, is to:
- migrate all the readable data to another drive
- low level format the complete drive
- put the data back.

Also note that this drive has just one pending sector, and it's
allready reallocad sectors are 0.

Additionally, it's quite strange powermax says that the drive should
be returned if these problems are correctable by a simple zeroing ..

Thanks for your attention.
Cheers, Delian

Fwd: Re: [smartmontools-support]Bad block reallocation not triggered automatically ?

From: Delian K. <sma...@kr...> - 2005-02-03 15:54:28

I hope You do not mind bringing your message to the public list.
Please reply to the list in the future. I find the opened discussion
more convinient.

----------  Forwarded Message  ----------

Subject: Re: [smartmontools-support]Bad block reallocation not triggered =
automatically ?
Date: Thu, 3 Feb 2005 10:22:09 -0500 (EST)
From: Eric Praetzel <XXX@XXX>
To: sma...@kr... (Delian Krustev)

> I have a failing 60Gigs Maxtor hard drive. When I've tried to diagnose

Send it back for warranty repair or throw it out!

I've had to deal with a good 20 failing drives in the past few months.
Maxtor/WD's suffer from early failures.
IBMs are reliable for a good 2.5 years and then they earn the name DeathS=
tar!

> what was wrong with it, I've found it had 266 already reallocated block=
s,
> and 260 more pending for reallocation. I've found the howto on the

Heave that thing in the garbage or get a warranty replacement!
Whenever I've seen this the drive is well on it's way to failure.
Running PowerMax should confirm that.

> I've decided the drive has phisical problems at this points, and that i=
t's
> "spare sectors pool" has been exausted. At this point comes my first ma=
jor

There is no "spare sectors pool".  All sectors are available for use.  As
they fail they get marked bad and avoided.  You start with the maximum
number of sectors and go down from there.

Any drive which is loosing sectors over time is on a slope to failure.

I rarely see a drive with 1 "pending" bad sector that continues to work
and doesn't fail the mfg's tests.  Most fail quickly - an exception
being some older 15G Maxtors/Quantums I have in service.

> *******
> * Is there a way to force the bad block reallocation event under linux =
?

shutdown -F -r now
will force a "fsck" or file system check at boot time on RedHat/Fedora.

> * Does the success of writing to a hdd dependend on the driver's
> * buffering settings ?

Yes and no.  Generally no - but if you're misconfigured the settings
then lots can go wrong - usually hanging the machine quickly.

> is because I have another drive, which did not showed any such simptoms=
,
> until I've run the long smart test on it. The test failed, and I've

I don't put any drive into a server unless it's had a 3 day burn-in
of read-writes via the Powermax software - I've just had tooo many
WD / Maxtor drives keel over on me within a month of installs.

> - migrate all the readable data to another drive
> - low level format the complete drive
> - put the data back.

Don't trust the drive - don't trust the drive.

Modern drives are dirt cheap for their performance.  What's been
sacrified is reliability and a "soft" failure.  Gone are the days
when I'd watch a drive slowly head towards failure over the space
of a year.  Now they drop dead really really fast.  I've seen 2
drop half way thru the MaxBlast test - totally, electically gone.

> Also note that this drive has just one pending sector, and it's
> allready reallocad sectors are 0.

I have 2 drives like that (out of 20 bad ones) which continue to work.
I get no messages about data corruption; fsck or Windoze never marks
the sector as bad and so I use the drives as backups - non critical use
with a very low usage.

Your time is worth more than dealing with a suspect drive that is likely
on it's way to failure.

> Additionally, it's quite strange powermax says that the drive should
> be returned if these problems are correctable by a simple zeroing ..

If PowerMax says that the drive has failed - it's failed.  It may not
be in the grave yet - but it's on it's way.

 - Eric

-------------------------------------------------------