From: Eduard M. <mar...@ro...> - 2004-10-19 01:29:10

Good idea.

-----Forwarded Message-----
From: Bruce Allen <ba...@gr...>
To: Eduard Martinescu <mar...@ro...>
Cc: mo...@ol...
Subject: Re: [smartmontools-support]bad sectors with freebsd (fwd)
Date: Mon, 18 Oct 2004 19:59:55 -0500

I suggest that you both copy the smartmontools-support mailing list on
this type of correspondence. Some people do search it for answers before
writing their own posts....

Cheers,
Bruce

On Mon, 18 Oct 2004, Eduard Martinescu wrote:

> Bruce/Ciro,
>
> This is a holdover from when the mechanism to read back ATA registers
> after execution of a command was not defined. With Smartmontools 5.33,
> this has been changed, and instead of testing for some specific IOCTL
> define, I examine the FreeBSD Version instead.
>
> Ciro, can you test out the CVS version of Smartmontools? Or ignore the
> message until the next stable release, when the port will get bumped
> also.
>
> Ed
>
> On Sun, 2004-10-17 at 21:42, Bruce Allen wrote:
> > See question at the end...
> >
> > ---------- Forwarded message ----------
> > Date: Sun, 17 Oct 2004 18:07:59 +0200
> > From: Ciro Scognamiglio <mo...@ol...>
> > To: sma...@li...
> > Subject: Re: [smartmontools-support]bad sectors with freebsd
> >
> > On Fri, Oct 15, 2004 at 08:46:36AM -0500
> > Bruce Allen <ba...@gr...> wrote:
> >
> > > > Oct 15 14:53:46 linc smartd[575]: Device: /dev/ad0, 2 Offline
> > > > uncorrectable sectors
> > > >
> > > > How do I get rid of them? I thought those two sectors had been fixed
> > > > (well, isolated maybe)...but still smartd complains...am I missing
> > > > something?
> > >
> > > Run an offline test:
> > >   smartctl -t long /dev/ad0
> > > if it finds no unreadable sectors it should reset the OFFLINE
> > > uncorrectable count to zero.
> > >
> >
> > I ran that test 2 times, but I am still getting the
> >
> > smartd[575]: Device: /dev/ad0, 2 Offline uncorrectable sectors
> >
> > error in the system log.
> >
> > smartctl -l selftest /dev/ad0 gives:
> >
> > SMART Self-test log structure revision number 1
> > Num  Test_Description  Status                   Remaining  LifeTime(hours)  LBA_of_first_error
> > # 1  Extended offline  Completed without error  00%        136              -
> > # 2  Extended offline  Completed without error  00%        124              -
> > # 3  Extended offline  Completed: read failure  00%        118              26926850
> >
> > I have also a couple more questions:
> >
> > - is the "Remaining" value the % of the remaining test to be done? or
> >   what?
> > - I always get this message with smartctl:
> >
> > smartctl version 5.32 Copyright (C) 2002-4 Bruce Allen
> > Home page is http://smartmontools.sourceforge.net/
> >
> > The SMART RETURN STATUS return value (smartmontools -H option/Directive)
> > can not be retrieved with this version of ATAng, please do not rely on
> > this value
> >
> > what does it mean? My freebsd box is updated with the 5.3 beta7
> > release...
> >
> > thnx
> >
> > Ciro.
> >
> >
> > -------------------------------------------------------
> > This SF.net email is sponsored by: IT Product Guide on ITManagersJournal
> > Use IT products in your business? Tell us what you think of them. Give us
> > Your Opinions, Get Free ThinkGeek Gift Certificates! Click to find out more
> > http://productguide.itmanagersjournal.com/guidepromo.tmpl
> > _______________________________________________
> > Smartmontools-support mailing list
> > Sma...@li...
> > https://lists.sourceforge.net/lists/listinfo/smartmontools-support
> --
> Eduard Martinescu <mar...@ro...>

--
Eduard Martinescu <mar...@ro...>

From: Ciro S. <mo...@ol...> - 2004-10-19 14:01:30

uh...I have got a strange thing here...
Yesterday I built the cvs version of smartmontools and moved the smartctl
and smartd executables over the old files (the two built from the
current FreeBSD port)...

They seemed to work but I noticed that neither the long test nor the
offline test produced any change even though they seemed to start
correctly.

Now that I rebooted (and didn't touch anything else) I get the following
with smartctl, whatever command I use:

moray@linc [~] jobs 0 on ttyp0> smartctl -a /dev/ad0
smartctl version 5.34 [i386-unknown-freebsd5.3] Copyright (C) 2002-4
Bruce Allen
Home page is http://smartmontools.sourceforge.net/

Smartctl open device: /dev/ad0 failed: Inappropriate ioctl for device

what's wrong?

smartd still has the same behavior, i.e. it produces the:

Oct 19 15:43:23 linc smartd[582]: Device: /dev/ad0, 2 Offline
uncorrectable sectors

in /var/log/messages (and sends me an email)...

I had a look at the freebsd port patch and noticed that it only applies
the following change (that I applied manually to the cvs version):

--- os_freebsd.c.orig   Sat Mar 13 11:32:52 2004
+++ os_freebsd.c        Thu Jul 15 23:42:06 2004
@@ -18,6 +18,7 @@
 #include <stdio.h>
 #include <sys/types.h>
 #include <dirent.h>
+#include <fcntl.h>
 #include <err.h>
 #include <camlib.h>
 #include <cam/scsi/scsi_message.h>

C.

From: Ciro S. <mo...@ol...> - 2004-10-19 14:44:03

On Tue, Oct 19, 2004 at 04:03:40PM +0200
Ciro Scognamiglio <mo...@ol...> wrote:

> moray@linc [~] jobs 0 on ttyp0> smartctl -a /dev/ad0
> smartctl version 5.34 [i386-unknown-freebsd5.3] Copyright (C) 2002-4
> Bruce Allen
> Home page is http://smartmontools.sourceforge.net/
>
> Smartctl open device: /dev/ad0 failed: Inappropriate ioctl for device
>
> what's wrong?

geeee...I feel a bit stupid :P
of course I had to be root to issue that command....

anyway, I have started a "smartctl -t offline /dev/ad0" test, I should
get info with "smartctl -l selftest /dev/ad0" as soon as it finishes,
right? I will let you know if this time it will work or not...

also, I just noticed that the serial number of my device is:

Serial Number: 00000000000000

as reported by smartctl...mmm... there's definitely something wrong
here...

Ciro.

From: Eduard M. <mar...@ro...> - 2004-10-20 06:10:57

Ciro,

I'm at a loss. Just to make sure, I just rebuilt the package from
scratch and tried it out. I was able to check out '/dev/ad0' without any
problem. If you want, I can email you the tar file of my source, but it
is just a clean checkout of current CVS code.

Also, I don't think that patch you indicate below is necessary with the
CVS version of smartmontools, as I don't need to apply it to build, but
I don't think that should cause any issues.

You should be able to test out the executable directly from where you
built it as './smartctl -a /dev/ad0'

If that works, then the only thing I can think of is something going
wrong when you copied it.

Ed

On Tue, 2004-10-19 at 10:03, Ciro Scognamiglio wrote:
> uh...I have got a strange thing here...
> Yesterday I built the cvs version of smartmontools and moved the smartctl
> and smartd executables over the old files (the two built from the
> current FreeBSD port)...
>
> They seemed to work but I noticed that neither the long test nor the
> offline test produced any change even though they seemed to start
> correctly.
>
> Now that I rebooted (and didn't touch anything else) I get the following
> with smartctl, whatever command I use:
>
> moray@linc [~] jobs 0 on ttyp0> smartctl -a /dev/ad0
> smartctl version 5.34 [i386-unknown-freebsd5.3] Copyright (C) 2002-4
> Bruce Allen
> Home page is http://smartmontools.sourceforge.net/
>
> Smartctl open device: /dev/ad0 failed: Inappropriate ioctl for device
>
> what's wrong?
>
> smartd still has the same behavior, i.e. it produces the:
>
> Oct 19 15:43:23 linc smartd[582]: Device: /dev/ad0, 2 Offline
> uncorrectable sectors
>
> in /var/log/messages (and sends me an email)...
>
> I had a look at the freebsd port patch and noticed that it only applies
> the following change (that I applied manually to the cvs version):
>
> --- os_freebsd.c.orig   Sat Mar 13 11:32:52 2004
> +++ os_freebsd.c        Thu Jul 15 23:42:06 2004
> @@ -18,6 +18,7 @@
>  #include <stdio.h>
>  #include <sys/types.h>
>  #include <dirent.h>
> +#include <fcntl.h>
>  #include <err.h>
>  #include <camlib.h>
>  #include <cam/scsi/scsi_message.h>
>
> C.

--
Eduard Martinescu <mar...@ro...>

From: Ciro S. <mo...@ol...> - 2004-10-21 18:00:09
Attachments:
freebsd-BadBlockHowTo.txt

On Tue, Oct 19, 2004 at 08:12:57PM -0400
Eduard Martinescu <mar...@ro...> wrote:

> Ciro,
>
> I'm at a loss. Just to make sure, I just rebuilt the package from
> scratch and tried it out. I was able to check out '/dev/ad0' without any
> problem. If you want, I can email you the tar file of my source, but it
> is just a clean checkout of current CVS code.
>
> Also, I don't think that patch you indicate below is necessary with the
> CVS version of smartmontools, as I don't need to apply it to build, but
> I don't think that should cause any issues.
>
> You should be able to test out the executable directly from where you
> built it as './smartctl -a /dev/ad0'
>
> If that works, then the only thing I can think of is something going
> wrong when you copied it.
>

I am sorry, it was my fault, I was rather tired that day and tried to
run smartctl as a common user :P

anyway, I have now tried some tests with the cvs version checked out 4
or 5 days ago. They terminate successfully, but I have still got smartd
-H complaining about an uncorrectable sector error....

# smartctl -l selftest /dev/ad0
smartctl version 5.34 [i386-unknown-freebsd5.3] Copyright (C) 2002-4
Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF READ SMART DATA SECTION ===
SMART Self-test log structure revision number 1
Num  Test_Description  Status                   Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline  Completed without error  00%        171              -
# 2  Extended offline  Completed without error  00%        161              -
# 3  Short offline     Completed without error  00%        160              -
# 4  Extended offline  Completed without error  00%        138              -
# 5  Extended offline  Completed without error  00%        136              -
# 6  Extended offline  Completed without error  00%        124              -
# 7  Extended offline  Completed: read failure  00%        118              26926850

The disk seems to run fine, but I still get this in /var/log/messages

Oct 21 19:15:06 linc smartd[585]: Device: /dev/ad0, 2 Offline
uncorrectable sectors

while:

# smartctl -A /dev/ad0
smartctl version 5.34 [i386-unknown-freebsd5.3] Copyright (C) 2002-4
Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG    VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000b  100   100   051    Pre-fail  Always   -           19
  3 Spin_Up_Time            0x0007  077   046   000    Pre-fail  Always   -           4352
  4 Start_Stop_Count        0x0032  100   100   000    Old_age   Always   -           176
  5 Reallocated_Sector_Ct   0x0033  253   253   010    Pre-fail  Always   -           0
  7 Seek_Error_Rate         0x000b  253   253   051    Pre-fail  Always   -           0
  8 Seek_Time_Performance   0x0024  086   086   000    Old_age   Offline  -           11581
  9 Power_On_Half_Minutes   0x0032  100   100   000    Old_age   Always   -           177h+36m
 10 Spin_Retry_Count        0x0013  253   253   049    Pre-fail  Always   -           0
 12 Power_Cycle_Count       0x0032  100   100   000    Old_age   Always   -           176
194 Temperature_Celsius     0x0022  151   124   000    Old_age   Always   -           29
195 Hardware_ECC_Recovered  0x000a  100   100   000    Old_age   Always   -           88268331
196 Reallocated_Event_Count 0x0012  100   100   000    Old_age   Always   -           2
197 Current_Pending_Sector  0x0033  253   253   010    Pre-fail  Always   -           0
198 Offline_Uncorrectable   0x0031  100   100   010    Pre-fail  Offline  -           2
199 UDMA_CRC_Error_Count    0x000b  100   100   051    Pre-fail  Always   -           0

anyway, based on the BadBlockHowTo.txt I prepared a FreeBSD version
(attached). I am not sure this is all correct, maybe I could post it
also on an Italian FreeBSD mailing list to get some feedback.

BTW, that document describes what I did to correct the 2 damaged
sectors.

Ciro.

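For anyone who wants to watch these counters from a script rather than by eye, here is a minimal sketch of pulling the raw values out of `smartctl -A` text. It only parses text in the ten-column layout shown in the message above; the `SAMPLE` rows are copied from that output, and the names `ROW` and `parse_attributes` are made up for this illustration (they are not part of smartmontools). Other smartctl versions may print a different layout, so treat the regular expression as an assumption.

```python
import re

# Rows copied from the `smartctl -A /dev/ad0` output quoted in the thread.
SAMPLE = """\
  5 Reallocated_Sector_Ct   0x0033  253   253   010    Pre-fail  Always   -           0
196 Reallocated_Event_Count 0x0012  100   100   000    Old_age   Always   -           2
197 Current_Pending_Sector  0x0033  253   253   010    Pre-fail  Always   -           0
198 Offline_Uncorrectable   0x0031  100   100   010    Pre-fail  Offline  -           2
"""

# One attribute row: ID, name, flag, value, worst, thresh, type,
# updated, when_failed, raw value (assumed layout, see lead-in).
ROW = re.compile(
    r"^\s*(?P<id>\d+)\s+(?P<name>\S+)\s+(?P<flag>0x[0-9a-fA-F]+)\s+"
    r"(?P<value>\d+)\s+(?P<worst>\d+)\s+(?P<thresh>\d+)\s+"
    r"(?P<type>\S+)\s+(?P<updated>\S+)\s+(?P<when_failed>\S+)\s+"
    r"(?P<raw>.+?)\s*$"
)


def parse_attributes(text):
    """Return {attribute_id: {...}} for every line that looks like a row."""
    attrs = {}
    for line in text.splitlines():
        m = ROW.match(line)
        if m:
            attrs[int(m.group("id"))] = {
                "name": m.group("name"),
                "value": int(m.group("value")),
                "thresh": int(m.group("thresh")),
                # Kept as a string: some raw values are not plain
                # integers (e.g. "177h+36m" in the full table).
                "raw": m.group("raw"),
            }
    return attrs


attrs = parse_attributes(SAMPLE)
for attr_id in (5, 196, 197, 198):
    a = attrs[attr_id]
    print(f"{attr_id:3d} {a['name']:<24} raw={a['raw']}")
```

Read this way, the picture in the thread is easier to see: attribute 197 (pending) is back to 0 and 196 records 2 reallocation events, while 198 still reports 2.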
From: Bruce A. <ba...@gr...> - 2004-10-22 04:55:38

Ciro,

Here's an idea. It probably won't work but give it a try. Try
  smartctl -t offline
to see if after a few minutes this sets the offline uncorrectable count
back to zero.

In the worst case you can always 'turn off' the log warning message using
  -U 0
in /etc/smartd.conf.

Cheers,
Bruce

On Thu, 21 Oct 2004, Ciro Scognamiglio wrote:

> On Tue, Oct 19, 2004 at 08:12:57PM -0400
> Eduard Martinescu <mar...@ro...> wrote:
>
> > Ciro,
> >
> > I'm at a loss. Just to make sure, I just rebuilt the package from
> > scratch and tried it out. I was able to check out '/dev/ad0' without any
> > problem. If you want, I can email you the tar file of my source, but it
> > is just a clean checkout of current CVS code.
> >
> > Also, I don't think that patch you indicate below is necessary with the
> > CVS version of smartmontools, as I don't need to apply it to build, but
> > I don't think that should cause any issues.
> >
> > You should be able to test out the executable directly from where you
> > built it as './smartctl -a /dev/ad0'
> >
> > If that works, then the only thing I can think of is something going
> > wrong when you copied it.
> >
>
> I am sorry, it was my fault, I was rather tired that day and tried to
> run smartctl as a common user :P
>
> anyway, I have now tried some tests with the cvs version checked out 4
> or 5 days ago. They terminate successfully, but I have still got smartd
> -H complaining about an uncorrectable sector error....
>
> # smartctl -l selftest /dev/ad0
> smartctl version 5.34 [i386-unknown-freebsd5.3] Copyright (C) 2002-4
> Bruce Allen
> Home page is http://smartmontools.sourceforge.net/
>
> === START OF READ SMART DATA SECTION ===
> SMART Self-test log structure revision number 1
> Num  Test_Description  Status                   Remaining  LifeTime(hours)  LBA_of_first_error
> # 1  Extended offline  Completed without error  00%        171              -
> # 2  Extended offline  Completed without error  00%        161              -
> # 3  Short offline     Completed without error  00%        160              -
> # 4  Extended offline  Completed without error  00%        138              -
> # 5  Extended offline  Completed without error  00%        136              -
> # 6  Extended offline  Completed without error  00%        124              -
> # 7  Extended offline  Completed: read failure  00%        118              26926850
>
> The disk seems to run fine, but I still get this in /var/log/messages
>
> Oct 21 19:15:06 linc smartd[585]: Device: /dev/ad0, 2 Offline
> uncorrectable sectors
>
> while:
>
> # smartctl -A /dev/ad0
> smartctl version 5.34 [i386-unknown-freebsd5.3] Copyright (C) 2002-4
> Bruce Allen
> Home page is http://smartmontools.sourceforge.net/
>
> === START OF READ SMART DATA SECTION ===
> SMART Attributes Data Structure revision number: 16
> Vendor Specific SMART Attributes with Thresholds:
> ID# ATTRIBUTE_NAME          FLAG    VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
>   1 Raw_Read_Error_Rate     0x000b  100   100   051    Pre-fail  Always   -           19
>   3 Spin_Up_Time            0x0007  077   046   000    Pre-fail  Always   -           4352
>   4 Start_Stop_Count        0x0032  100   100   000    Old_age   Always   -           176
>   5 Reallocated_Sector_Ct   0x0033  253   253   010    Pre-fail  Always   -           0
>   7 Seek_Error_Rate         0x000b  253   253   051    Pre-fail  Always   -           0
>   8 Seek_Time_Performance   0x0024  086   086   000    Old_age   Offline  -           11581
>   9 Power_On_Half_Minutes   0x0032  100   100   000    Old_age   Always   -           177h+36m
>  10 Spin_Retry_Count        0x0013  253   253   049    Pre-fail  Always   -           0
>  12 Power_Cycle_Count       0x0032  100   100   000    Old_age   Always   -           176
> 194 Temperature_Celsius     0x0022  151   124   000    Old_age   Always   -           29
> 195 Hardware_ECC_Recovered  0x000a  100   100   000    Old_age   Always   -           88268331
> 196 Reallocated_Event_Count 0x0012  100   100   000    Old_age   Always   -           2
> 197 Current_Pending_Sector  0x0033  253   253   010    Pre-fail  Always   -           0
> 198 Offline_Uncorrectable   0x0031  100   100   010    Pre-fail  Offline  -           2
> 199 UDMA_CRC_Error_Count    0x000b  100   100   051    Pre-fail  Always   -           0
>
> anyway, based on the BadBlockHowTo.txt I prepared a FreeBSD version
> (attached). I am not sure this is all correct, maybe I could post it
> also on an Italian FreeBSD mailing list to get some feedback.
>
> BTW, that document describes what I did to correct the 2 damaged
> sectors.
>
> Ciro.
>

From: Ciro S. <mo...@ol...> - 2004-10-22 11:15:13

On Thu, Oct 21, 2004 at 11:55:31PM -0500
Bruce Allen <ba...@gr...> wrote:

> Ciro,
>
> Here's an idea. It probably won't work but give it a try. Try
>   smartctl -t offline
> to see if after a few minutes this sets the offline uncorrectable count
> back to zero.

I have tried the long, short and offline tests with no results :(
is there a way to debug? I mean, maybe I could send you more useful
information...

>
> In the worst case you can always 'turn off' the log warning message using
>   -U 0
> in /etc/smartd.conf.
>

but that won't report any other uncorrectable errors....

could it be a bug in my Samsung device?
As already reported I have noticed that the serial number reported by
smartctl is all zeroes....

From: Bruce A. <ba...@gr...> - 2004-10-22 13:40:55

> > Here's an idea. It probably won't work but give it a try. Try
> >   smartctl -t offline
> > to see if after a few minutes this sets the offline uncorrectable count
> > back to zero.
>
> I have tried the long, short and offline tests with no results :(
> is there a way to debug? I mean, maybe I could send you more useful
> informations...

It's not smartmontools: you'd need a way to debug the drive's firmware.

> > In the worst case you can always 'turn off' the log warning message using
> >   -U 0
> > in /etc/smartd.conf.
>
> but that wont report any other uncorrectable errors....

Correct.

> could it be a bug in my samsung device?
> As already reported I have noticed that the serial number reported by
> smartctl is all zeroes....

Yes, that's weird. Can you check that you get the same behavior with a
Knoppix 3.6 bootable CD and Linux?

Cheers,
Bruce

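The trade-off discussed above (silencing the warning with -U 0 also hides any new uncorrectable sectors) can be worked around outside smartd by remembering the last count and alerting only when it grows. The sketch below is purely illustrative and is not a smartd feature described in this thread: `compare_to_baseline` is a hypothetical helper, and how the current count is obtained (for example from the raw value of attribute 198) and where the baseline is stored are left as assumptions.

```python
from typing import List, Tuple


def compare_to_baseline(current: int, baseline: int) -> Tuple[bool, int]:
    """Decide whether a new Offline_Uncorrectable reading deserves an alert.

    Returns (alert, new_baseline). An alert is raised only when the count
    has grown past the remembered baseline. The baseline always follows
    the latest reading, so if the drive ever resets the count to zero,
    a later increase is reported again.
    """
    alert = current > baseline
    return alert, current


def replay(readings: List[int], baseline: int) -> List[bool]:
    """Feed a series of readings through compare_to_baseline."""
    alerts = []
    for reading in readings:
        alert, baseline = compare_to_baseline(reading, baseline)
        alerts.append(alert)
    return alerts


# Start from the stuck count of 2 reported in the thread; the later
# readings (3, 0, 1) are invented to show the behaviour.
alerts = replay([2, 2, 2, 3, 0, 1], baseline=2)
print(alerts)
```

With a baseline of 2, the repeated readings of 2 stay quiet, the jump to 3 alerts, the reset to 0 is silent, and the next rise to 1 alerts again.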
From: Eduard M. <mar...@ro...> - 2004-10-22 17:13:24

Can you also send me the output of the following:

  atacontrol info 0

(I think I remember you saying this was for /dev/ad0; if it is ad2 or
ad3, then substitute a 1 for the 0 above)

That command should report a short one-line message for each drive on
channel 0, which should include the serial #.

Ed

On Fri, 2004-10-22 at 09:40, Bruce Allen wrote:
> > > Here's an idea. It probably won't work but give it a try. Try
> > >   smartctl -t offline
> > > to see if after a few minutes this sets the offline uncorrectable count
> > > back to zero.
> >
> > I have tried the long, short and offline tests with no results :(
> > is there a way to debug? I mean, maybe I could send you more useful
> > informations...
>
> It's not smartmontools: you'd need a way to debug the drive's firmware.
>
> > > In the worst case you can always 'turn off' the log warning message using
> > >   -U 0
> > > in /etc/smartd.conf.
> >
> > but that wont report any other uncorrectable errors....
>
> Correct.
>
> > could it be a bug in my samsung device?
> > As already reported I have noticed that the serial number reported by
> > smartctl is all zeroes....
>
> Yes, that's weird. Can you check that you get the same behavior with a
> knoppix 3.6 bootable CD and linux?
>
> Cheers,
> Bruce

--
Eduard Martinescu <mar...@ro...>

From: Ciro S. <mo...@ol...> - 2004-10-22 17:35:49

On Fri, Oct 22, 2004 at 11:38:06AM -0400
Eduard Martinescu <mar...@ro...> wrote:

> Can you also send me the output of the following:
>
>   atacontrol info 0
>
> (I think I remember you saying this was for /dev/ad0 , if it is ad2 or
> ad3, then substitute a 1 for the 0 above)
>
> That command should report a short one line message for each drive on
> channel 0, which should include the serial #.
>

here you are:

# atacontrol info 0
Master:  ad0 <SAMSUNG SP0802N/TK100-24> ATA/ATAPI revision 7
Slave:       no device present

I will also check with a Linux distro asap.

C.
