From: dirk o. <di...@on...> - 2006-12-28 09:44:29
|
Hello list, i had an incident on my server (kernel panic, the green led on the seagate disk was permanent on). after removing and inserting the disks, the system came up again (i =A0have 2 disks in (software) raid 1). Since then smartd reports me following problem: "4294967295 Offline uncorrectable sectors" what strikes me is this number of sectors, which is 0xffffffff and which is more than the number of sectors on the disk. so i'm wondering whether i'm confronted here with a disk problem or a smartd problem. i ran the long selftest with smartctl and it "Completed without eror", but = i=20 still have the above error showing up . thanks for any guidance on this, dirk partial output of smartctl -a =3D=3D=3D START OF INFORMATION SECTION =3D=3D=3D Device Model: =A0 =A0 ST3200827AS Serial Number: =A0 =A05ND43GWE =46irmware Version: 3.AAH User Capacity: =A0 =A0200,049,647,616 bytes Device is: =A0 =A0 =A0 =A0Not in smartctl database [for details use: -P sho= wall] ATA Version is: =A0 7 ATA Standard is: =A0Exact ATA specification draft version not indicated Local Time is: =A0 =A0Sun Dec 24 14:25:41 2006 CET SMART support is: Available - device has SMART capability. SMART support is: Enabled =3D=3D=3D START OF READ SMART DATA SECTION =3D=3D=3D SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: =A0(0x82) Offline data collection activity =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0= =A0 was completed without error. =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0= =A0 Auto Offline Data Collection: Enabled. Self-test execution status: =A0 =A0 =A0( =A0 0) The previous self-test rout= ine=20 completed =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0= =A0 without error or no self-test has ever =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0= =A0 been run. Total time to complete Offline data collection: =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 ( 430) seconds. Offline data collection capabilities: =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0(0x5b) SMART execute O= ffline immediate. =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0= =A0 Auto Offline data collection on/off=20 support. =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0= =A0 Suspend Offline collection upon new =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0= =A0 command. =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0= =A0 Offline surface scan supported. =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0= =A0 Self-test supported. =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0= =A0 No Conveyance Self-test supported. =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0= =A0 Selective Self-test supported. SMART capabilities: =A0 =A0 =A0 =A0 =A0 =A0(0x0003) Saves SMART data before= entering =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0= =A0 power-saving mode. =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0= =A0 Supports SMART auto save timer. Error logging capability: =A0 =A0 =A0 =A0(0x01) Error logging supported. =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0= =A0 General Purpose Logging supported. Short self-test routine recommended polling time: =A0 =A0 =A0 =A0( =A0 1) minutes. Extended self-test routine recommended polling time: =A0 =A0 =A0 =A0( =A070) minutes. SMART Attributes Data Structure revision number: 10 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME =A0 =A0 =A0 =A0 =A0FLAG =A0 =A0 VALUE WORST THRESH TYPE = =A0 =A0 =A0UPDATED =A0 WHEN_FAILED RAW_VALUE =A0 1 Raw_Read_Error_Rate =A0 =A0 0x000f =A0 095 =A0 077 =A0 006 =A0 =A0Pre= =2Dfail =A0Always =A0 =A0 =A0=20 =2D =A0 =A0 =A0 104670423 =A0 3 Spin_Up_Time =A0 =A0 =A0 =A0 =A0 =A00x0003 =A0 090 =A0 090 =A0 000 = =A0 =A0Pre-fail =A0Always =A0 =A0 =A0=20 =2D =A0 =A0 =A0 0 =A0 4 Start_Stop_Count =A0 =A0 =A0 =A00x0032 =A0 100 =A0 100 =A0 020 =A0 = =A0Old_age =A0 Always =A0 =A0 =A0=20 =2D =A0 =A0 =A0 43 =A0 5 Reallocated_Sector_Ct =A0 0x0033 =A0 098 =A0 098 =A0 036 =A0 =A0Pre-f= ail =A0Always =A0 =A0 =A0=20 =2D =A0 =A0 =A0 90 =A0 7 Seek_Error_Rate =A0 =A0 =A0 =A0 0x000f =A0 055 =A0 053 =A0 030 =A0 = =A0Pre-fail =A0Always =A0 =A0 =A0=20 =2D =A0 =A0 =A0 94495961156 =A0 9 Power_On_Hours =A0 =A0 =A0 =A0 =A00x0032 =A0 098 =A0 098 =A0 000 =A0 = =A0Old_age =A0 Always =A0 =A0 =A0=20 =2D =A0 =A0 =A0 2334 =A010 Spin_Retry_Count =A0 =A0 =A0 =A00x0013 =A0 100 =A0 100 =A0 097 =A0 = =A0Pre-fail =A0Always =A0 =A0 =A0=20 =2D =A0 =A0 =A0 0 =A012 Power_Cycle_Count =A0 =A0 =A0 0x0032 =A0 100 =A0 100 =A0 020 =A0 =A0O= ld_age =A0 Always =A0 =A0 =A0=20 =2D =A0 =A0 =A0 47 187 Unknown_Attribute =A0 =A0 =A0 0x0032 =A0 001 =A0 001 =A0 000 =A0 =A0Old= _age =A0 Always =A0 =A0 =A0=20 =2D =A0 =A0 =A0 9995 189 Unknown_Attribute =A0 =A0 =A0 0x003a =A0 100 =A0 100 =A0 000 =A0 =A0Old= _age =A0 Always =A0 =A0 =A0=20 =2D =A0 =A0 =A0 0 190 Unknown_Attribute =A0 =A0 =A0 0x0022 =A0 071 =A0 063 =A0 045 =A0 =A0Old= _age =A0 Always =A0 =A0 =A0=20 =2D =A0 =A0 =A0 7224707121181 194 Temperature_Celsius =A0 =A0 0x0022 =A0 029 =A0 040 =A0 000 =A0 =A0Old_a= ge =A0 Always =A0 =A0 =A0=20 =2D =A0 =A0 =A0 29 (Lifetime Min/Max 0/22) 195 Hardware_ECC_Recovered =A00x001a =A0 052 =A0 048 =A0 000 =A0 =A0Old_age= =A0 Always =A0 =A0 =A0=20 =2D =A0 =A0 =A0 194549275 197 Current_Pending_Sector =A00x0012 =A0 001 =A0 001 =A0 000 =A0 =A0Old_age= =A0 Always =A0 =A0 =A0=20 =2D =A0 =A0 =A0 4294967295 198 Offline_Uncorrectable =A0 0x0010 =A0 001 =A0 001 =A0 000 =A0 =A0Old_age= =A0 Offline =A0 =A0 =A0 =2D =A0 =A0 =A0 4294967295 199 UDMA_CRC_Error_Count =A0 =A00x003e =A0 200 =A0 200 =A0 000 =A0 =A0Old_a= ge =A0 Always =A0 =A0 =A0=20 =2D =A0 =A0 =A0 0 200 Multi_Zone_Error_Rate =A0 0x0000 =A0 100 =A0 253 =A0 000 =A0 =A0Old_age= =A0 Offline =A0 =A0 =A0 =2D =A0 =A0 =A0 0 202 TA_Increase_Count =A0 =A0 =A0 0x0032 =A0 100 =A0 253 =A0 000 =A0 =A0Old= _age =A0 Always =A0 =A0 =A0=20 =2D =A0 =A0 =A0 0 |
From: dirk o. <di...@on...> - 2007-01-09 22:19:19
|
In the mean while i learned a little bit more about smartmon, but i am stil= l=20 struggling with some basic questions regarding the below issue.=20 It is now clear to me that the "4294967295" is the raw value and that the=20 effective number of bad sectors is 1?=20 How can i know whether this sector is still bad or whether it is reallocate= d=20 (i see that Reallocated_Sector_Ct is 98, does this mean that 100-98=3D2 sec= tors=20 have been reallocated)? Why is smartd reporting every day that there are 4294967295 bad sectors, wh= ile=20 the selftest passes without any indication of a bad LBA? I hope someone can put me on the right track to understand this issue bette= r. cheers, dirk On Thursday 28 December 2006 10:44, dirk ooms wrote: > Hello list, > > i had an incident on my server (kernel panic, the green led on > the seagate disk was permanent on). after removing and inserting the disk= s, > the system came up again (i =A0have 2 disks in (software) raid 1). > > Since then smartd reports me following problem: > "4294967295 Offline uncorrectable sectors" > > what strikes me is this number of sectors, which is 0xffffffff and which = is > more than the number of sectors on the disk. so i'm wondering whether i'm > confronted here with a disk problem or a smartd problem. > > i ran the long selftest with smartctl and it "Completed without eror", but > i still have the above error showing up . > > thanks for any guidance on this, > dirk > > partial output of smartctl -a > > =3D=3D=3D START OF INFORMATION SECTION =3D=3D=3D > Device Model: =A0 =A0 ST3200827AS > Serial Number: =A0 =A05ND43GWE > Firmware Version: 3.AAH > User Capacity: =A0 =A0200,049,647,616 bytes > Device is: =A0 =A0 =A0 =A0Not in smartctl database [for details use: -P s= howall] > ATA Version is: =A0 7 > ATA Standard is: =A0Exact ATA specification draft version not indicated > Local Time is: =A0 =A0Sun Dec 24 14:25:41 2006 CET > SMART support is: Available - device has SMART capability. > SMART support is: Enabled > > =3D=3D=3D START OF READ SMART DATA SECTION =3D=3D=3D > SMART overall-health self-assessment test result: PASSED > > General SMART Values: > Offline data collection status: =A0(0x82) Offline data collection activity > =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 = =A0 =A0 was completed without error. > =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 = =A0 =A0 Auto Offline Data Collection: > Enabled. Self-test execution status: =A0 =A0 =A0( =A0 0) The previous sel= f-test > routine completed > =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 = =A0 =A0 without error or no self-test has > ever been run. > Total time to complete Offline > data collection: =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 ( 430) seconds. > Offline data collection > capabilities: =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0(0x5b) SMART execute= Offline immediate. > =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 = =A0 =A0 Auto Offline data collection on/off > support. > =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 = =A0 =A0 Suspend Offline collection upon new > =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 = =A0 =A0 command. > =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 = =A0 =A0 Offline surface scan supported. > =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 = =A0 =A0 Self-test supported. > =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 = =A0 =A0 No Conveyance Self-test supported. > =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 = =A0 =A0 Selective Self-test supported. > SMART capabilities: =A0 =A0 =A0 =A0 =A0 =A0(0x0003) Saves SMART data befo= re entering > =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 = =A0 =A0 power-saving mode. > =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 = =A0 =A0 Supports SMART auto save timer. > Error logging capability: =A0 =A0 =A0 =A0(0x01) Error logging supported. > =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 = =A0 =A0 General Purpose Logging supported. > Short self-test routine > recommended polling time: =A0 =A0 =A0 =A0( =A0 1) minutes. > Extended self-test routine > recommended polling time: =A0 =A0 =A0 =A0( =A070) minutes. > > SMART Attributes Data Structure revision number: 10 > Vendor Specific SMART Attributes with Thresholds: > ID# ATTRIBUTE_NAME =A0 =A0 =A0 =A0 =A0FLAG =A0 =A0 VALUE WORST THRESH TYP= E =A0 =A0 =A0UPDATED =A0 > WHEN_FAILED RAW_VALUE > =A0 1 Raw_Read_Error_Rate =A0 =A0 0x000f =A0 095 =A0 077 =A0 006 =A0 =A0P= re-fail =A0Always =A0 > =A0 =A0 - =A0 =A0 =A0 104670423 > =A0 3 Spin_Up_Time =A0 =A0 =A0 =A0 =A0 =A00x0003 =A0 090 =A0 090 =A0 000 = =A0 =A0Pre-fail =A0Always =A0 > =A0 =A0 - =A0 =A0 =A0 0 > =A0 4 Start_Stop_Count =A0 =A0 =A0 =A00x0032 =A0 100 =A0 100 =A0 020 =A0 = =A0Old_age =A0 Always =A0 > =A0 =A0 - =A0 =A0 =A0 43 > =A0 5 Reallocated_Sector_Ct =A0 0x0033 =A0 098 =A0 098 =A0 036 =A0 =A0Pre= =2Dfail =A0Always =A0 > =A0 =A0 - =A0 =A0 =A0 90 > =A0 7 Seek_Error_Rate =A0 =A0 =A0 =A0 0x000f =A0 055 =A0 053 =A0 030 =A0 = =A0Pre-fail =A0Always =A0 > =A0 =A0 - =A0 =A0 =A0 94495961156 > =A0 9 Power_On_Hours =A0 =A0 =A0 =A0 =A00x0032 =A0 098 =A0 098 =A0 000 = =A0 =A0Old_age =A0 Always =A0 > =A0 =A0 - =A0 =A0 =A0 2334 > =A010 Spin_Retry_Count =A0 =A0 =A0 =A00x0013 =A0 100 =A0 100 =A0 097 =A0 = =A0Pre-fail =A0Always =A0 > =A0 =A0 - =A0 =A0 =A0 0 > =A012 Power_Cycle_Count =A0 =A0 =A0 0x0032 =A0 100 =A0 100 =A0 020 =A0 = =A0Old_age =A0 Always =A0 > =A0 =A0 - =A0 =A0 =A0 47 > 187 Unknown_Attribute =A0 =A0 =A0 0x0032 =A0 001 =A0 001 =A0 000 =A0 =A0O= ld_age =A0 Always =A0 > =A0 =A0 - =A0 =A0 =A0 9995 > 189 Unknown_Attribute =A0 =A0 =A0 0x003a =A0 100 =A0 100 =A0 000 =A0 =A0O= ld_age =A0 Always =A0 > =A0 =A0 - =A0 =A0 =A0 0 > 190 Unknown_Attribute =A0 =A0 =A0 0x0022 =A0 071 =A0 063 =A0 045 =A0 =A0O= ld_age =A0 Always =A0 > =A0 =A0 - =A0 =A0 =A0 7224707121181 > 194 Temperature_Celsius =A0 =A0 0x0022 =A0 029 =A0 040 =A0 000 =A0 =A0Old= _age =A0 Always =A0 > =A0 =A0 - =A0 =A0 =A0 29 (Lifetime Min/Max 0/22) > 195 Hardware_ECC_Recovered =A00x001a =A0 052 =A0 048 =A0 000 =A0 =A0Old_a= ge =A0 Always =A0 > =A0 =A0 - =A0 =A0 =A0 194549275 > 197 Current_Pending_Sector =A00x0012 =A0 001 =A0 001 =A0 000 =A0 =A0Old_a= ge =A0 Always =A0 > =A0 =A0 - =A0 =A0 =A0 4294967295 > 198 Offline_Uncorrectable =A0 0x0010 =A0 001 =A0 001 =A0 000 =A0 =A0Old_a= ge =A0 Offline =A0 > =A0 =A0 - =A0 =A0 =A0 4294967295 > 199 UDMA_CRC_Error_Count =A0 =A00x003e =A0 200 =A0 200 =A0 000 =A0 =A0Old= _age =A0 Always =A0 > =A0 =A0 - =A0 =A0 =A0 0 > 200 Multi_Zone_Error_Rate =A0 0x0000 =A0 100 =A0 253 =A0 000 =A0 =A0Old_a= ge =A0 Offline =A0 > =A0 =A0 - =A0 =A0 =A0 0 > 202 TA_Increase_Count =A0 =A0 =A0 0x0032 =A0 100 =A0 253 =A0 000 =A0 =A0O= ld_age =A0 Always =A0 > =A0 =A0 - =A0 =A0 =A0 0 > > ------------------------------------------------------------------------- > Take Surveys. Earn Cash. Influence the Future of IT > Join SourceForge.net's Techsay panel and you'll get the chance to share > your opinions on IT & business topics through brief surveys - and earn ca= sh > http://www.techsay.com/default.php?page=3Djoin.php&p=3Dsourceforge&CID=3D= DEVDEV > _______________________________________________ > Smartmontools-support mailing list > Sma...@li... > https://lists.sourceforge.net/lists/listinfo/smartmontools-support |
From: Manfred S. <man...@gm...> - 2007-01-09 23:08:02
|
> > It is now clear to me that the "4294967295" is the raw value and that the > effective number of bad sectors is 1? No, that's not clear at all, not only the raw values, also the normalized values of Current_Pending_Sector and Offline_Uncorrectable are odd. This looks very fishy, maybe a firmware flaw. Perhaps you can ask Seagate support for clarification and report back? > How can i know whether this sector is still bad or whether it is > reallocated > (i see that Reallocated_Sector_Ct is 98, does this mean that 100-98=2 > sectors > have been reallocated)? No, the "98" is the normalized number, which is quite meaningless. You have to look at the raw value, which is "90". 90 reallocated sectors are quite a lot. It may be that Seagate's disks are much more "keen" on reallocating sectors than other brands, nevertheless it's really a lot. Personally I thow disks away with more than a few handful's of bad sectors (i.e. <10-20 or so), or try RMA'ing the disk. If other items look fishy as well, my disks even fly much earlier. Disks are cheap, live is short. regards, Manfred > Why is smartd reporting every day that there are 4294967295 bad sectors, > while > the selftest passes without any indication of a bad LBA? > > I hope someone can put me on the right track to understand this issue > better. > cheers, > dirk > > On Thursday 28 December 2006 10:44, dirk ooms wrote: > > Hello list, > > > > i had an incident on my server (kernel panic, the green led on > > the seagate disk was permanent on). after removing and inserting the > disks, > > the system came up again (i have 2 disks in (software) raid 1). > > > > Since then smartd reports me following problem: > > "4294967295 Offline uncorrectable sectors" > > > > what strikes me is this number of sectors, which is 0xffffffff and which > is > > more than the number of sectors on the disk. so i'm wondering whether > i'm > > confronted here with a disk problem or a smartd problem. > > > > i ran the long selftest with smartctl and it "Completed without eror", > but > > i still have the above error showing up . > > > > thanks for any guidance on this, > > dirk > > > > partial output of smartctl -a > > > > === START OF INFORMATION SECTION === > > Device Model: ST3200827AS > > Serial Number: 5ND43GWE > > Firmware Version: 3.AAH > > User Capacity: 200,049,647,616 bytes > > Device is: Not in smartctl database [for details use: -P > showall] > > ATA Version is: 7 > > ATA Standard is: Exact ATA specification draft version not indicated > > Local Time is: Sun Dec 24 14:25:41 2006 CET > > SMART support is: Available - device has SMART capability. > > SMART support is: Enabled > > > > === START OF READ SMART DATA SECTION === > > SMART overall-health self-assessment test result: PASSED > > > > General SMART Values: > > Offline data collection status: (0x82) Offline data collection > activity > > was > completed without error. > > Auto Offline > Data Collection: > > Enabled. Self-test execution status: ( 0) The previous > self-test > > routine completed > > without > error or no self-test has > > ever been run. > > Total time to complete Offline > > data collection: ( 430) seconds. > > Offline data collection > > capabilities: (0x5b) SMART execute Offline > immediate. > > Auto Offline > data collection on/off > > support. > > Suspend > Offline collection upon new > > command. > > Offline > surface scan supported. > > Self-test > supported. > > No > Conveyance Self-test supported. > > Selective > Self-test supported. > > SMART capabilities: (0x0003) Saves SMART data before > entering > > power-saving > mode. > > Supports > SMART auto save timer. > > Error logging capability: (0x01) Error logging supported. > > General > Purpose Logging supported. > > Short self-test routine > > recommended polling time: ( 1) minutes. > > Extended self-test routine > > recommended polling time: ( 70) minutes. > > > > SMART Attributes Data Structure revision number: 10 > > Vendor Specific SMART Attributes with Thresholds: > > ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE > UPDATED > > WHEN_FAILED RAW_VALUE > > 1 Raw_Read_Error_Rate 0x000f 095 077 006 Pre-fail > Always > > - 104670423 > > 3 Spin_Up_Time 0x0003 090 090 000 > Pre-fail Always > > - 0 > > 4 Start_Stop_Count 0x0032 100 100 020 > Old_age Always > > - 43 > > 5 Reallocated_Sector_Ct 0x0033 098 098 036 Pre-fail > Always > > - 90 > > 7 Seek_Error_Rate 0x000f 055 053 030 > Pre-fail Always > > - 94495961156 > > 9 Power_On_Hours 0x0032 098 098 000 > Old_age Always > > - 2334 > > 10 Spin_Retry_Count 0x0013 100 100 097 > Pre-fail Always > > - 0 > > 12 Power_Cycle_Count 0x0032 100 100 020 Old_age > Always > > - 47 > > 187 Unknown_Attribute 0x0032 001 001 000 Old_age > Always > > - 9995 > > 189 Unknown_Attribute 0x003a 100 100 000 Old_age > Always > > - 0 > > 190 Unknown_Attribute 0x0022 071 063 045 Old_age > Always > > - 7224707121181 > > 194 Temperature_Celsius 0x0022 029 040 000 Old_age > Always > > - 29 (Lifetime Min/Max 0/22) > > 195 Hardware_ECC_Recovered 0x001a 052 048 000 Old_age > Always > > - 194549275 > > 197 Current_Pending_Sector 0x0012 001 001 000 Old_age > Always > > - 4294967295 > > 198 Offline_Uncorrectable 0x0010 001 001 000 Old_age > Offline > > - 4294967295 > > 199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age > Always > > - 0 > > 200 Multi_Zone_Error_Rate 0x0000 100 253 000 Old_age > Offline > > - 0 > > 202 TA_Increase_Count 0x0032 100 253 000 Old_age > Always > > - 0 > > > > > ------------------------------------------------------------------------- > > Take Surveys. Earn Cash. Influence the Future of IT > > Join SourceForge.net's Techsay panel and you'll get the chance to share > > your opinions on IT & business topics through brief surveys - and earn > cash > > > http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV > > _______________________________________________ > > Smartmontools-support mailing list > > Sma...@li... > > https://lists.sourceforge.net/lists/listinfo/smartmontools-support > > ------------------------------------------------------------------------- > Take Surveys. Earn Cash. Influence the Future of IT > Join SourceForge.net's Techsay panel and you'll get the chance to share > your > opinions on IT & business topics through brief surveys - and earn cash > http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV > _______________________________________________ > Smartmontools-support mailing list > Sma...@li... > https://lists.sourceforge.net/lists/listinfo/smartmontools-support -- Der GMX SmartSurfer hilft bis zu 70% Ihrer Onlinekosten zu sparen! Ideal für Modem und ISDN: http://www.gmx.net/de/go/smartsurfer |
From: dirk o. <di...@on...> - 2007-03-11 13:26:25
|
just a quick update on this issue: i contacted Seagate and they advised me = to=20 use the seagate diagnostic software to check my disks. according to the=20 seagate tool the disks are fine. smartd is still telling me every night tha= t=20 there are "4294967295 Offline uncorrectable sectors". On Wednesday 10 January 2007 00:07, Manfred Schwarb wrote: > > It is now clear to me that the "4294967295" is the raw value and that t= he > > effective number of bad sectors is 1? > > No, that's not clear at all, not only the raw values, also the normalized > values of Current_Pending_Sector and Offline_Uncorrectable are odd. > This looks very fishy, maybe a firmware flaw. > Perhaps you can ask Seagate support for clarification and report back? > > > How can i know whether this sector is still bad or whether it is > > reallocated > > (i see that Reallocated_Sector_Ct is 98, does this mean that 100-98=3D2 > > sectors > > have been reallocated)? > > No, the "98" is the normalized number, which is quite meaningless. > You have to look at the raw value, which is "90". 90 reallocated > sectors are quite a lot. It may be that Seagate's disks are much more > "keen" on reallocating sectors than other brands, nevertheless it's > really a lot. Personally I thow disks away with more than a few > handful's of bad sectors (i.e. <10-20 or so), or try RMA'ing the disk. > If other items look fishy as well, my disks even fly much earlier. > Disks are cheap, live is short. > > > regards, > Manfred > > > Why is smartd reporting every day that there are 4294967295 bad sectors, > > while > > the selftest passes without any indication of a bad LBA? > > > > I hope someone can put me on the right track to understand this issue > > better. > > cheers, > > dirk > > > > On Thursday 28 December 2006 10:44, dirk ooms wrote: > > > Hello list, > > > > > > i had an incident on my server (kernel panic, the green led on > > > the seagate disk was permanent on). after removing and inserting the > > > > disks, > > > > > the system came up again (i =A0have 2 disks in (software) raid 1). > > > > > > Since then smartd reports me following problem: > > > "4294967295 Offline uncorrectable sectors" > > > > > > what strikes me is this number of sectors, which is 0xffffffff and > > > which > > > > is > > > > > more than the number of sectors on the disk. so i'm wondering whether > > > > i'm > > > > > confronted here with a disk problem or a smartd problem. > > > > > > i ran the long selftest with smartctl and it "Completed without eror", > > > > but > > > > > i still have the above error showing up . > > > > > > thanks for any guidance on this, > > > dirk > > > > > > partial output of smartctl -a > > > > > > =3D=3D=3D START OF INFORMATION SECTION =3D=3D=3D > > > Device Model: =A0 =A0 ST3200827AS > > > Serial Number: =A0 =A05ND43GWE > > > Firmware Version: 3.AAH > > > User Capacity: =A0 =A0200,049,647,616 bytes > > > Device is: =A0 =A0 =A0 =A0Not in smartctl database [for details use: = =2DP > > > > showall] > > > > > ATA Version is: =A0 7 > > > ATA Standard is: =A0Exact ATA specification draft version not indicat= ed > > > Local Time is: =A0 =A0Sun Dec 24 14:25:41 2006 CET > > > SMART support is: Available - device has SMART capability. > > > SMART support is: Enabled > > > > > > =3D=3D=3D START OF READ SMART DATA SECTION =3D=3D=3D > > > SMART overall-health self-assessment test result: PASSED > > > > > > General SMART Values: > > > Offline data collection status: =A0(0x82) Offline data collection > > > > activity > > > > > =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 = =A0 =A0 =A0 was > > > > completed without error. > > > > > =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 = =A0 =A0 =A0 Auto Offline > > > > Data Collection: > > > Enabled. Self-test execution status: =A0 =A0 =A0( =A0 0) The previous > > > > self-test > > > > > routine completed > > > =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 = =A0 =A0 =A0 without > > > > error or no self-test has > > > > > ever been run. > > > Total time to complete Offline > > > data collection: =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 ( 430) seconds. > > > Offline data collection > > > capabilities: =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0(0x5b) SMART exe= cute Offline > > > > immediate. > > > > > =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 = =A0 =A0 =A0 Auto Offline > > > > data collection on/off > > > > > support. > > > =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 = =A0 =A0 =A0 Suspend > > > > Offline collection upon new > > > > > =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 = =A0 =A0 =A0 command. > > > =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 = =A0 =A0 =A0 Offline > > > > surface scan supported. > > > > > =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 = =A0 =A0 =A0 Self-test > > > > supported. > > > > > =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 = =A0 =A0 =A0 No > > > > Conveyance Self-test supported. > > > > > =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 = =A0 =A0 =A0 Selective > > > > Self-test supported. > > > > > SMART capabilities: =A0 =A0 =A0 =A0 =A0 =A0(0x0003) Saves SMART data = before > > > > entering > > > > > =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 = =A0 =A0 =A0 power-saving > > > > mode. > > > > > =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 = =A0 =A0 =A0 Supports > > > > SMART auto save timer. > > > > > Error logging capability: =A0 =A0 =A0 =A0(0x01) Error logging support= ed. > > > =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 = =A0 =A0 =A0 General > > > > Purpose Logging supported. > > > > > Short self-test routine > > > recommended polling time: =A0 =A0 =A0 =A0( =A0 1) minutes. > > > Extended self-test routine > > > recommended polling time: =A0 =A0 =A0 =A0( =A070) minutes. > > > > > > SMART Attributes Data Structure revision number: 10 > > > Vendor Specific SMART Attributes with Thresholds: > > > ID# ATTRIBUTE_NAME =A0 =A0 =A0 =A0 =A0FLAG =A0 =A0 VALUE WORST THRESH= TYPE =A0 > > > > =A0 =A0UPDATED =A0 > > > > > WHEN_FAILED RAW_VALUE > > > =A0 1 Raw_Read_Error_Rate =A0 =A0 0x000f =A0 095 =A0 077 =A0 006 =A0 = =A0Pre-fail > > > > =A0Always =A0 > > > > > =A0 =A0 - =A0 =A0 =A0 104670423 > > > =A0 3 Spin_Up_Time =A0 =A0 =A0 =A0 =A0 =A00x0003 =A0 090 =A0 090 =A0 = 000 =A0 > > > > =A0Pre-fail =A0Always =A0 > > > > > =A0 =A0 - =A0 =A0 =A0 0 > > > =A0 4 Start_Stop_Count =A0 =A0 =A0 =A00x0032 =A0 100 =A0 100 =A0 020 = =A0 > > > > =A0Old_age =A0 Always =A0 > > > > > =A0 =A0 - =A0 =A0 =A0 43 > > > =A0 5 Reallocated_Sector_Ct =A0 0x0033 =A0 098 =A0 098 =A0 036 =A0 = =A0Pre-fail > > > > =A0Always =A0 > > > > > =A0 =A0 - =A0 =A0 =A0 90 > > > =A0 7 Seek_Error_Rate =A0 =A0 =A0 =A0 0x000f =A0 055 =A0 053 =A0 030 = =A0 > > > > =A0Pre-fail =A0Always =A0 > > > > > =A0 =A0 - =A0 =A0 =A0 94495961156 > > > =A0 9 Power_On_Hours =A0 =A0 =A0 =A0 =A00x0032 =A0 098 =A0 098 =A0 00= 0 =A0 > > > > =A0Old_age =A0 Always =A0 > > > > > =A0 =A0 - =A0 =A0 =A0 2334 > > > =A010 Spin_Retry_Count =A0 =A0 =A0 =A00x0013 =A0 100 =A0 100 =A0 097 = =A0 > > > > =A0Pre-fail =A0Always =A0 > > > > > =A0 =A0 - =A0 =A0 =A0 0 > > > =A012 Power_Cycle_Count =A0 =A0 =A0 0x0032 =A0 100 =A0 100 =A0 020 = =A0 =A0Old_age > > > > =A0 Always =A0 > > > > > =A0 =A0 - =A0 =A0 =A0 47 > > > 187 Unknown_Attribute =A0 =A0 =A0 0x0032 =A0 001 =A0 001 =A0 000 =A0 = =A0Old_age > > > > =A0 Always =A0 > > > > > =A0 =A0 - =A0 =A0 =A0 9995 > > > 189 Unknown_Attribute =A0 =A0 =A0 0x003a =A0 100 =A0 100 =A0 000 =A0 = =A0Old_age > > > > =A0 Always =A0 > > > > > =A0 =A0 - =A0 =A0 =A0 0 > > > 190 Unknown_Attribute =A0 =A0 =A0 0x0022 =A0 071 =A0 063 =A0 045 =A0 = =A0Old_age > > > > =A0 Always =A0 > > > > > =A0 =A0 - =A0 =A0 =A0 7224707121181 > > > 194 Temperature_Celsius =A0 =A0 0x0022 =A0 029 =A0 040 =A0 000 =A0 = =A0Old_age > > > > =A0 Always =A0 > > > > > =A0 =A0 - =A0 =A0 =A0 29 (Lifetime Min/Max 0/22) > > > 195 Hardware_ECC_Recovered =A00x001a =A0 052 =A0 048 =A0 000 =A0 =A0O= ld_age =A0 > > > > Always =A0 > > > > > =A0 =A0 - =A0 =A0 =A0 194549275 > > > 197 Current_Pending_Sector =A00x0012 =A0 001 =A0 001 =A0 000 =A0 =A0O= ld_age =A0 > > > > Always =A0 > > > > > =A0 =A0 - =A0 =A0 =A0 4294967295 > > > 198 Offline_Uncorrectable =A0 0x0010 =A0 001 =A0 001 =A0 000 =A0 =A0O= ld_age =A0 > > > > Offline =A0 > > > > > =A0 =A0 - =A0 =A0 =A0 4294967295 > > > 199 UDMA_CRC_Error_Count =A0 =A00x003e =A0 200 =A0 200 =A0 000 =A0 = =A0Old_age > > > > =A0 Always =A0 > > > > > =A0 =A0 - =A0 =A0 =A0 0 > > > 200 Multi_Zone_Error_Rate =A0 0x0000 =A0 100 =A0 253 =A0 000 =A0 =A0O= ld_age =A0 > > > > Offline =A0 > > > > > =A0 =A0 - =A0 =A0 =A0 0 > > > 202 TA_Increase_Count =A0 =A0 =A0 0x0032 =A0 100 =A0 253 =A0 000 =A0 = =A0Old_age > > > > =A0 Always =A0 > > > > > =A0 =A0 - =A0 =A0 =A0 0 > > > > -----------------------------------------------------------------------= =2D- > > > > > Take Surveys. Earn Cash. Influence the Future of IT > > > Join SourceForge.net's Techsay panel and you'll get the chance to sha= re > > > your opinions on IT & business topics through brief surveys - and earn > > > > cash > > > > http://www.techsay.com/default.php?page=3Djoin.php&p=3Dsourceforge&CID= =3DDEVDEV > > > > > _______________________________________________ > > > Smartmontools-support mailing list > > > Sma...@li... > > > https://lists.sourceforge.net/lists/listinfo/smartmontools-support > > > > -----------------------------------------------------------------------= =2D- > > Take Surveys. Earn Cash. Influence the Future of IT > > Join SourceForge.net's Techsay panel and you'll get the chance to share > > your > > opinions on IT & business topics through brief surveys - and earn cash > > http://www.techsay.com/default.php?page=3Djoin.php&p=3Dsourceforge&CID= =3DDEVDEV > > _______________________________________________ > > Smartmontools-support mailing list > > Sma...@li... > > https://lists.sourceforge.net/lists/listinfo/smartmontools-support |
From: Christian F. <Chr...@t-...> - 2007-03-12 18:49:11
|
dirk ooms wrote: > just a quick update on this issue: i contacted Seagate and they advised me to > use the seagate diagnostic software to check my disks. according to the > seagate tool the disks are fine. smartd is still telling me every night that > there are "4294967295 Offline uncorrectable sectors". > > ... which is 0xffffffff or -1. Looks like an underflow due to a firmware bug. > ... > 5 Reallocated_Sector_Ct 0x0033 098 098 036 Pre-fail Always - 90 > Reallocated sector count is 90, so I would consider to replace this disk. According to the recently published Google study, the probability to fail within 60 days increases significantly after the *first* reallocated sector appears. Christian |
From: Manfred S. <man...@gm...> - 2007-03-14 08:57:07
|
> dirk ooms wrote: > > just a quick update on this issue: i contacted Seagate and they advised > me to > > use the seagate diagnostic software to check my disks. according to the > > seagate tool the disks are fine. smartd is still telling me every night > that > > there are "4294967295 Offline uncorrectable sectors". > > > > > > ... which is 0xffffffff or -1. Looks like an underflow due to a firmware > bug. > > > > ... > > 5 Reallocated_Sector_Ct 0x0033 098 098 036 Pre-fail Always > - 90 > > > > Reallocated sector count is 90, so I would consider to replace this disk. > According to the recently published Google study, the probability to > fail within 60 days increases significantly after the *first* > reallocated sector appears. > > Christian > Seconded. But Dirk may nevertheless try to get replacement for this disk, if still within warranty. 90 failed sectors is simply beyond a healthy state. I think there are reports on this list where people mangaged to get replacement although Seatools told everything was fine. Just send in the "smartctl -a" output and try your luck. Manfred > > ------------------------------------------------------------------------- > Take Surveys. Earn Cash. Influence the Future of IT > Join SourceForge.net's Techsay panel and you'll get the chance to share > your > opinions on IT & business topics through brief surveys-and earn cash > http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV > _______________________________________________ > Smartmontools-support mailing list > Sma...@li... > https://lists.sourceforge.net/lists/listinfo/smartmontools-support -- "Feel free" - 10 GB Mailbox, 100 FreeSMS/Monat ... Jetzt GMX TopMail testen: www.gmx.net/de/go/mailfooter/topmail-out |