From: Ralf P. <pa...@ki...> - 2003-02-20 08:50:03
|
Hi all! what does this output mean (output from ./smartctl -c /dev/hdd ) -> === START OF READ SMART DATA SECTION === SMART Error Log Version: 1 ATA Error Count: 2 ... Error 2 occurred at disk power-on lifetime: 2 hours When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER:10 SC:00 SN:4c CL:18 CH:f2 D/H:b2 ST:51 Sequence of commands leading to the command that caused the error were: DCR FR SC SN CL CH D/H CR Timestamp 02 40 00 00 00 f2 f2 82 7302.400 02 40 00 00 c0 f1 f2 82 7302.000 02 40 00 00 80 f1 f2 82 7301.700 02 40 00 00 40 f1 f2 82 7301.300 02 40 00 00 00 f1 f2 82 7300.900 Error 1 occurred at disk power-on lifetime: 1 hours ... What is wrong with my hard disk? The healtstatus-command of smartctl (./smartctl -H /dev/hdd) says ... === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED And all S.M.A.R.T Attributes are ok. Thanks ! Ralf |
From: Bruce A. <ba...@gr...> - 2003-02-21 04:39:17
|
Hi Ralf, On Thu, 20 Feb 2003, Ralf Panse wrote: > Hi all! > what does this output mean (output from ./smartctl -c /dev/hdd > ) -> > > === START OF READ SMART DATA SECTION === > SMART Error Log Version: 1 > ATA Error Count: 2 > ... > Error 2 occurred at disk power-on lifetime: 2 hours > When the command that caused the error occurred, the device was active or > idle. > After command completion occurred, registers were: > ER:10 SC:00 SN:4c CL:18 CH:f2 D/H:b2 ST:51 > Sequence of commands leading to the command that caused the error were: > DCR FR SC SN CL CH D/H CR Timestamp > 02 40 00 00 00 f2 f2 82 7302.400 > 02 40 00 00 c0 f1 f2 82 7302.000 > 02 40 00 00 80 f1 f2 82 7301.700 > 02 40 00 00 40 f1 f2 82 7301.300 > 02 40 00 00 00 f1 f2 82 7300.900 > > Error 1 occurred at disk power-on lifetime: 1 hours > ... > This is the ATA error log. The types of errors that it indicates are described in this document: http://www.t13.org/project/d1321r1c.pdf please see section 8.41.6.8.2.4 (Device Error Count) which starts on page 204. The output listed are the five commands leading up to the command that caused the error. The different columns refer to different ATA registers. > What is wrong with my hard disk? Probably nothing. The last of there errors occured when the disk was just a few hours old (7300 seconds after it was first turned on). This may have been due to strange or incorrect hdparm (DMA mode, etc) settings, a loose cable, or something else wrong with the disk. Assuming that the disk is more than a few hours old, it's not been exhibiting the errors recently. If you want some more reassurance, post the output of smartctl -a, please. You might also want to run some extended self-tests and examine the self-test log. You can do both of these things with smartctl. > The healtstatus-command of smartctl (./smartctl -H /dev/hdd) says ... > > === START OF READ SMART DATA SECTION === > SMART overall-health self-assessment test result: PASSED > > And all S.M.A.R.T Attributes are ok. > > > Thanks ! You're welcome! Bruce |
From: Ralf P. <rp...@ki...> - 2003-02-21 15:23:52
|
Hi Bruce ! Here thr output from smartctl -a /dev/hdd: smartctl version 5.1-4 Copyright (C) 2002 Bruce Allen Home page is http://smartmontools.sourceforge.net/ =3D=3D=3D START OF INFORMATION SECTION =3D=3D=3D Device Model: IBM-DTLA-307045 Serial Number: YMDYMHA0675 Firmware Version: TX6OA50C ATA Version is: 5 ATA Standard is: ATA/ATAPI-5 T13 1321D revision 1 Local Time is: Fri Feb 21 16:09:14 2003 CET SMART support is: Available - device has SMART capability. SMART support is: Enabled =3D=3D=3D START OF READ SMART DATA SECTION =3D=3D=3D SMART overall-health self-assessment test result: PASSED General SMART Values: Off-line data collection status: (0x02) Offline data collection activity completed without error. Self-test execution status: ( 0) The previous self-test routine=20 completed without error or no self-test has= ever been run. Total time to complete off-line data collection: (2294) seconds. Offline data collection capabilities: (0x1b) SMART execute Offline immediate. Automatic timer ON/OFF support. Suspend Offline collection upon n= ew command. Offline surface scan supported. Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 28) minutes. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE WHEN_FAI= LED=20 RAW_VALUE 1 Raw_Read_Error_Rate 0x000b 100 100 060 Pre-fail - = 1 2 Throughput_Performance 0x0005 132 132 050 Pre-fail - = =20 340 3 Spin_Up_Time 0x0007 094 094 024 Pre-fail - = =20 25789530422 4 Start_Stop_Count 0x0012 100 100 000 Old_age - = =20 52 5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail - = 0 7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail - = 0 8 Seek_Time_Performance 0x0005 130 130 020 Pre-fail - = =20 34 9 Power_On_Hours 0x0012 100 100 000 Old_age - = =20 119 10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail - = 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age - = =20 52 192 Power-Off_Retract_Count 0x0032 100 100 050 Old_age - = =20 52 193 Load_Cycle_Count 0x0012 100 100 050 Old_age - = =20 52 194 Temperature_Celsius 0x0002 171 171 000 Old_age - = =20 32 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age - = 0 197 Current_Pending_Sector 0x0022 100 100 000 Old_age - = 0 198 Offline_Uncorrectable 0x0008 100 100 000 Old_age - = 0 199 UDMA_CRC_Error_Count 0x000a 200 200 000 Old_age - = 0 SMART Error Log Version: 1 ATA Error Count: 2 DCR =3D Device Control Register FR =3D Features Register SC =3D Sector Count Register SN =3D Sector Number Register CL =3D Cylinder Low Register CH =3D Cylinder High Register D/H =3D Device/Head Register CR =3D Content written to Command Register ER =3D Error register STA =3D Status register Timestamp is seconds since the previous disk power-on. Note: timestamp "wraps" after 2^32 msec =3D 49.710 days. Error 2 occurred at disk power-on lifetime: 2 hours When the command that caused the error occurred, the device was active or= =20 idle. After command completion occurred, registers were: ER:10 SC:00 SN:4c CL:18 CH:f2 D/H:b2 ST:51 Sequence of commands leading to the command that caused the error were: DCR FR SC SN CL CH D/H CR Timestamp 02 40 00 00 00 f2 f2 82 7302.400 02 40 00 00 c0 f1 f2 82 7302.000 02 40 00 00 80 f1 f2 82 7301.700 02 40 00 00 40 f1 f2 82 7301.300 02 40 00 00 00 f1 f2 82 7300.900 Error 1 occurred at disk power-on lifetime: 1 hours When the command that caused the error occurred, the device was active or= =20 idle. After command completion occurred, registers were: ER:10 SC:00 SN:00 CL:00 CH:00 D/H:b0 ST:51 Sequence of commands leading to the command that caused the error were: DCR FR SC SN CL CH D/H CR Timestamp 02 11 00 00 00 00 b0 f7 5132.900 02 00 00 00 00 00 b0 f3 5132.900 02 00 00 00 00 00 b0 ec 5132.900 02 00 40 00 c3 00 f0 40 5132.800 02 00 40 c0 c2 00 f0 40 5132.800 SMART Self-test log, version number 1 Num Test_Description Status Remaining LifeTime(hour= s) =20 LBA_of_first_error # 1 Short off-line Completed 00% 117 = - # 2 Short off-line Completed 00% 92 = - # 3 Short off-line Completed 00% 89 = - # 4 Extended off-line Completed 00% 69 = - # 5 Short off-line Completed 00% 2 = - # 6 Short off-line Completed 00% 0 = - So, may hard disk must be ok !?! There is another strange value in the SMART Attribute. With another disk = (IBM)=20 smartctl return a smaller value for Start_Stop_Count than Power_On_Hours= =2E 4 Start_Stop_Count 0x0012 100 100 000 Old_age - = 197=20 9 Power_On_Hours 0x0012 100 100 000 Old_age - 1= 16 12 Power_Cycle_Count 0x0032 100 100 000 Old_age - 197 192 Power-Off_Retract_Count 0x0032 100 100 050 Old_age - 197 193 Load_Cycle_Count 0x0012 100 100 050 Old_age - 19= 7 My disk runs more than one hour per power_on. Can you explain me this val= ue ? Thanks a lot! Ralf Am Freitag, 21. Februar 2003 05:39 schrieb Bruce Allen: > Hi Ralf, > > On Thu, 20 Feb 2003, Ralf Panse wrote: > > Hi all! > > what does this output mean (output from ./smartctl -c /dev/hdd > > ) -> > > > > =3D=3D=3D START OF READ SMART DATA SECTION =3D=3D=3D > > SMART Error Log Version: 1 > > ATA Error Count: 2 > > ... > > Error 2 occurred at disk power-on lifetime: 2 hours > > When the command that caused the error occurred, the device was activ= e or > > idle. > > After command completion occurred, registers were: > > ER:10 SC:00 SN:4c CL:18 CH:f2 D/H:b2 ST:51 > > Sequence of commands leading to the command that caused the error wer= e: > > DCR FR SC SN CL CH D/H CR Timestamp > > 02 40 00 00 00 f2 f2 82 7302.400 > > 02 40 00 00 c0 f1 f2 82 7302.000 > > 02 40 00 00 80 f1 f2 82 7301.700 > > 02 40 00 00 40 f1 f2 82 7301.300 > > 02 40 00 00 00 f1 f2 82 7300.900 > > > > Error 1 occurred at disk power-on lifetime: 1 hours > > ... > > This is the ATA error log. The types of errors that it indicates are > described in this document: http://www.t13.org/project/d1321r1c.pdf ple= ase > see section 8.41.6.8.2.4 (Device Error Count) which starts on page 204. > > The output listed are the five commands leading up to the command that > caused the error. The different columns refer to different ATA registe= rs. > > > What is wrong with my hard disk? > > Probably nothing. The last of there errors occured when the disk was j= ust > a few hours old (7300 seconds after it was first turned on). This may > have been due to strange or incorrect hdparm (DMA mode, etc) settings, = a > loose cable, or something else wrong with the disk. > > Assuming that the disk is more than a few hours old, it's not been > exhibiting the errors recently. If you want some more reassurance, pos= t > the output of smartctl -a, please. > > You might also want to run some extended self-tests and examine the > self-test log. You can do both of these things with smartctl. > > > The healtstatus-command of smartctl (./smartctl -H /dev/hdd) says ... > > > > =3D=3D=3D START OF READ SMART DATA SECTION =3D=3D=3D > > SMART overall-health self-assessment test result: PASSED > > > > And all S.M.A.R.T Attributes are ok. > > > > > > Thanks ! > > You're welcome! > > Bruce |
From: Bruce A. <ba...@gr...> - 2003-02-21 15:58:31
|
Hi Ralf, (My comments are inserted below) On Fri, 21 Feb 2003, Ralf Panse wrote: > Hi Bruce ! > > Here thr output from smartctl -a /dev/hdd: > > smartctl version 5.1-4 Copyright (C) 2002 Bruce Allen > Home page is http://smartmontools.sourceforge.net/ > > === START OF INFORMATION SECTION === > Device Model: IBM-DTLA-307045 > Serial Number: YMDYMHA0675 > Firmware Version: TX6OA50C > ATA Version is: 5 > ATA Standard is: ATA/ATAPI-5 T13 1321D revision 1 > Local Time is: Fri Feb 21 16:09:14 2003 CET > SMART support is: Available - device has SMART capability. > SMART support is: Enabled > > === START OF READ SMART DATA SECTION === > SMART overall-health self-assessment test result: PASSED > > General SMART Values: > Off-line data collection status: (0x02) Offline data collection activity > completed without error. > Self-test execution status: ( 0) The previous self-test routine > completed > without error or no self-test has ever > been run. > Total time to complete off-line > data collection: (2294) seconds. > Offline data collection > capabilities: (0x1b) SMART execute Offline immediate. > Automatic timer ON/OFF support. > Suspend Offline collection upon new > command. > Offline surface scan supported. > Self-test supported. > SMART capabilities: (0x0003) Saves SMART data before entering > power-saving mode. > Supports SMART auto save timer. > Error logging capability: (0x01) Error logging supported. > Short self-test routine > recommended polling time: ( 2) minutes. > Extended self-test routine > recommended polling time: ( 28) minutes. > > SMART Attributes Data Structure revision number: 16 > Vendor Specific SMART Attributes with Thresholds: > ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE WHEN_FAILED > RAW_VALUE > 1 Raw_Read_Error_Rate 0x000b 100 100 060 Pre-fail - 1 > 2 Throughput_Performance 0x0005 132 132 050 Pre-fail - > 340 > 3 Spin_Up_Time 0x0007 094 094 024 Pre-fail - > 25789530422 > 4 Start_Stop_Count 0x0012 100 100 000 Old_age - > 52 > 5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail - 0 > 7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail - 0 > 8 Seek_Time_Performance 0x0005 130 130 020 Pre-fail - > 34 > 9 Power_On_Hours 0x0012 100 100 000 Old_age - > 119 > 10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail - 0 > 12 Power_Cycle_Count 0x0032 100 100 000 Old_age - > 52 > 192 Power-Off_Retract_Count 0x0032 100 100 050 Old_age - > 52 > 193 Load_Cycle_Count 0x0012 100 100 050 Old_age - > 52 > 194 Temperature_Celsius 0x0002 171 171 000 Old_age - > 32 > 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age - 0 > 197 Current_Pending_Sector 0x0022 100 100 000 Old_age - 0 > 198 Offline_Uncorrectable 0x0008 100 100 000 Old_age - 0 > 199 UDMA_CRC_Error_Count 0x000a 200 200 000 Old_age - 0 > > SMART Error Log Version: 1 > ATA Error Count: 2 > DCR = Device Control Register > FR = Features Register > SC = Sector Count Register > SN = Sector Number Register > CL = Cylinder Low Register > CH = Cylinder High Register > D/H = Device/Head Register > CR = Content written to Command Register > ER = Error register > STA = Status register > Timestamp is seconds since the previous disk power-on. > Note: timestamp "wraps" after 2^32 msec = 49.710 days. > > Error 2 occurred at disk power-on lifetime: 2 hours > When the command that caused the error occurred, the device was active or > idle. > After command completion occurred, registers were: > ER:10 SC:00 SN:4c CL:18 CH:f2 D/H:b2 ST:51 > Sequence of commands leading to the command that caused the error were: > DCR FR SC SN CL CH D/H CR Timestamp > 02 40 00 00 00 f2 f2 82 7302.400 > 02 40 00 00 c0 f1 f2 82 7302.000 > 02 40 00 00 80 f1 f2 82 7301.700 > 02 40 00 00 40 f1 f2 82 7301.300 > 02 40 00 00 00 f1 f2 82 7300.900 > > Error 1 occurred at disk power-on lifetime: 1 hours > When the command that caused the error occurred, the device was active or > idle. > After command completion occurred, registers were: > ER:10 SC:00 SN:00 CL:00 CH:00 D/H:b0 ST:51 > Sequence of commands leading to the command that caused the error were: > DCR FR SC SN CL CH D/H CR Timestamp > 02 11 00 00 00 00 b0 f7 5132.900 > 02 00 00 00 00 00 b0 f3 5132.900 > 02 00 00 00 00 00 b0 ec 5132.900 > 02 00 40 00 c3 00 f0 40 5132.800 > 02 00 40 c0 c2 00 f0 40 5132.800 > > SMART Self-test log, version number 1 > Num Test_Description Status Remaining LifeTime(hours) > LBA_of_first_error > # 1 Short off-line Completed 00% 117 - > # 2 Short off-line Completed 00% 92 - > # 3 Short off-line Completed 00% 89 - > # 4 Extended off-line Completed 00% 69 - > # 5 Short off-line Completed 00% 2 - > # 6 Short off-line Completed 00% 0 - > > > So, may hard disk must be ok !?! The disk looks fine. The two entries in the ATA error log were at 1 hours 25 minutes after the disk was first powered up, and 2 hours and 2 minutes after the disk was first powered up. [Perhaps you were playing the hdparm??]. The disk is now 119 hours old and hasn't shown any further errors. The time to worry is if the ATA error log starts showing hundreds or thousands of errors in the very recent past. And the self-tests all completed OK. > There is another strange value in the SMART Attribute. With another disk (IBM) > smartctl return a smaller value for Start_Stop_Count than Power_On_Hours. In itself, that's OK. Start_Stop_Count is the number of times that the disk has spun up. For a machine that is turned on and off once per month, and has been running for a year, this would be 12. But Power_On_Hours would be one year: 8760. Note that Start_Stop_Count can also change when the disk sleeps. Now these results: > 4 Start_Stop_Count 0x0012 100 100 000 Old_age - 197 > 9 Power_On_Hours 0x0012 100 100 000 Old_age - 116 > 12 Power_Cycle_Count 0x0032 100 100 000 Old_age - 197 > 192 Power-Off_Retract_Count 0x0032 100 100 050 Old_age - 197 > 193 Load_Cycle_Count 0x0012 100 100 050 Old_age - 197 look very strange. Is this a laptop disk? Please post the output of smartctl -a for this disk. > My disk runs more than one hour per power_on. Can you explain me this value ? Is the disk sleeping (either a laptop disk or desktop machine that suspends a lot)? > Thanks a lot! You're welcome! Bruce > > Ralf > > > Am Freitag, 21. Februar 2003 05:39 schrieb Bruce Allen: > > Hi Ralf, > > > > On Thu, 20 Feb 2003, Ralf Panse wrote: > > > Hi all! > > > what does this output mean (output from ./smartctl -c /dev/hdd > > > ) -> > > > > > > === START OF READ SMART DATA SECTION === > > > SMART Error Log Version: 1 > > > ATA Error Count: 2 > > > ... > > > Error 2 occurred at disk power-on lifetime: 2 hours > > > When the command that caused the error occurred, the device was active or > > > idle. > > > After command completion occurred, registers were: > > > ER:10 SC:00 SN:4c CL:18 CH:f2 D/H:b2 ST:51 > > > Sequence of commands leading to the command that caused the error were: > > > DCR FR SC SN CL CH D/H CR Timestamp > > > 02 40 00 00 00 f2 f2 82 7302.400 > > > 02 40 00 00 c0 f1 f2 82 7302.000 > > > 02 40 00 00 80 f1 f2 82 7301.700 > > > 02 40 00 00 40 f1 f2 82 7301.300 > > > 02 40 00 00 00 f1 f2 82 7300.900 > > > > > > Error 1 occurred at disk power-on lifetime: 1 hours > > > ... > > > > This is the ATA error log. The types of errors that it indicates are > > described in this document: http://www.t13.org/project/d1321r1c.pdf please > > see section 8.41.6.8.2.4 (Device Error Count) which starts on page 204. > > > > The output listed are the five commands leading up to the command that > > caused the error. The different columns refer to different ATA registers. > > > > > What is wrong with my hard disk? > > > > Probably nothing. The last of there errors occured when the disk was just > > a few hours old (7300 seconds after it was first turned on). This may > > have been due to strange or incorrect hdparm (DMA mode, etc) settings, a > > loose cable, or something else wrong with the disk. > > > > Assuming that the disk is more than a few hours old, it's not been > > exhibiting the errors recently. If you want some more reassurance, post > > the output of smartctl -a, please. > > > > You might also want to run some extended self-tests and examine the > > self-test log. You can do both of these things with smartctl. > > > > > The healtstatus-command of smartctl (./smartctl -H /dev/hdd) says ... > > > > > > === START OF READ SMART DATA SECTION === > > > SMART overall-health self-assessment test result: PASSED > > > > > > And all S.M.A.R.T Attributes are ok. > > > > > > > > > Thanks ! > > > > You're welcome! > > > > Bruce > > > > > > ------------------------------------------------------- > This SF.net email is sponsored by: SlickEdit Inc. Develop an edge. > The most comprehensive and flexible code editor you can use. > Code faster. C/C++, C#, Java, HTML, XML, many more. FREE 30-Day Trial. > www.slickedit.com/sourceforge > _______________________________________________ > Smartmontools-support mailing list > Sma...@li... > https://lists.sourceforge.net/lists/listinfo/smartmontools-support > |
From: Ralf P. <rp...@ki...> - 2003-02-21 16:12:33
|
Hi Bruce Here the output from my strange disk with the low Power_On value. It's no= t a=20 laptop disk and the pc is turned off every evening. smartctl version 5.1-4 Copyright (C) 2002 Bruce Allen Home page is http://smartmontools.sourceforge.net/ =3D=3D=3D START OF INFORMATION SECTION =3D=3D=3D Device Model: IC35L040AVER07-0 Serial Number: SXPTX393636 Firmware Version: ER4OA46A ATA Version is: 5 ATA Standard is: ATA/ATAPI-5 T13 1321D revision 1 Local Time is: Fri Feb 21 17:01:37 2003 CET SMART support is: Available - device has SMART capability. SMART support is: Enabled =3D=3D=3D START OF READ SMART DATA SECTION =3D=3D=3D SMART overall-health self-assessment test result: PASSED General SMART Values: Off-line data collection status: (0x00) Offline data collection activity = was never started. Self-test execution status: ( 0) The previous self-test routine=20 completed without error or no self-test has= ever been run. Total time to complete off-line data collection: (1383) seconds. Offline data collection capabilities: (0x1b) SMART execute Offline immediate. Automatic timer ON/OFF support. Suspend Offline collection upon n= ew command. Offline surface scan supported. Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. Short self-test routine recommended polling time: ( 1) minutes. Extended self-test routine recommended polling time: ( 23) minutes. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE WHEN_FAI= LED=20 RAW_VALUE 1 Raw_Read_Error_Rate 0x000b 095 095 060 Pre-fail - = =20 131084 2 Throughput_Performance 0x0005 100 100 050 Pre-fail - = 0 3 Spin_Up_Time 0x0007 104 104 024 Pre-fail - = =20 17193697490 4 Start_Stop_Count 0x0012 100 100 000 Old_age - = =20 197 5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail - = 0 7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail - = 0 8 Seek_Time_Performance 0x0005 100 100 020 Pre-fail - = 0 9 Power_On_Hours 0x0012 100 100 000 Old_age - = =20 117 10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail - = 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age - = =20 197 192 Power-Off_Retract_Count 0x0032 100 100 050 Old_age - = =20 197 193 Load_Cycle_Count 0x0012 100 100 050 Old_age - = =20 197 194 Temperature_Celsius 0x0002 141 141 000 Old_age - = =20 39 (Lifetime Min/Max 19/47) 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age - = 0 197 Current_Pending_Sector 0x0022 100 100 000 Old_age - = 0 198 Offline_Uncorrectable 0x0008 100 100 000 Old_age - = 0 199 UDMA_CRC_Error_Count 0x000a 200 200 000 Old_age - = 0 SMART Error Log Version: 1 No Errors Logged SMART Self-test log, version number 1 Num Test_Description Status Remaining LifeTime(hour= s) =20 LBA_of_first_error # 1 Short off-line Completed 00% 115 = - # 2 Short off-line Completed 00% 104 = - # 3 Short off-line Completed 00% 104 = - # 4 Short off-line Completed 00% 104 = - # 5 Short off-line Completed 00% 103 = - # 6 Short off-line Completed 00% 103 = - # 7 Short off-line Completed 00% 103 = - # 8 Short off-line Completed 00% 102 = - # 9 Extended off-line Completed 00% 101 = - #10 Short off-line Completed 00% 99 = - #11 Short off-line Completed 00% 99 = - #12 Short off-line Completed 00% 99 = - #13 Short off-line Completed 00% 99 = - #14 Short off-line Completed 00% 99 = - #15 Short off-line Completed 00% 99 = - #16 Short off-line Completed 00% 99 = - #17 Short off-line Completed 00% 97 = - #18 Short off-line Completed 00% 97 = - #19 Short off-line Completed 00% 97 = - #20 Short off-line Completed 00% 97 = - #21 Short off-line Completed 00% 97 = - Thanks=20 Ralf Am Freitag, 21. Februar 2003 16:58 schrieben Sie: > Hi Ralf, > > (My comments are inserted below) > > On Fri, 21 Feb 2003, Ralf Panse wrote: > > Hi Bruce ! > > > > Here thr output from smartctl -a /dev/hdd: > > > > smartctl version 5.1-4 Copyright (C) 2002 Bruce Allen > > Home page is http://smartmontools.sourceforge.net/ > > > > =3D=3D=3D START OF INFORMATION SECTION =3D=3D=3D > > Device Model: IBM-DTLA-307045 > > Serial Number: YMDYMHA0675 > > Firmware Version: TX6OA50C > > ATA Version is: 5 > > ATA Standard is: ATA/ATAPI-5 T13 1321D revision 1 > > Local Time is: Fri Feb 21 16:09:14 2003 CET > > SMART support is: Available - device has SMART capability. > > SMART support is: Enabled > > > > =3D=3D=3D START OF READ SMART DATA SECTION =3D=3D=3D > > SMART overall-health self-assessment test result: PASSED > > > > General SMART Values: > > Off-line data collection status: (0x02) Offline data collection activ= ity > > completed without error. > > Self-test execution status: ( 0) The previous self-test routin= e > > completed > > without error or no self-test= has > > ever been run. > > Total time to complete off-line > > data collection: (2294) seconds. > > Offline data collection > > capabilities: (0x1b) SMART execute Offline immedia= te. > > Automatic timer ON/OFF suppor= t. > > Suspend Offline collection up= on > > new command. > > Offline surface scan supporte= d. > > Self-test supported. > > SMART capabilities: (0x0003) Saves SMART data before enter= ing > > power-saving mode. > > Supports SMART auto save time= r. > > Error logging capability: (0x01) Error logging supported. > > Short self-test routine > > recommended polling time: ( 2) minutes. > > Extended self-test routine > > recommended polling time: ( 28) minutes. > > > > SMART Attributes Data Structure revision number: 16 > > Vendor Specific SMART Attributes with Thresholds: > > ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE =20 > > WHEN_FAILED RAW_VALUE > > 1 Raw_Read_Error_Rate 0x000b 100 100 060 Pre-fail = - =20 > > 1 2 Throughput_Performance 0x0005 132 132 050 Pre-fail = - > > 340 > > 3 Spin_Up_Time 0x0007 094 094 024 Pre-fail = - > > 25789530422 > > 4 Start_Stop_Count 0x0012 100 100 000 Old_age = - > > 52 > > 5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail = - =20 > > 0 7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail = - > > 0 8 Seek_Time_Performance 0x0005 130 130 020 Pre-fai= l =20 > > - 34 > > 9 Power_On_Hours 0x0012 100 100 000 Old_age = - > > 119 > > 10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail = - =20 > > 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age = =20 > > - 52 > > 192 Power-Off_Retract_Count 0x0032 100 100 050 Old_age = - > > 52 > > 193 Load_Cycle_Count 0x0012 100 100 050 Old_age = - > > 52 > > 194 Temperature_Celsius 0x0002 171 171 000 Old_age = - > > 32 > > 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age = - =20 > > 0 197 Current_Pending_Sector 0x0022 100 100 000 Old_age = =20 > > - 0 198 Offline_Uncorrectable 0x0008 100 100 000 Old= _age > > - 0 199 UDMA_CRC_Error_Count 0x000a 200 200 000 = =20 > > Old_age - 0 > > > > SMART Error Log Version: 1 > > ATA Error Count: 2 > > DCR =3D Device Control Register > > FR =3D Features Register > > SC =3D Sector Count Register > > SN =3D Sector Number Register > > CL =3D Cylinder Low Register > > CH =3D Cylinder High Register > > D/H =3D Device/Head Register > > CR =3D Content written to Command Register > > ER =3D Error register > > STA =3D Status register > > Timestamp is seconds since the previous disk power-on. > > Note: timestamp "wraps" after 2^32 msec =3D 49.710 days. > > > > Error 2 occurred at disk power-on lifetime: 2 hours > > When the command that caused the error occurred, the device was activ= e or > > idle. > > After command completion occurred, registers were: > > ER:10 SC:00 SN:4c CL:18 CH:f2 D/H:b2 ST:51 > > Sequence of commands leading to the command that caused the error wer= e: > > DCR FR SC SN CL CH D/H CR Timestamp > > 02 40 00 00 00 f2 f2 82 7302.400 > > 02 40 00 00 c0 f1 f2 82 7302.000 > > 02 40 00 00 80 f1 f2 82 7301.700 > > 02 40 00 00 40 f1 f2 82 7301.300 > > 02 40 00 00 00 f1 f2 82 7300.900 > > > > Error 1 occurred at disk power-on lifetime: 1 hours > > When the command that caused the error occurred, the device was activ= e or > > idle. > > After command completion occurred, registers were: > > ER:10 SC:00 SN:00 CL:00 CH:00 D/H:b0 ST:51 > > Sequence of commands leading to the command that caused the error wer= e: > > DCR FR SC SN CL CH D/H CR Timestamp > > 02 11 00 00 00 00 b0 f7 5132.900 > > 02 00 00 00 00 00 b0 f3 5132.900 > > 02 00 00 00 00 00 b0 ec 5132.900 > > 02 00 40 00 c3 00 f0 40 5132.800 > > 02 00 40 c0 c2 00 f0 40 5132.800 > > > > SMART Self-test log, version number 1 > > Num Test_Description Status Remaining=20 > > LifeTime(hours) LBA_of_first_error > > # 1 Short off-line Completed 00% 117 = =20 > > - # 2 Short off-line Completed 00% = 92 > > - # 3 Short off-line Completed 00% = =20 > > 89 - # 4 Extended off-line Completed = 00% > > 69 - # 5 Short off-line Completed = =20 > > 00% 2 - # 6 Short off-line Completed = =20 > > 00% 0 - > > > > > > So, may hard disk must be ok !?! > > The disk looks fine. The two entries in the ATA error log were at 1 ho= urs > 25 minutes after the disk was first powered up, and 2 hours and 2 minut= es > after the disk was first powered up. [Perhaps you were playing the > hdparm??]. The disk is now 119 hours old and hasn't shown any further > errors. The time to worry is if the ATA error log starts showing hundr= eds > or thousands of errors in the very recent past. > > And the self-tests all completed OK. > > > There is another strange value in the SMART Attribute. With another d= isk > > (IBM) smartctl return a smaller value for Start_Stop_Count than > > Power_On_Hours. > > In itself, that's OK. Start_Stop_Count is the number of times that the > disk has spun up. For a machine that is turned on and off once per mon= th, > and has been running for a year, this would be 12. But Power_On_Hours > would be one year: 8760. > > Note that Start_Stop_Count can also change when the disk sleeps. > > Now these results: > > 4 Start_Stop_Count 0x0012 100 100 000 Old_age - = =20 > > 197 9 Power_On_Hours 0x0012 100 100 000 Old_age -= =20 > > 116 12 Power_Cycle_Count 0x0032 100 100 000 Old_age = - =20 > > 197 192 Power-Off_Retract_Count 0x0032 100 100 050 Old_age= - > > 197 193 Load_Cycle_Count 0x0012 100 100 050 Old_age = -=20 > > 197 > > look very strange. Is this a laptop disk? Please post the output of > smartctl -a for this disk. > > > My disk runs more than one hour per power_on. Can you explain me this > > value ? > > Is the disk sleeping (either a laptop disk or desktop machine that > suspends a lot)? > > > Thanks a lot! > > You're welcome! > > Bruce > > > Ralf > > > > Am Freitag, 21. Februar 2003 05:39 schrieb Bruce Allen: > > > Hi Ralf, > > > > > > On Thu, 20 Feb 2003, Ralf Panse wrote: > > > > Hi all! > > > > what does this output mean (output from ./smartctl -c /dev/hdd > > > > ) -> > > > > > > > > =3D=3D=3D START OF READ SMART DATA SECTION =3D=3D=3D > > > > SMART Error Log Version: 1 > > > > ATA Error Count: 2 > > > > ... > > > > Error 2 occurred at disk power-on lifetime: 2 hours > > > > When the command that caused the error occurred, the device was > > > > active or idle. > > > > After command completion occurred, registers were: > > > > ER:10 SC:00 SN:4c CL:18 CH:f2 D/H:b2 ST:51 > > > > Sequence of commands leading to the command that caused the error > > > > were: DCR FR SC SN CL CH D/H CR Timestamp > > > > 02 40 00 00 00 f2 f2 82 7302.400 > > > > 02 40 00 00 c0 f1 f2 82 7302.000 > > > > 02 40 00 00 80 f1 f2 82 7301.700 > > > > 02 40 00 00 40 f1 f2 82 7301.300 > > > > 02 40 00 00 00 f1 f2 82 7300.900 > > > > > > > > Error 1 occurred at disk power-on lifetime: 1 hours > > > > ... > > > > > > This is the ATA error log. The types of errors that it indicates a= re > > > described in this document: http://www.t13.org/project/d1321r1c.pdf > > > please see section 8.41.6.8.2.4 (Device Error Count) which starts o= n > > > page 204. > > > > > > The output listed are the five commands leading up to the command t= hat > > > caused the error. The different columns refer to different ATA > > > registers. > > > > > > > What is wrong with my hard disk? > > > > > > Probably nothing. The last of there errors occured when the disk w= as > > > just a few hours old (7300 seconds after it was first turned on). = This > > > may have been due to strange or incorrect hdparm (DMA mode, etc) > > > settings, a loose cable, or something else wrong with the disk. > > > > > > Assuming that the disk is more than a few hours old, it's not been > > > exhibiting the errors recently. If you want some more reassurance, > > > post the output of smartctl -a, please. > > > > > > You might also want to run some extended self-tests and examine the > > > self-test log. You can do both of these things with smartctl. > > > > > > > The healtstatus-command of smartctl (./smartctl -H /dev/hdd) says= ... > > > > > > > > =3D=3D=3D START OF READ SMART DATA SECTION =3D=3D=3D > > > > SMART overall-health self-assessment test result: PASSED > > > > > > > > And all S.M.A.R.T Attributes are ok. > > > > > > > > > > > > Thanks ! > > > > > > You're welcome! > > > > > > Bruce > > > > ------------------------------------------------------- > > This SF.net email is sponsored by: SlickEdit Inc. Develop an edge. > > The most comprehensive and flexible code editor you can use. > > Code faster. C/C++, C#, Java, HTML, XML, many more. FREE 30-Day Trial= =2E > > www.slickedit.com/sourceforge > > _______________________________________________ > > Smartmontools-support mailing list > > Sma...@li... > > https://lists.sourceforge.net/lists/listinfo/smartmontools-support --=20 Ralf Panse Kirchhoff-Institut f=FCr Physik Tel: 06221 54 9811 |
From: Bruce A. <ba...@gr...> - 2003-02-21 18:42:14
|
Hi Ralf, OK, I think I see what's happening. I think that you may have an IBM disk that has defective SMART firmware. Go to this page: http://www.geocities.com/dtla_update/ and follow one of the "Related links" for the 60GXP disk. This points to revised IBM firmware that should fix the problem. [You will find a link to this page on the smartmontools web page under the FAQs]. Please let me know if this helps! Cheers, =09Bruce On Fri, 21 Feb 2003, Ralf Panse wrote: > Hi Bruce >=20 > Here the output from my strange disk with the low Power_On value. It's no= t a=20 > laptop disk and the pc is turned off every evening. >=20 >=20 > smartctl version 5.1-4 Copyright (C) 2002 Bruce Allen > Home page is http://smartmontools.sourceforge.net/ >=20 > =3D=3D=3D START OF INFORMATION SECTION =3D=3D=3D > Device Model: IC35L040AVER07-0 > Serial Number: SXPTX393636 > Firmware Version: ER4OA46A > ATA Version is: 5 > ATA Standard is: ATA/ATAPI-5 T13 1321D revision 1 > Local Time is: Fri Feb 21 17:01:37 2003 CET > SMART support is: Available - device has SMART capability. > SMART support is: Enabled >=20 > =3D=3D=3D START OF READ SMART DATA SECTION =3D=3D=3D > SMART overall-health self-assessment test result: PASSED >=20 > General SMART Values: > Off-line data collection status: (0x00) Offline data collection activity = was > never started. > Self-test execution status: ( 0) The previous self-test routine=20 > completed > without error or no self-test has= ever > been run. > Total time to complete off-line > data collection: (1383) seconds. > Offline data collection > capabilities: (0x1b) SMART execute Offline immediate. > Automatic timer ON/OFF support. > Suspend Offline collection upon n= ew > command. > Offline surface scan supported. > Self-test supported. > SMART capabilities: (0x0003) Saves SMART data before entering > power-saving mode. > Supports SMART auto save timer. > Error logging capability: (0x01) Error logging supported. > Short self-test routine > recommended polling time: ( 1) minutes. > Extended self-test routine > recommended polling time: ( 23) minutes. >=20 > SMART Attributes Data Structure revision number: 16 > Vendor Specific SMART Attributes with Thresholds: > ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE WHEN_FAI= LED=20 > RAW_VALUE > 1 Raw_Read_Error_Rate 0x000b 095 095 060 Pre-fail - = =20 > 131084 > 2 Throughput_Performance 0x0005 100 100 050 Pre-fail - = 0 > 3 Spin_Up_Time 0x0007 104 104 024 Pre-fail - = =20 > 17193697490 > 4 Start_Stop_Count 0x0012 100 100 000 Old_age - = =20 > 197 > 5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail - = 0 > 7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail - = 0 > 8 Seek_Time_Performance 0x0005 100 100 020 Pre-fail - = 0 > 9 Power_On_Hours 0x0012 100 100 000 Old_age - = =20 > 117 > 10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail - = 0 > 12 Power_Cycle_Count 0x0032 100 100 000 Old_age - = =20 > 197 > 192 Power-Off_Retract_Count 0x0032 100 100 050 Old_age - = =20 > 197 > 193 Load_Cycle_Count 0x0012 100 100 050 Old_age - = =20 > 197 > 194 Temperature_Celsius 0x0002 141 141 000 Old_age - = =20 > 39 (Lifetime Min/Max 19/47) > 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age - = 0 > 197 Current_Pending_Sector 0x0022 100 100 000 Old_age - = 0 > 198 Offline_Uncorrectable 0x0008 100 100 000 Old_age - = 0 > 199 UDMA_CRC_Error_Count 0x000a 200 200 000 Old_age - = 0 >=20 > SMART Error Log Version: 1 > No Errors Logged >=20 > SMART Self-test log, version number 1 > Num Test_Description Status Remaining LifeTime(hour= s) =20 > LBA_of_first_error > # 1 Short off-line Completed 00% 115 = - > # 2 Short off-line Completed 00% 104 = - > # 3 Short off-line Completed 00% 104 = - > # 4 Short off-line Completed 00% 104 = - > # 5 Short off-line Completed 00% 103 = - > # 6 Short off-line Completed 00% 103 = - > # 7 Short off-line Completed 00% 103 = - > # 8 Short off-line Completed 00% 102 = - > # 9 Extended off-line Completed 00% 101 = - > #10 Short off-line Completed 00% 99 = - > #11 Short off-line Completed 00% 99 = - > #12 Short off-line Completed 00% 99 = - > #13 Short off-line Completed 00% 99 = - > #14 Short off-line Completed 00% 99 = - > #15 Short off-line Completed 00% 99 = - > #16 Short off-line Completed 00% 99 = - > #17 Short off-line Completed 00% 97 = - > #18 Short off-line Completed 00% 97 = - > #19 Short off-line Completed 00% 97 = - > #20 Short off-line Completed 00% 97 = - > #21 Short off-line Completed 00% 97 = - >=20 >=20 >=20 > Thanks=20 > Ralf >=20 >=20 > Am Freitag, 21. Februar 2003 16:58 schrieben Sie: > > Hi Ralf, > > > > (My comments are inserted below) > > > > On Fri, 21 Feb 2003, Ralf Panse wrote: > > > Hi Bruce ! > > > > > > Here thr output from smartctl -a /dev/hdd: > > > > > > smartctl version 5.1-4 Copyright (C) 2002 Bruce Allen > > > Home page is http://smartmontools.sourceforge.net/ > > > > > > =3D=3D=3D START OF INFORMATION SECTION =3D=3D=3D > > > Device Model: IBM-DTLA-307045 > > > Serial Number: YMDYMHA0675 > > > Firmware Version: TX6OA50C > > > ATA Version is: 5 > > > ATA Standard is: ATA/ATAPI-5 T13 1321D revision 1 > > > Local Time is: Fri Feb 21 16:09:14 2003 CET > > > SMART support is: Available - device has SMART capability. > > > SMART support is: Enabled > > > > > > =3D=3D=3D START OF READ SMART DATA SECTION =3D=3D=3D > > > SMART overall-health self-assessment test result: PASSED > > > > > > General SMART Values: > > > Off-line data collection status: (0x02) Offline data collection activ= ity > > > completed without error. > > > Self-test execution status: ( 0) The previous self-test routin= e > > > completed > > > without error or no self-test= has > > > ever been run. > > > Total time to complete off-line > > > data collection: (2294) seconds. > > > Offline data collection > > > capabilities: (0x1b) SMART execute Offline immedia= te. > > > Automatic timer ON/OFF suppor= t. > > > Suspend Offline collection up= on > > > new command. > > > Offline surface scan supporte= d. > > > Self-test supported. > > > SMART capabilities: (0x0003) Saves SMART data before enter= ing > > > power-saving mode. > > > Supports SMART auto save time= r. > > > Error logging capability: (0x01) Error logging supported. > > > Short self-test routine > > > recommended polling time: ( 2) minutes. > > > Extended self-test routine > > > recommended polling time: ( 28) minutes. > > > > > > SMART Attributes Data Structure revision number: 16 > > > Vendor Specific SMART Attributes with Thresholds: > > > ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE =20 > > > WHEN_FAILED RAW_VALUE > > > 1 Raw_Read_Error_Rate 0x000b 100 100 060 Pre-fail = - =20 > > > 1 2 Throughput_Performance 0x0005 132 132 050 Pre-fail = - > > > 340 > > > 3 Spin_Up_Time 0x0007 094 094 024 Pre-fail = - > > > 25789530422 > > > 4 Start_Stop_Count 0x0012 100 100 000 Old_age = - > > > 52 > > > 5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail = - =20 > > > 0 7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail = - > > > 0 8 Seek_Time_Performance 0x0005 130 130 020 Pre-fai= l =20 > > > - 34 > > > 9 Power_On_Hours 0x0012 100 100 000 Old_age = - > > > 119 > > > 10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail = - =20 > > > 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age = =20 > > > - 52 > > > 192 Power-Off_Retract_Count 0x0032 100 100 050 Old_age = - > > > 52 > > > 193 Load_Cycle_Count 0x0012 100 100 050 Old_age = - > > > 52 > > > 194 Temperature_Celsius 0x0002 171 171 000 Old_age = - > > > 32 > > > 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age = - =20 > > > 0 197 Current_Pending_Sector 0x0022 100 100 000 Old_age = =20 > > > - 0 198 Offline_Uncorrectable 0x0008 100 100 000 Old= _age > > > - 0 199 UDMA_CRC_Error_Count 0x000a 200 200 000 = =20 > > > Old_age - 0 > > > > > > SMART Error Log Version: 1 > > > ATA Error Count: 2 > > > DCR =3D Device Control Register > > > FR =3D Features Register > > > SC =3D Sector Count Register > > > SN =3D Sector Number Register > > > CL =3D Cylinder Low Register > > > CH =3D Cylinder High Register > > > D/H =3D Device/Head Register > > > CR =3D Content written to Command Register > > > ER =3D Error register > > > STA =3D Status register > > > Timestamp is seconds since the previous disk power-on. > > > Note: timestamp "wraps" after 2^32 msec =3D 49.710 days. > > > > > > Error 2 occurred at disk power-on lifetime: 2 hours > > > When the command that caused the error occurred, the device was activ= e or > > > idle. > > > After command completion occurred, registers were: > > > ER:10 SC:00 SN:4c CL:18 CH:f2 D/H:b2 ST:51 > > > Sequence of commands leading to the command that caused the error wer= e: > > > DCR FR SC SN CL CH D/H CR Timestamp > > > 02 40 00 00 00 f2 f2 82 7302.400 > > > 02 40 00 00 c0 f1 f2 82 7302.000 > > > 02 40 00 00 80 f1 f2 82 7301.700 > > > 02 40 00 00 40 f1 f2 82 7301.300 > > > 02 40 00 00 00 f1 f2 82 7300.900 > > > > > > Error 1 occurred at disk power-on lifetime: 1 hours > > > When the command that caused the error occurred, the device was activ= e or > > > idle. > > > After command completion occurred, registers were: > > > ER:10 SC:00 SN:00 CL:00 CH:00 D/H:b0 ST:51 > > > Sequence of commands leading to the command that caused the error wer= e: > > > DCR FR SC SN CL CH D/H CR Timestamp > > > 02 11 00 00 00 00 b0 f7 5132.900 > > > 02 00 00 00 00 00 b0 f3 5132.900 > > > 02 00 00 00 00 00 b0 ec 5132.900 > > > 02 00 40 00 c3 00 f0 40 5132.800 > > > 02 00 40 c0 c2 00 f0 40 5132.800 > > > > > > SMART Self-test log, version number 1 > > > Num Test_Description Status Remaining=20 > > > LifeTime(hours) LBA_of_first_error > > > # 1 Short off-line Completed 00% 117 = =20 > > > - # 2 Short off-line Completed 00% = 92 > > > - # 3 Short off-line Completed 00% = =20 > > > 89 - # 4 Extended off-line Completed = 00% > > > 69 - # 5 Short off-line Completed = =20 > > > 00% 2 - # 6 Short off-line Completed = =20 > > > 00% 0 - > > > > > > > > > So, may hard disk must be ok !?! > > > > The disk looks fine. The two entries in the ATA error log were at 1 ho= urs > > 25 minutes after the disk was first powered up, and 2 hours and 2 minut= es > > after the disk was first powered up. [Perhaps you were playing the > > hdparm??]. The disk is now 119 hours old and hasn't shown any further > > errors. The time to worry is if the ATA error log starts showing hundr= eds > > or thousands of errors in the very recent past. > > > > And the self-tests all completed OK. > > > > > There is another strange value in the SMART Attribute. With another d= isk > > > (IBM) smartctl return a smaller value for Start_Stop_Count than > > > Power_On_Hours. > > > > In itself, that's OK. Start_Stop_Count is the number of times that the > > disk has spun up. For a machine that is turned on and off once per mon= th, > > and has been running for a year, this would be 12. But Power_On_Hours > > would be one year: 8760. > > > > Note that Start_Stop_Count can also change when the disk sleeps. > > > > Now these results: > > > 4 Start_Stop_Count 0x0012 100 100 000 Old_age - = =20 > > > 197 9 Power_On_Hours 0x0012 100 100 000 Old_age -= =20 > > > 116 12 Power_Cycle_Count 0x0032 100 100 000 Old_age = - =20 > > > 197 192 Power-Off_Retract_Count 0x0032 100 100 050 Old_age= - > > > 197 193 Load_Cycle_Count 0x0012 100 100 050 Old_age = -=20 > > > 197 > > > > look very strange. Is this a laptop disk? Please post the output of > > smartctl -a for this disk. > > > > > My disk runs more than one hour per power_on. Can you explain me this > > > value ? > > > > Is the disk sleeping (either a laptop disk or desktop machine that > > suspends a lot)? > > > > > Thanks a lot! > > > > You're welcome! > > > > Bruce > > > > > Ralf > > > > > > Am Freitag, 21. Februar 2003 05:39 schrieb Bruce Allen: > > > > Hi Ralf, > > > > > > > > On Thu, 20 Feb 2003, Ralf Panse wrote: > > > > > Hi all! > > > > > what does this output mean (output from ./smartctl -c /dev/hdd > > > > > ) -> > > > > > > > > > > =3D=3D=3D START OF READ SMART DATA SECTION =3D=3D=3D > > > > > SMART Error Log Version: 1 > > > > > ATA Error Count: 2 > > > > > ... > > > > > Error 2 occurred at disk power-on lifetime: 2 hours > > > > > When the command that caused the error occurred, the device was > > > > > active or idle. > > > > > After command completion occurred, registers were: > > > > > ER:10 SC:00 SN:4c CL:18 CH:f2 D/H:b2 ST:51 > > > > > Sequence of commands leading to the command that caused the error > > > > > were: DCR FR SC SN CL CH D/H CR Timestamp > > > > > 02 40 00 00 00 f2 f2 82 7302.400 > > > > > 02 40 00 00 c0 f1 f2 82 7302.000 > > > > > 02 40 00 00 80 f1 f2 82 7301.700 > > > > > 02 40 00 00 40 f1 f2 82 7301.300 > > > > > 02 40 00 00 00 f1 f2 82 7300.900 > > > > > > > > > > Error 1 occurred at disk power-on lifetime: 1 hours > > > > > ... > > > > > > > > This is the ATA error log. The types of errors that it indicates a= re > > > > described in this document: http://www.t13.org/project/d1321r1c.pdf > > > > please see section 8.41.6.8.2.4 (Device Error Count) which starts o= n > > > > page 204. > > > > > > > > The output listed are the five commands leading up to the command t= hat > > > > caused the error. The different columns refer to different ATA > > > > registers. > > > > > > > > > What is wrong with my hard disk? > > > > > > > > Probably nothing. The last of there errors occured when the disk w= as > > > > just a few hours old (7300 seconds after it was first turned on). = This > > > > may have been due to strange or incorrect hdparm (DMA mode, etc) > > > > settings, a loose cable, or something else wrong with the disk. > > > > > > > > Assuming that the disk is more than a few hours old, it's not been > > > > exhibiting the errors recently. If you want some more reassurance, > > > > post the output of smartctl -a, please. > > > > > > > > You might also want to run some extended self-tests and examine the > > > > self-test log. You can do both of these things with smartctl. > > > > > > > > > The healtstatus-command of smartctl (./smartctl -H /dev/hdd) says= ... > > > > > > > > > > =3D=3D=3D START OF READ SMART DATA SECTION =3D=3D=3D > > > > > SMART overall-health self-assessment test result: PASSED > > > > > > > > > > And all S.M.A.R.T Attributes are ok. > > > > > > > > > > > > > > > Thanks ! > > > > > > > > You're welcome! > > > > > > > > Bruce > > > > > > ------------------------------------------------------- > > > This SF.net email is sponsored by: SlickEdit Inc. Develop an edge. > > > The most comprehensive and flexible code editor you can use. > > > Code faster. C/C++, C#, Java, HTML, XML, many more. FREE 30-Day Trial= =2E > > > www.slickedit.com/sourceforge > > > _______________________________________________ > > > Smartmontools-support mailing list > > > Sma...@li... > > > https://lists.sourceforge.net/lists/listinfo/smartmontools-support >=20 > --=20 > Ralf Panse > Kirchhoff-Institut f=FCr Physik >=20 > Tel: 06221 54 9811 >=20 >=20 > ------------------------------------------------------- > This SF.net email is sponsored by: SlickEdit Inc. Develop an edge. > The most comprehensive and flexible code editor you can use. > Code faster. C/C++, C#, Java, HTML, XML, many more. FREE 30-Day Trial. > www.slickedit.com/sourceforge > _______________________________________________ > Smartmontools-support mailing list > Sma...@li... > https://lists.sourceforge.net/lists/listinfo/smartmontools-support >=20 |
From: Ralf P. <pa...@ki...> - 2003-03-31 11:09:48
|
Hi Bruce, i got a firmware directly from IBM. But the IBM Firmware Installer told m= e=20 that no update is needed.=20 So the problem is still the same and another strange value with the same = disk=20 is printed out. The temperature is raising from 39=B0C up to 49 =B0C by h= eavy=20 activity. The max. temperature of the other older IBM hard disk's i have= ,=20 are only 37 ^C.=20 What is wrong ?=20 Ralf Am Freitag, 21. Februar 2003 19:41 schrieb Bruce Allen: > Hi Ralf, > > OK, I think I see what's happening. I think that you may have an IBM di= sk > that has defective SMART firmware. > > Go to this page: > > http://www.geocities.com/dtla_update/ > > and follow one of the "Related links" for the 60GXP disk. This points = to > revised IBM firmware that should fix the problem. > > [You will find a link to this page on the smartmontools web page under = the > FAQs]. > > Please let me know if this helps! > > Cheers, > =09Bruce > > On Fri, 21 Feb 2003, Ralf Panse wrote: > > Hi Bruce > > > > Here the output from my strange disk with the low Power_On value. It'= s > > not a laptop disk and the pc is turned off every evening. > > > > > > smartctl version 5.1-4 Copyright (C) 2002 Bruce Allen > > Home page is http://smartmontools.sourceforge.net/ > > > > =3D=3D=3D START OF INFORMATION SECTION =3D=3D=3D > > Device Model: IC35L040AVER07-0 > > Serial Number: SXPTX393636 > > Firmware Version: ER4OA46A > > ATA Version is: 5 > > ATA Standard is: ATA/ATAPI-5 T13 1321D revision 1 > > Local Time is: Fri Feb 21 17:01:37 2003 CET > > SMART support is: Available - device has SMART capability. > > SMART support is: Enabled > > > > =3D=3D=3D START OF READ SMART DATA SECTION =3D=3D=3D > > SMART overall-health self-assessment test result: PASSED > > > > General SMART Values: > > Off-line data collection status: (0x00) Offline data collection activ= ity > > was never started. > > Self-test execution status: ( 0) The previous self-test routin= e > > completed > > without error or no self-test= has > > ever been run. > > Total time to complete off-line > > data collection: (1383) seconds. > > Offline data collection > > capabilities: (0x1b) SMART execute Offline immedia= te. > > Automatic timer ON/OFF suppor= t. > > Suspend Offline collection up= on > > new command. > > Offline surface scan supporte= d. > > Self-test supported. > > SMART capabilities: (0x0003) Saves SMART data before enter= ing > > power-saving mode. > > Supports SMART auto save time= r. > > Error logging capability: (0x01) Error logging supported. > > Short self-test routine > > recommended polling time: ( 1) minutes. > > Extended self-test routine > > recommended polling time: ( 23) minutes. > > > > SMART Attributes Data Structure revision number: 16 > > Vendor Specific SMART Attributes with Thresholds: > > ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE =20 > > WHEN_FAILED RAW_VALUE > > 1 Raw_Read_Error_Rate 0x000b 095 095 060 Pre-fail = - > > 131084 > > 2 Throughput_Performance 0x0005 100 100 050 Pre-fail = - =20 > > 0 3 Spin_Up_Time 0x0007 104 104 024 Pre-fail = - > > 17193697490 > > 4 Start_Stop_Count 0x0012 100 100 000 Old_age = - > > 197 > > 5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail = - =20 > > 0 7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail = - > > 0 8 Seek_Time_Performance 0x0005 100 100 020 Pre-fai= l =20 > > - 0 9 Power_On_Hours 0x0012 100 100 000 Old_= age=20 > > - 117 > > 10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail = - =20 > > 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age = =20 > > - 197 > > 192 Power-Off_Retract_Count 0x0032 100 100 050 Old_age = - > > 197 > > 193 Load_Cycle_Count 0x0012 100 100 050 Old_age = - > > 197 > > 194 Temperature_Celsius 0x0002 141 141 000 Old_age = - > > 39 (Lifetime Min/Max 19/47) > > 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age = - =20 > > 0 197 Current_Pending_Sector 0x0022 100 100 000 Old_age = =20 > > - 0 198 Offline_Uncorrectable 0x0008 100 100 000 Old= _age > > - 0 199 UDMA_CRC_Error_Count 0x000a 200 200 000 = =20 > > Old_age - 0 > > > > SMART Error Log Version: 1 > > No Errors Logged > > > > SMART Self-test log, version number 1 > > Num Test_Description Status Remaining=20 > > LifeTime(hours) LBA_of_first_error > > # 1 Short off-line Completed 00% 115 = =20 > > - # 2 Short off-line Completed 00% = 104 > > - # 3 Short off-line Completed 00% = =20 > > 104 - # 4 Short off-line Completed = 00% > > 104 - # 5 Short off-line Completed = =20 > > 00% 103 - # 6 Short off-line Completed = =20 > > 00% 103 - # 7 Short off-line Completed = =20 > > 00% 103 - # 8 Short off-line Completed= =20 > > 00% 102 - # 9 Extended off-line =20 > > Completed 00% 101 - #10 Short off-= line > > Completed 00% 99 - #11 Short > > off-line Completed 00% 99 - #= 12=20 > > Short off-line Completed 00% 99 = - > > #13 Short off-line Completed 00% 99 = =20 > > - #14 Short off-line Completed 00% = 99 > > - #15 Short off-line Completed 00% = =20 > > 99 - #16 Short off-line Completed = 00% > > 99 - #17 Short off-line Completed = =20 > > 00% 97 - #18 Short off-line Completed = =20 > > 00% 97 - #19 Short off-line Completed = =20 > > 00% 97 - #20 Short off-line Completed= =20 > > 00% 97 - #21 Short off-line =20 > > Completed 00% 97 - > > > > > > > > Thanks > > Ralf > > > > Am Freitag, 21. Februar 2003 16:58 schrieben Sie: > > > Hi Ralf, > > > > > > (My comments are inserted below) > > > > > > On Fri, 21 Feb 2003, Ralf Panse wrote: > > > > Hi Bruce ! > > > > > > > > Here thr output from smartctl -a /dev/hdd: > > > > > > > > smartctl version 5.1-4 Copyright (C) 2002 Bruce Allen > > > > Home page is http://smartmontools.sourceforge.net/ > > > > > > > > =3D=3D=3D START OF INFORMATION SECTION =3D=3D=3D > > > > Device Model: IBM-DTLA-307045 > > > > Serial Number: YMDYMHA0675 > > > > Firmware Version: TX6OA50C > > > > ATA Version is: 5 > > > > ATA Standard is: ATA/ATAPI-5 T13 1321D revision 1 > > > > Local Time is: Fri Feb 21 16:09:14 2003 CET > > > > SMART support is: Available - device has SMART capability. > > > > SMART support is: Enabled > > > > > > > > =3D=3D=3D START OF READ SMART DATA SECTION =3D=3D=3D > > > > SMART overall-health self-assessment test result: PASSED > > > > > > > > General SMART Values: > > > > Off-line data collection status: (0x02) Offline data collection > > > > activity completed without error. Self-test execution status: = (=20 > > > > 0) The previous self-test routine completed > > > > without error or no self-= test > > > > has ever been run. > > > > Total time to complete off-line > > > > data collection: (2294) seconds. > > > > Offline data collection > > > > capabilities: (0x1b) SMART execute Offline > > > > immediate. Automatic timer ON/OFF support. Suspend Offline collec= tion > > > > upon new command. > > > > Offline surface scan > > > > supported. Self-test supported. SMART capabilities: =20 > > > > (0x0003) Saves SMART data before entering power-saving mode. > > > > Supports SMART auto save > > > > timer. Error logging capability: (0x01) Error logging > > > > supported. Short self-test routine > > > > recommended polling time: ( 2) minutes. > > > > Extended self-test routine > > > > recommended polling time: ( 28) minutes. > > > > > > > > SMART Attributes Data Structure revision number: 16 > > > > Vendor Specific SMART Attributes with Thresholds: > > > > ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE > > > > WHEN_FAILED RAW_VALUE > > > > 1 Raw_Read_Error_Rate 0x000b 100 100 060 Pre-fail = =20 > > > > - 1 2 Throughput_Performance 0x0005 132 132 050 Pre-fai= l =20 > > > > - 340 > > > > 3 Spin_Up_Time 0x0007 094 094 024 Pre-fail = =20 > > > > - 25789530422 > > > > 4 Start_Stop_Count 0x0012 100 100 000 Old_age = =20 > > > > - 52 > > > > 5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail = =20 > > > > - 0 7 Seek_Error_Rate 0x000b 100 100 067 Pre-fai= l =20 > > > > - 0 8 Seek_Time_Performance 0x0005 130 130 020 Pre-fa= il - > > > > 34 > > > > 9 Power_On_Hours 0x0012 100 100 000 Old_age = =20 > > > > - 119 > > > > 10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail = =20 > > > > - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_ag= e - > > > > 52 > > > > 192 Power-Off_Retract_Count 0x0032 100 100 050 Old_age = =20 > > > > - 52 > > > > 193 Load_Cycle_Count 0x0012 100 100 050 Old_age = =20 > > > > - 52 > > > > 194 Temperature_Celsius 0x0002 171 171 000 Old_age = =20 > > > > - 32 > > > > 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age = =20 > > > > - 0 197 Current_Pending_Sector 0x0022 100 100 000 Old_a= ge - > > > > 0 198 Offline_Uncorrectable 0x0008 100 100 000 =20 > > > > Old_age - 0 199 UDMA_CRC_Error_Count 0x000a 200 200 = =20 > > > > 000 Old_age - 0 > > > > > > > > SMART Error Log Version: 1 > > > > ATA Error Count: 2 > > > > DCR =3D Device Control Register > > > > FR =3D Features Register > > > > SC =3D Sector Count Register > > > > SN =3D Sector Number Register > > > > CL =3D Cylinder Low Register > > > > CH =3D Cylinder High Register > > > > D/H =3D Device/Head Register > > > > CR =3D Content written to Command Register > > > > ER =3D Error register > > > > STA =3D Status register > > > > Timestamp is seconds since the previous disk power-on. > > > > Note: timestamp "wraps" after 2^32 msec =3D 49.710 days. > > > > > > > > Error 2 occurred at disk power-on lifetime: 2 hours > > > > When the command that caused the error occurred, the device was > > > > active or idle. > > > > After command completion occurred, registers were: > > > > ER:10 SC:00 SN:4c CL:18 CH:f2 D/H:b2 ST:51 > > > > Sequence of commands leading to the command that caused the error > > > > were: DCR FR SC SN CL CH D/H CR Timestamp > > > > 02 40 00 00 00 f2 f2 82 7302.400 > > > > 02 40 00 00 c0 f1 f2 82 7302.000 > > > > 02 40 00 00 80 f1 f2 82 7301.700 > > > > 02 40 00 00 40 f1 f2 82 7301.300 > > > > 02 40 00 00 00 f1 f2 82 7300.900 > > > > > > > > Error 1 occurred at disk power-on lifetime: 1 hours > > > > When the command that caused the error occurred, the device was > > > > active or idle. > > > > After command completion occurred, registers were: > > > > ER:10 SC:00 SN:00 CL:00 CH:00 D/H:b0 ST:51 > > > > Sequence of commands leading to the command that caused the error > > > > were: DCR FR SC SN CL CH D/H CR Timestamp > > > > 02 11 00 00 00 00 b0 f7 5132.900 > > > > 02 00 00 00 00 00 b0 f3 5132.900 > > > > 02 00 00 00 00 00 b0 ec 5132.900 > > > > 02 00 40 00 c3 00 f0 40 5132.800 > > > > 02 00 40 c0 c2 00 f0 40 5132.800 > > > > > > > > SMART Self-test log, version number 1 > > > > Num Test_Description Status Remaining > > > > LifeTime(hours) LBA_of_first_error > > > > # 1 Short off-line Completed 00% = 117 > > > > - # 2 Short off-line Completed 00% = =20 > > > > 92 - # 3 Short off-line Completed 00% = 89 =20 > > > > - # 4 Extended off-line Completed 00= % 69 > > > > - # 5 Short off-line Completed > > > > 00% 2 - # 6 Short off-line Completed > > > > 00% 0 - > > > > > > > > > > > > So, may hard disk must be ok !?! > > > > > > The disk looks fine. The two entries in the ATA error log were at = 1 > > > hours 25 minutes after the disk was first powered up, and 2 hours a= nd 2 > > > minutes after the disk was first powered up. [Perhaps you were pla= ying > > > the hdparm??]. The disk is now 119 hours old and hasn't shown any > > > further errors. The time to worry is if the ATA error log starts > > > showing hundreds or thousands of errors in the very recent past. > > > > > > And the self-tests all completed OK. > > > > > > > There is another strange value in the SMART Attribute. With anoth= er > > > > disk (IBM) smartctl return a smaller value for Start_Stop_Count = than > > > > Power_On_Hours. > > > > > > In itself, that's OK. Start_Stop_Count is the number of times that= the > > > disk has spun up. For a machine that is turned on and off once per > > > month, and has been running for a year, this would be 12. But > > > Power_On_Hours would be one year: 8760. > > > > > > Note that Start_Stop_Count can also change when the disk sleeps. > > > > > > Now these results: > > > > 4 Start_Stop_Count 0x0012 100 100 000 Old_age = - > > > > 197 9 Power_On_Hours 0x0012 100 100 000 Old_age= - > > > > 116 12 Power_Cycle_Count 0x0032 100 100 000 Old_= age=20 > > > > - 197 192 Power-Off_Retract_Count 0x0032 100 100 050 Old= _age > > > > - 197 193 Load_Cycle_Count 0x0012 100 100 050 =20 > > > > Old_age - 197 > > > > > > look very strange. Is this a laptop disk? Please post the output = of > > > smartctl -a for this disk. > > > > > > > My disk runs more than one hour per power_on. Can you explain me = this > > > > value ? > > > > > > Is the disk sleeping (either a laptop disk or desktop machine that > > > suspends a lot)? > > > > > > > Thanks a lot! > > > > > > You're welcome! > > > > > > Bruce > > > > > > > Ralf > > > > > > > > Am Freitag, 21. Februar 2003 05:39 schrieb Bruce Allen: > > > > > Hi Ralf, > > > > > > > > > > On Thu, 20 Feb 2003, Ralf Panse wrote: > > > > > > Hi all! > > > > > > what does this output mean (output from ./smartctl -c /dev/hd= d > > > > > > ) -> > > > > > > > > > > > > =3D=3D=3D START OF READ SMART DATA SECTION =3D=3D=3D > > > > > > SMART Error Log Version: 1 > > > > > > ATA Error Count: 2 > > > > > > ... > > > > > > Error 2 occurred at disk power-on lifetime: 2 hours > > > > > > When the command that caused the error occurred, the device w= as > > > > > > active or idle. > > > > > > After command completion occurred, registers were: > > > > > > ER:10 SC:00 SN:4c CL:18 CH:f2 D/H:b2 ST:51 > > > > > > Sequence of commands leading to the command that caused the e= rror > > > > > > were: DCR FR SC SN CL CH D/H CR Timestamp > > > > > > 02 40 00 00 00 f2 f2 82 7302.400 > > > > > > 02 40 00 00 c0 f1 f2 82 7302.000 > > > > > > 02 40 00 00 80 f1 f2 82 7301.700 > > > > > > 02 40 00 00 40 f1 f2 82 7301.300 > > > > > > 02 40 00 00 00 f1 f2 82 7300.900 > > > > > > > > > > > > Error 1 occurred at disk power-on lifetime: 1 hours > > > > > > ... > > > > > > > > > > This is the ATA error log. The types of errors that it indicat= es > > > > > are described in this document: > > > > > http://www.t13.org/project/d1321r1c.pdf please see section > > > > > 8.41.6.8.2.4 (Device Error Count) which starts on page 204. > > > > > > > > > > The output listed are the five commands leading up to the comma= nd > > > > > that caused the error. The different columns refer to differen= t > > > > > ATA registers. > > > > > > > > > > > What is wrong with my hard disk? > > > > > > > > > > Probably nothing. The last of there errors occured when the di= sk > > > > > was just a few hours old (7300 seconds after it was first turne= d > > > > > on). This may have been due to strange or incorrect hdparm (DM= A > > > > > mode, etc) settings, a loose cable, or something else wrong wit= h > > > > > the disk. > > > > > > > > > > Assuming that the disk is more than a few hours old, it's not b= een > > > > > exhibiting the errors recently. If you want some more reassura= nce, > > > > > post the output of smartctl -a, please. > > > > > > > > > > You might also want to run some extended self-tests and examine= the > > > > > self-test log. You can do both of these things with smartctl. > > > > > > > > > > > The healtstatus-command of smartctl (./smartctl -H /dev/hdd) = says > > > > > > ... > > > > > > > > > > > > =3D=3D=3D START OF READ SMART DATA SECTION =3D=3D=3D > > > > > > SMART overall-health self-assessment test result: PASSED > > > > > > > > > > > > And all S.M.A.R.T Attributes are ok. > > > > > > > > > > > > > > > > > > Thanks ! > > > > > > > > > > You're welcome! > > > > > > > > > > Bruce > > > > > > > > ------------------------------------------------------- > > > > This SF.net email is sponsored by: SlickEdit Inc. Develop an edge= =2E > > > > The most comprehensive and flexible code editor you can use. > > > > Code faster. C/C++, C#, Java, HTML, XML, many more. FREE 30-Day > > > > Trial. www.slickedit.com/sourceforge > > > > _______________________________________________ > > > > Smartmontools-support mailing list > > > > Sma...@li... > > > > https://lists.sourceforge.net/lists/listinfo/smartmontools-suppor= t > > > > -- > > Ralf Panse > > Kirchhoff-Institut f=FCr Physik > > > > Tel: 06221 54 9811 > > > > > > ------------------------------------------------------- > > This SF.net email is sponsored by: SlickEdit Inc. Develop an edge. > > The most comprehensive and flexible code editor you can use. > > Code faster. C/C++, C#, Java, HTML, XML, many more. FREE 30-Day Trial= =2E > > www.slickedit.com/sourceforge > > _______________________________________________ > > Smartmontools-support mailing list > > Sma...@li... > > https://lists.sourceforge.net/lists/listinfo/smartmontools-support > > ------------------------------------------------------- > This SF.net email is sponsored by: SlickEdit Inc. Develop an edge. > The most comprehensive and flexible code editor you can use. > Code faster. C/C++, C#, Java, HTML, XML, many more. FREE 30-Day Trial. > www.slickedit.com/sourceforge > _______________________________________________ > Smartmontools-support mailing list > Sma...@li... > https://lists.sourceforge.net/lists/listinfo/smartmontools-support --=20 Ralf Panse Kirchhoff-Institut f=FCr Physik Technische Informatik Tel: +49 6221 54 9811 Im Neuenheimer Feld 227 D-69120 Heidelberg, Germany e-mail: pa...@ki... |
From: Bruce A. <ba...@gr...> - 2003-04-01 04:34:40
|
Hi Ralf, > i got a firmware directly from IBM. But the IBM Firmware Installer > told me that no update is needed. OK. Thanks for trying. > So the problem is still the same and another strange value with the > same disk is printed out. The temperature is raising from 39=B0C up to > 49 =B0C by heavy activity. > The max. temperature of the other older IBM hard disk's i have, are > only 37 ^C. What is wrong ? I'm really not sure. Could you please save the output of smartctl -a -v N,raw16 /dev/hd* to a file and send it to the mailing list as an attachment, please? That might help me to understand better what's going on. [Note: you'll need version 5.1-9 or later of the package to support this -v option.] I'm sorry you are having so much trouble. Cheers, =09Bruce >=20 > Ralf >=20 >=20 > Am Freitag, 21. Februar 2003 19:41 schrieb Bruce Allen: > > Hi Ralf, > > > > OK, I think I see what's happening. I think that you may have an IBM di= sk > > that has defective SMART firmware. > > > > Go to this page: > > > > http://www.geocities.com/dtla_update/ > > > > and follow one of the "Related links" for the 60GXP disk. This points = to > > revised IBM firmware that should fix the problem. > > > > [You will find a link to this page on the smartmontools web page under = the > > FAQs]. > > > > Please let me know if this helps! > > > > Cheers, > > =09Bruce > > > > On Fri, 21 Feb 2003, Ralf Panse wrote: > > > Hi Bruce > > > > > > Here the output from my strange disk with the low Power_On value. It'= s > > > not a laptop disk and the pc is turned off every evening. > > > > > > > > > smartctl version 5.1-4 Copyright (C) 2002 Bruce Allen > > > Home page is http://smartmontools.sourceforge.net/ > > > > > > =3D=3D=3D START OF INFORMATION SECTION =3D=3D=3D > > > Device Model: IC35L040AVER07-0 > > > Serial Number: SXPTX393636 > > > Firmware Version: ER4OA46A > > > ATA Version is: 5 > > > ATA Standard is: ATA/ATAPI-5 T13 1321D revision 1 > > > Local Time is: Fri Feb 21 17:01:37 2003 CET > > > SMART support is: Available - device has SMART capability. > > > SMART support is: Enabled > > > > > > =3D=3D=3D START OF READ SMART DATA SECTION =3D=3D=3D > > > SMART overall-health self-assessment test result: PASSED > > > > > > General SMART Values: > > > Off-line data collection status: (0x00) Offline data collection activ= ity > > > was never started. > > > Self-test execution status: ( 0) The previous self-test routin= e > > > completed > > > without error or no self-test= has > > > ever been run. > > > Total time to complete off-line > > > data collection: (1383) seconds. > > > Offline data collection > > > capabilities: (0x1b) SMART execute Offline immedia= te. > > > Automatic timer ON/OFF suppor= t. > > > Suspend Offline collection up= on > > > new command. > > > Offline surface scan supporte= d. > > > Self-test supported. > > > SMART capabilities: (0x0003) Saves SMART data before enter= ing > > > power-saving mode. > > > Supports SMART auto save time= r. > > > Error logging capability: (0x01) Error logging supported. > > > Short self-test routine > > > recommended polling time: ( 1) minutes. > > > Extended self-test routine > > > recommended polling time: ( 23) minutes. > > > > > > SMART Attributes Data Structure revision number: 16 > > > Vendor Specific SMART Attributes with Thresholds: > > > ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE =20 > > > WHEN_FAILED RAW_VALUE > > > 1 Raw_Read_Error_Rate 0x000b 095 095 060 Pre-fail = - > > > 131084 > > > 2 Throughput_Performance 0x0005 100 100 050 Pre-fail = - =20 > > > 0 3 Spin_Up_Time 0x0007 104 104 024 Pre-fail = - > > > 17193697490 > > > 4 Start_Stop_Count 0x0012 100 100 000 Old_age = - > > > 197 > > > 5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail = - =20 > > > 0 7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail = - > > > 0 8 Seek_Time_Performance 0x0005 100 100 020 Pre-fai= l =20 > > > - 0 9 Power_On_Hours 0x0012 100 100 000 Old_= age=20 > > > - 117 > > > 10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail = - =20 > > > 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age = =20 > > > - 197 > > > 192 Power-Off_Retract_Count 0x0032 100 100 050 Old_age = - > > > 197 > > > 193 Load_Cycle_Count 0x0012 100 100 050 Old_age = - > > > 197 > > > 194 Temperature_Celsius 0x0002 141 141 000 Old_age = - > > > 39 (Lifetime Min/Max 19/47) > > > 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age = - =20 > > > 0 197 Current_Pending_Sector 0x0022 100 100 000 Old_age = =20 > > > - 0 198 Offline_Uncorrectable 0x0008 100 100 000 Old= _age > > > - 0 199 UDMA_CRC_Error_Count 0x000a 200 200 000 = =20 > > > Old_age - 0 > > > > > > SMART Error Log Version: 1 > > > No Errors Logged > > > > > > SMART Self-test log, version number 1 > > > Num Test_Description Status Remaining=20 > > > LifeTime(hours) LBA_of_first_error > > > # 1 Short off-line Completed 00% 115 = =20 > > > - # 2 Short off-line Completed 00% = 104 > > > - # 3 Short off-line Completed 00% = =20 > > > 104 - # 4 Short off-line Completed = 00% > > > 104 - # 5 Short off-line Completed = =20 > > > 00% 103 - # 6 Short off-line Completed = =20 > > > 00% 103 - # 7 Short off-line Completed = =20 > > > 00% 103 - # 8 Short off-line Completed= =20 > > > 00% 102 - # 9 Extended off-line =20 > > > Completed 00% 101 - #10 Short off-= line > > > Completed 00% 99 - #11 Short > > > off-line Completed 00% 99 - #= 12=20 > > > Short off-line Completed 00% 99 = - > > > #13 Short off-line Completed 00% 99 = =20 > > > - #14 Short off-line Completed 00% = 99 > > > - #15 Short off-line Completed 00% = =20 > > > 99 - #16 Short off-line Completed = 00% > > > 99 - #17 Short off-line Completed = =20 > > > 00% 97 - #18 Short off-line Completed = =20 > > > 00% 97 - #19 Short off-line Completed = =20 > > > 00% 97 - #20 Short off-line Completed= =20 > > > 00% 97 - #21 Short off-line =20 > > > Completed 00% 97 - > > > > > > > > > > > > Thanks > > > Ralf > > > > > > Am Freitag, 21. Februar 2003 16:58 schrieben Sie: > > > > Hi Ralf, > > > > > > > > (My comments are inserted below) > > > > > > > > On Fri, 21 Feb 2003, Ralf Panse wrote: > > > > > Hi Bruce ! > > > > > > > > > > Here thr output from smartctl -a /dev/hdd: > > > > > > > > > > smartctl version 5.1-4 Copyright (C) 2002 Bruce Allen > > > > > Home page is http://smartmontools.sourceforge.net/ > > > > > > > > > > =3D=3D=3D START OF INFORMATION SECTION =3D=3D=3D > > > > > Device Model: IBM-DTLA-307045 > > > > > Serial Number: YMDYMHA0675 > > > > > Firmware Version: TX6OA50C > > > > > ATA Version is: 5 > > > > > ATA Standard is: ATA/ATAPI-5 T13 1321D revision 1 > > > > > Local Time is: Fri Feb 21 16:09:14 2003 CET > > > > > SMART support is: Available - device has SMART capability. > > > > > SMART support is: Enabled > > > > > > > > > > =3D=3D=3D START OF READ SMART DATA SECTION =3D=3D=3D > > > > > SMART overall-health self-assessment test result: PASSED > > > > > > > > > > General SMART Values: > > > > > Off-line data collection status: (0x02) Offline data collection > > > > > activity completed without error. Self-test execution status: = (=20 > > > > > 0) The previous self-test routine completed > > > > > without error or no self-= test > > > > > has ever been run. > > > > > Total time to complete off-line > > > > > data collection: (2294) seconds. > > > > > Offline data collection > > > > > capabilities: (0x1b) SMART execute Offline > > > > > immediate. Automatic timer ON/OFF support. Suspend Offline collec= tion > > > > > upon new command. > > > > > Offline surface scan > > > > > supported. Self-test supported. SMART capabilities: =20 > > > > > (0x0003) Saves SMART data before entering power-saving mode. > > > > > Supports SMART auto save > > > > > timer. Error logging capability: (0x01) Error logging > > > > > supported. Short self-test routine > > > > > recommended polling time: ( 2) minutes. > > > > > Extended self-test routine > > > > > recommended polling time: ( 28) minutes. > > > > > > > > > > SMART Attributes Data Structure revision number: 16 > > > > > Vendor Specific SMART Attributes with Thresholds: > > > > > ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE > > > > > WHEN_FAILED RAW_VALUE > > > > > 1 Raw_Read_Error_Rate 0x000b 100 100 060 Pre-fail = =20 > > > > > - 1 2 Throughput_Performance 0x0005 132 132 050 Pre-fai= l =20 > > > > > - 340 > > > > > 3 Spin_Up_Time 0x0007 094 094 024 Pre-fail = =20 > > > > > - 25789530422 > > > > > 4 Start_Stop_Count 0x0012 100 100 000 Old_age = =20 > > > > > - 52 > > > > > 5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail = =20 > > > > > - 0 7 Seek_Error_Rate 0x000b 100 100 067 Pre-fai= l =20 > > > > > - 0 8 Seek_Time_Performance 0x0005 130 130 020 Pre-fa= il - > > > > > 34 > > > > > 9 Power_On_Hours 0x0012 100 100 000 Old_age = =20 > > > > > - 119 > > > > > 10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail = =20 > > > > > - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_ag= e - > > > > > 52 > > > > > 192 Power-Off_Retract_Count 0x0032 100 100 050 Old_age = =20 > > > > > - 52 > > > > > 193 Load_Cycle_Count 0x0012 100 100 050 Old_age = =20 > > > > > - 52 > > > > > 194 Temperature_Celsius 0x0002 171 171 000 Old_age = =20 > > > > > - 32 > > > > > 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age = =20 > > > > > - 0 197 Current_Pending_Sector 0x0022 100 100 000 Old_a= ge - > > > > > 0 198 Offline_Uncorrectable 0x0008 100 100 000 =20 > > > > > Old_age - 0 199 UDMA_CRC_Error_Count 0x000a 200 200 = =20 > > > > > 000 Old_age - 0 > > > > > > > > > > SMART Error Log Version: 1 > > > > > ATA Error Count: 2 > > > > > DCR =3D Device Control Register > > > > > FR =3D Features Register > > > > > SC =3D Sector Count Register > > > > > SN =3D Sector Number Register > > > > > CL =3D Cylinder Low Register > > > > > CH =3D Cylinder High Register > > > > > D/H =3D Device/Head Register > > > > > CR =3D Content written to Command Register > > > > > ER =3D Error register > > > > > STA =3D Status register > > > > > Timestamp is seconds since the previous disk power-on. > > > > > Note: timestamp "wraps" after 2^32 msec =3D 49.710 days. > > > > > > > > > > Error 2 occurred at disk power-on lifetime: 2 hours > > > > > When the command that caused the error occurred, the device was > > > > > active or idle. > > > > > After command completion occurred, registers were: > > > > > ER:10 SC:00 SN:4c CL:18 CH:f2 D/H:b2 ST:51 > > > > > Sequence of commands leading to the command that caused the error > > > > > were: DCR FR SC SN CL CH D/H CR Timestamp > > > > > 02 40 00 00 00 f2 f2 82 7302.400 > > > > > 02 40 00 00 c0 f1 f2 82 7302.000 > > > > > 02 40 00 00 80 f1 f2 82 7301.700 > > > > > 02 40 00 00 40 f1 f2 82 7301.300 > > > > > 02 40 00 00 00 f1 f2 82 7300.900 > > > > > > > > > > Error 1 occurred at disk power-on lifetime: 1 hours > > > > > When the command that caused the error occurred, the device was > > > > > active or idle. > > > > > After command completion occurred, registers were: > > > > > ER:10 SC:00 SN:00 CL:00 CH:00 D/H:b0 ST:51 > > > > > Sequence of commands leading to the command that caused the error > > > > > were: DCR FR SC SN CL CH D/H CR Timestamp > > > > > 02 11 00 00 00 00 b0 f7 5132.900 > > > > > 02 00 00 00 00 00 b0 f3 5132.900 > > > > > 02 00 00 00 00 00 b0 ec 5132.900 > > > > > 02 00 40 00 c3 00 f0 40 5132.800 > > > > > 02 00 40 c0 c2 00 f0 40 5132.800 > > > > > > > > > > SMART Self-test log, version number 1 > > > > > Num Test_Description Status Remaining > > > > > LifeTime(hours) LBA_of_first_error > > > > > # 1 Short off-line Completed 00% = 117 > > > > > - # 2 Short off-line Completed 00% = =20 > > > > > 92 - # 3 Short off-line Completed 00% = 89 =20 > > > > > - # 4 Extended off-line Completed 00= % 69 > > > > > - # 5 Short off-line Completed > > > > > 00% 2 - # 6 Short off-line Completed > > > > > 00% 0 - > > > > > > > > > > > > > > > So, may hard disk must be ok !?! > > > > > > > > The disk looks fine. The two entries in the ATA error log were at = 1 > > > > hours 25 minutes after the disk was first powered up, and 2 hours a= nd 2 > > > > minutes after the disk was first powered up. [Perhaps you were pla= ying > > > > the hdparm??]. The disk is now 119 hours old and hasn't shown any > > > > further errors. The time to worry is if the ATA error log starts > > > > showing hundreds or thousands of errors in the very recent past. > > > > > > > > And the self-tests all completed OK. > > > > > > > > > There is another strange value in the SMART Attribute. With anoth= er > > > > > disk (IBM) smartctl return a smaller value for Start_Stop_Count = than > > > > > Power_On_Hours. > > > > > > > > In itself, that's OK. Start_Stop_Count is the number of times that= the > > > > disk has spun up. For a machine that is turned on and off once per > > > > month, and has been running for a year, this would be 12. But > > > > Power_On_Hours would be one year: 8760. > > > > > > > > Note that Start_Stop_Count can also change when the disk sleeps. > > > > > > > > Now these results: > > > > > 4 Start_Stop_Count 0x0012 100 100 000 Old_age = - > > > > > 197 9 Power_On_Hours 0x0012 100 100 000 Old_age= - > > > > > 116 12 Power_Cycle_Count 0x0032 100 100 000 Old_= age=20 > > > > > - 197 192 Power-Off_Retract_Count 0x0032 100 100 050 Old= _age > > > > > - 197 193 Load_Cycle_Count 0x0012 100 100 050 =20 > > > > > Old_age - 197 > > > > > > > > look very strange. Is this a laptop disk? Please post the output = of > > > > smartctl -a for this disk. > > > > > > > > > My disk runs more than one hour per power_on. Can you explain me = this > > > > > value ? > > > > > > > > Is the disk sleeping (either a laptop disk or desktop machine that > > > > suspends a lot)? > > > > > > > > > Thanks a lot! > > > > > > > > You're welcome! > > > > > > > > Bruce > > > > > > > > > Ralf > > > > > > > > > > Am Freitag, 21. Februar 2003 05:39 schrieb Bruce Allen: > > > > > > Hi Ralf, > > > > > > > > > > > > On Thu, 20 Feb 2003, Ralf Panse wrote: > > > > > > > Hi all! > > > > > > > what does this output mean (output from ./smartctl -c /dev/hd= d > > > > > > > ) -> > > > > > > > > > > > > > > =3D=3D=3D START OF READ SMART DATA SECTION =3D=3D=3D > > > > > > > SMART Error Log Version: 1 > > > > > > > ATA Error Count: 2 > > > > > > > ... > > > > > > > Error 2 occurred at disk power-on lifetime: 2 hours > > > > > > > When the command that caused the error occurred, the device w= as > > > > > > > active or idle. > > > > > > > After command completion occurred, registers were: > > > > > > > ER:10 SC:00 SN:4c CL:18 CH:f2 D/H:b2 ST:51 > > > > > > > Sequence of commands leading to the command that caused the e= rror > > > > > > > were: DCR FR SC SN CL CH D/H CR Timestamp > > > > > > > 02 40 00 00 00 f2 f2 82 7302.400 > > > > > > > 02 40 00 00 c0 f1 f2 82 7302.000 > > > > > > > 02 40 00 00 80 f1 f2 82 7301.700 > > > > > > > 02 40 00 00 40 f1 f2 82 7301.300 > > > > > > > 02 40 00 00 00 f1 f2 82 7300.900 > > > > > > > > > > > > > > Error 1 occurred at disk power-on lifetime: 1 hours > > > > > > > ... > > > > > > > > > > > > This is the ATA error log. The types of errors that it indicat= es > > > > > > are described in this document: > > > > > > http://www.t13.org/project/d1321r1c.pdf please see section > > > > > > 8.41.6.8.2.4 (Device Error Count) which starts on page 204. > > > > > > > > > > > > The output listed are the five commands leading up to the comma= nd > > > > > > that caused the error. The different columns refer to differen= t > > > > > > ATA registers. > > > > > > > > > > > > > What is wrong with my hard disk? > > > > > > > > > > > > Probably nothing. The last of there errors occured when the di= sk > > > > > > was just a few hours old (7300 seconds after it was first turne= d > > > > > > on). This may have been due to strange or incorrect hdparm (DM= A > > > > > > mode, etc) settings, a loose cable, or something else wrong wit= h > > > > > > the disk. > > > > > > > > > > > > Assuming that the disk is more than a few hours old, it's not b= een > > > > > > exhibiting the errors recently. If you want some more reassura= nce, > > > > > > post the output of smartctl -a, please. > > > > > > > > > > > > You might also want to run some extended self-tests and examine= the > > > > > > self-test log. You can do both of these things with smartctl. > > > > > > > > > > > > > The healtstatus-command of smartctl (./smartctl -H /dev/hdd) = says > > > > > > > ... > > > > > > > > > > > > > > =3D=3D=3D START OF READ SMART DATA SECTION =3D=3D=3D > > > > > > > SMART overall-health self-assessment test result: PASSED > > > > > > > > > > > > > > And all S.M.A.R.T Attributes are ok. > > > > > > > > > > > > > > > > > > > > > Thanks ! > > > > > > > > > > > > You're welcome! > > > > > > > > > > > > Bruce > > > > > > > > > > ------------------------------------------------------- > > > > > This SF.net email is sponsored by: SlickEdit Inc. Develop an edge= =2E > > > > > The most comprehensive and flexible code editor you can use. > > > > > Code faster. C/C++, C#, Java, HTML, XML, many more. FREE 30-Day > > > > > Trial. www.slickedit.com/sourceforge > > > > > _______________________________________________ > > > > > Smartmontools-support mailing list > > > > > Sma...@li... > > > > > https://lists.sourceforge.net/lists/listinfo/smartmontools-suppor= t > > > > > > -- > > > Ralf Panse > > > Kirchhoff-Institut f=FCr Physik > > > > > > Tel: 06221 54 9811 > > > > > > > > > ------------------------------------------------------- > > > This SF.net email is sponsored by: SlickEdit Inc. Develop an edge. > > > The most comprehensive and flexible code editor you can use. > > > Code faster. C/C++, C#, Java, HTML, XML, many more. FREE 30-Day Trial= =2E > > > www.slickedit.com/sourceforge > > > _______________________________________________ > > > Smartmontools-support mailing list > > > Sma...@li... > > > https://lists.sourceforge.net/lists/listinfo/smartmontools-support > > > > ------------------------------------------------------- > > This SF.net email is sponsored by: SlickEdit Inc. Develop an edge. > > The most comprehensive and flexible code editor you can use. > > Code faster. C/C++, C#, Java, HTML, XML, many more. FREE 30-Day Trial. > > www.slickedit.com/sourceforge > > _______________________________________________ > > Smartmontools-support mailing list > > Sma...@li... > > https://lists.sourceforge.net/lists/listinfo/smartmontools-support >=20 > --=20 > Ralf Panse > Kirchhoff-Institut f=FCr Physik > Technische Informatik >=20 > Tel: +49 6221 54 9811 Im Neuenheimer Feld 227 > D-69120 Heidelberg, Germany > e-mail: pa...@ki... >=20 >=20 > ------------------------------------------------------- > This SF.net email is sponsored by: ValueWeb:=20 > Dedicated Hosting for just $79/mo with 500 GB of bandwidth!=20 > No other company gives more support or power for your dedicated server > http://click.atdmt.com/AFF/go/sdnxxaff00300020aff/direct/01/ > _______________________________________________ > Smartmontools-support mailing list > Sma...@li... > https://lists.sourceforge.net/lists/listinfo/smartmontools-support >=20 |