From: Tom <thr...@ya...> - 2009-02-13 13:19:14
|
No replies to either of my prior two questions, but I will try one more time. I have a drive that always passes extended tests, but fails short tests (at a different sector every day, but always within the first ten percent of the test. Can anyone explain the reason for this? Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Short offline Completed: read failure 90% 14747 216552987 # 2 Short offline Completed: read failure 90% 14725 243684922 # 3 Extended offline Completed without error 00% 14703 - # 4 Short offline Completed: read failure 90% 14701 249983032 # 5 Short offline Completed: read failure 90% 14678 110100736 # 6 Short offline Completed: read failure 90% 14656 66663550 # 7 Short offline Completed: read failure 90% 14631 7775713 # 8 Extended offline Completed without error 00% 14619 - # 9 Short offline Completed: read failure 90% 14607 116404119 #10 Extended offline Completed without error 00% 14585 - |
From: Justin P. <jp...@lu...> - 2009-02-13 13:54:24
|
On Fri, 13 Feb 2009, Tom wrote: > No replies to either of my prior two questions, but I will > try one more time. > > I have a drive that always passes extended tests, but fails > short tests (at a different sector every day, but always > within the first ten percent of the test. Can anyone explain > the reason for this? > > Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error > # 1 Short offline Completed: read failure 90% 14747 216552987 > # 2 Short offline Completed: read failure 90% 14725 243684922 > # 3 Extended offline Completed without error 00% 14703 - > # 4 Short offline Completed: read failure 90% 14701 249983032 > # 5 Short offline Completed: read failure 90% 14678 110100736 > # 6 Short offline Completed: read failure 90% 14656 66663550 > # 7 Short offline Completed: read failure 90% 14631 7775713 > # 8 Extended offline Completed without error 00% 14619 - > # 9 Short offline Completed: read failure 90% 14607 116404119 > #10 Extended offline Completed without error 00% 14585 - > Please show the full smartctl -a output. |
From: Tom <thr...@ya...> - 2009-02-13 23:29:23
|
-- On Fri, 2/13/09, Justin Piszcz <jp...@lu...> wrote: > Please show the full smartctl -a output. OK. smartctl version 5.38 [i686-pc-linux-gnu] Copyright (C) 2002-8 Bruce Allen Home page is http://smartmontools.sourceforge.net/ === START OF INFORMATION SECTION === Model Family: Seagate Barracuda 7200.7 and 7200.7 Plus family Device Model: ST3160023AS Serial Number: xxxxxxxx Firmware Version: 3.18 User Capacity: 160,041,885,696 bytes Device is: In smartctl database [for details use: -P show] ATA Version is: 6 ATA Standard is: ATA/ATAPI-6 T13 1410D revision 2 Local Time is: Fri Feb 13 23:12:26 2009 UTC SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x82) Offline data collection activity was completed without error. Auto Offline Data Collection: Enabled. Self-test execution status: ( 121) The previous self-test completed having the read element of the test failed. Total time to complete Offline data collection: ( 430) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. No General Purpose Logging support. Short self-test routine recommended polling time: ( 1) minutes. Extended self-test routine recommended polling time: ( 111) minutes. SMART Attributes Data Structure revision number: 10 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000f 062 052 006 Pre-fail Always - 51646049 3 Spin_Up_Time 0x0003 097 096 000 Pre-fail Always - 0 4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 19 5 Reallocated_Sector_Ct 0x0033 100 100 036 Pre-fail Always - 24 7 Seek_Error_Rate 0x000f 089 069 030 Pre-fail Always - 818691257 9 Power_On_Hours 0x0032 084 084 000 Old_age Always - 14779 10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 233 194 Temperature_Celsius 0x0022 050 069 000 Old_age Always - 50 195 Hardware_ECC_Recovered 0x001a 062 052 000 Old_age Always - 51646049 197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 1 198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 1 199 UDMA_CRC_Error_Count 0x003e 200 196 000 Old_age Always - 11 200 Multi_Zone_Error_Rate 0x0000 100 253 000 Old_age Offline - 0 202 TA_Increase_Count 0x0032 091 244 000 Old_age Always - 9 Error SMART Error Log Read failed Smartctl: SMART Error Log Read Failed SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Short offline Completed: read failure 90% 14770 254329073 # 2 Short offline Completed: read failure 90% 14747 216552987 # 3 Short offline Completed: read failure 90% 14725 243684922 # 4 Extended offline Completed without error 00% 14703 - # 5 Short offline Completed: read failure 90% 14701 249983032 # 6 Short offline Completed: read failure 90% 14678 110100736 # 7 Short offline Completed: read failure 90% 14656 66663550 # 8 Short offline Completed: read failure 90% 14631 7775713 # 9 Extended offline Completed without error 00% 14619 - #10 Short offline Completed: read failure 90% 14607 116404119 #11 Extended offline Completed without error 00% 14585 - #12 Short offline Completed without error 00% 14503 - #13 Short offline Completed without error 00% 14480 - #14 Short offline Completed without error 00% 14456 - #15 Short offline Completed without error 00% 14433 - #16 Short offline Completed without error 00% 14409 - #17 Extended offline Completed without error 00% 14388 - #18 Short offline Completed without error 00% 14386 - #19 Short offline Completed without error 00% 14362 - #20 Short offline Completed without error 00% 14338 - #21 Short offline Completed without error 00% 14315 - SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. |
From: Justin P. <jp...@lu...> - 2009-02-13 23:33:37
|
On Fri, 13 Feb 2009, Tom wrote: > -- On Fri, 2/13/09, Justin Piszcz <jp...@lu...> wrote: > >> Please show the full smartctl -a output. > > SMART Attributes Data Structure revision number: 10 > Vendor Specific SMART Attributes with Thresholds: > ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE > 1 Raw_Read_Error_Rate 0x000f 062 052 006 Pre-fail Always - 51646049 > 3 Spin_Up_Time 0x0003 097 096 000 Pre-fail Always - 0 > 4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 19 > 5 Reallocated_Sector_Ct 0x0033 100 100 036 Pre-fail Always - 24 > 7 Seek_Error_Rate 0x000f 089 069 030 Pre-fail Always - 818691257 > 9 Power_On_Hours 0x0032 084 084 000 Old_age Always - 14779 > 10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail Always - 0 > 12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 233 > 194 Temperature_Celsius 0x0022 050 069 000 Old_age Always - 50 > 195 Hardware_ECC_Recovered 0x001a 062 052 000 Old_age Always - 51646049 > 197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 1 > 198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 1 > 199 UDMA_CRC_Error_Count 0x003e 200 196 000 Old_age Always - 11 > 200 Multi_Zone_Error_Rate 0x0000 100 253 000 Old_age Offline - 0 > 202 TA_Increase_Count 0x0032 091 244 000 Old_age Always - 9 > > Error SMART Error Log Read failed > Smartctl: SMART Error Log Read Failed > SMART Self-test log structure revision number 1 > Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error > # 1 Short offline Completed: read failure 90% 14770 254329073 > # 2 Short offline Completed: read failure 90% 14747 216552987 > # 3 Short offline Completed: read failure 90% 14725 243684922 > # 4 Extended offline Completed without error 00% 14703 - > # 5 Short offline Completed: read failure 90% 14701 249983032 > # 6 Short offline Completed: read failure 90% 14678 110100736 > # 7 Short offline Completed: read failure 90% 14656 66663550 > # 8 Short offline Completed: read failure 90% 14631 7775713 > # 9 Extended offline Completed without error 00% 14619 - > #10 Short offline Completed: read failure 90% 14607 116404119 > #11 Extended offline Completed without error 00% 14585 - > #12 Short offline Completed without error 00% 14503 - > #13 Short offline Completed without error 00% 14480 - > #14 Short offline Completed without error 00% 14456 - > #15 Short offline Completed without error 00% 14433 - > #16 Short offline Completed without error 00% 14409 - > #17 Extended offline Completed without error 00% 14388 - > #18 Short offline Completed without error 00% 14386 - > #19 Short offline Completed without error 00% 14362 - > #20 Short offline Completed without error 00% 14338 - > #21 Short offline Completed without error 00% 14315 - > > SMART Selective self-test log data structure revision number 1 > SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS > 1 0 0 Not_testing > 2 0 0 Not_testing > 3 0 0 Not_testing > 4 0 0 Not_testing > 5 0 0 Not_testing > Selective self-test flags (0x0): > After scanning selected spans, do NOT read-scan remainder of disk. > If Selective self-test is pending on power-up, resume after 0 minute delay. > The important values: > 5 Reallocated_Sector_Ct 0x0033 100 100 036 Pre-fail Always - 24 > 197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 1 > 198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 1 Looks like the disk will eventually die at some point, when I had a disk like this in a RAID, I removed it from the RAID and re-ran DD, helped for awhile, got rid of the pending sector but more came back later, RMA if you can, if you care about your data, get a replacement disk.. Justin. |