|
From: <PRI...@te...> - 2010-02-08 17:55:38
|
Hello all!! I would like to submit a question, to see if someone can help me. I have the following system: [root@io2 /root]# smartctl -a /dev/hdc smartctl version 5.1-18 Copyright (C) 2002-3 Bruce Allen Home page is http://smartmontools.sourceforge.net/ === START OF INFORMATION SECTION === Device Model: FUJITSU MHT2040AS Serial Number: NP1FT42254W3 Firmware Version: 006C Device is: Not in smartctl database [for details use: -P showall] ATA Version is: 6 ATA Standard is: ATA/ATAPI-6 T13 1410D revision 3a Local Time is: Mon Feb 8 17:52:52 2010 WET SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Off-line data collection status: (0x85) Offline data collection activity was aborted by an interrupting command from host. Auto Off-line Data Collection: Enabled. Self-test execution status: ( 245) Self-test routine in progress... 50% of test remaining. Total time to complete off-line data collection: ( 293) seconds. Offline data collection capabilities: (0x7b) SMART execute Offline immediate. Automatic timer ON/OFF support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. No General Purpose Logging support. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 40) minutes. Conveyance self-test routine recommended polling time: ( 2) minutes. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000f 100 100 046 Pre-fail Always - 252555 2 Throughput_Performance 0x0005 100 100 030 Pre-fail Offline - 14483456 3 Spin_Up_Time 0x0003 100 100 025 Pre-fail Always - 1 4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 6 5 Reallocated_Sector_Ct 0x0033 100 100 024 Pre-fail Always - 8589934592000 7 Seek_Error_Rate 0x000f 100 100 047 Pre-fail Always - 1744 8 Seek_Time_Performance 0x0005 100 100 019 Pre-fail Offline - 0 9 Power_On_Hours 0x0032 006 006 000 Old_age Always - 170094793 10 Spin_Retry_Count 0x0013 100 100 020 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 5 192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 1 193 Load_Cycle_Count 0x0032 100 100 000 Old_age Always - 56 194 Temperature_Celsius 0x0022 100 100 000 Old_age Always - 39 (Lifetime Min/Max 21/42) 195 Hardware_ECC_Recovered 0x001a 100 100 000 Old_age Always - 10501 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 287440896 197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 1 198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 1 199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age Always - 0 200 Multi_Zone_Error_Rate 0x000f 100 100 060 Pre-fail Always - 7368 203 Run_Out_Cancel 0x0002 100 100 000 Old_age Always - 2628586044629 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Conveyance off-line Completed without error 00% 47248 - # 2 Extended captive Interrupted (host reset) 90% 47248 - # 3 Short off-line Completed without error 00% 47248 - # 4 Short captive Interrupted (host reset) 90% 47248 - # 5 Short captive Interrupted (host reset) 90% 47247 - # 6 Short off-line Completed without error 00% 47247 - # 7 Extended off-line Completed: read failure 20% 47007 0x042798c1 # 8 Short off-line Completed without error 00% 47007 - As you can see in the selftests results, when I run a long selftest, the output is that I have an error. But there is no information about what's wrong with the system, or how can I fix it. Does anyone know how to identify the error? Any help is welcomed. BR, Alberto Martínez |
|
From: Marti R. <ma...@ju...> - 2010-02-08 18:20:03
|
On Mon, Feb 8, 2010 at 7:55 PM, PRI...@te... <PRI...@te...> wrote: > Self-test execution status: ( 245) Self-test routine in progress... > 50% of test remaining. Have some patience! The test is still running. > # 1 Conveyance off-line Completed without error 00% 47248 - > # 2 Extended captive Interrupted (host reset) 90% 47248 - > # 3 Short off-line Completed without error 00% 47248 - > # 4 Short captive Interrupted (host reset) 90% 47248 - > # 5 Short captive Interrupted (host reset) 90% 47247 - No error above, you probably issued a SMART command that aborted the previous test. Or rebooted the computer. Or your controller is quirky. > # 7 Extended off-line Completed: read failure 20% 47007 0x042798c1 This test found that sector 0x042798c1 is bad. > 197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 1 > 198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 1 As confirmed by these attributes... 1 bad sector found in the raw column. > 5 Reallocated_Sector_Ct 0x0033 100 100 024 Pre-fail Always - 8589934592000 8589934592000 == 0x7d000000000 smartmontools doesn't know how to decode this raw value, but "100 100" means that it should be perfectly fine. > As you can see in the selftests results, when I run a long selftest, the > output is that I have an error. But there is no information about what's > wrong with the system, or how can I fix it. The next time you write to this bad sector, it will probably be reallocated by the disk firmware. No big deal, these do happen occasionally. Only when you have suddenly increasing values, there's reason to worry. Please, do some research yourself before shooting for the mailing lists. Or buy commercial support for your Linux distribution vendor. Regards, Marti |
|
From: Gabriele P. <gp...@di...> - 2010-02-08 21:12:42
|
Hi Alberto, On Mon, 2010-02-08 at 18:55 +0100, PRI...@te... wrote: > [root@io2 /root]# smartctl -a /dev/hdc > smartctl version 5.1-18 Copyright (C) 2002-3 Bruce Allen ^^^ this is a very very very old version of smartmontools. Current release is 5.39-1 You should install the newest version, that your distribution has. > Num Test_Description Status Remaining > LifeTime(hours) LBA_of_first_error > # 7 Extended off-line Completed: read failure 20% 47007 > 0x042798c1 > > As you can see in the selftests results, when I run a long selftest, > the output is that I have an error. But there is no information about > what's wrong with the system, or how can I fix it. Have a look in our FAQs. Under section "Selftests" you will find the question: "ATA drive is failing self-tests, but SMART health status is 'PASSED'. What's going on?" http://sourceforge.net/apps/trac/smartmontools/wiki/FAQ#ATAdriveisfailingself-testsbutSMARThealthstatusisPASSED.Whatsgoingon HTH, Gabriele |