From: Andriy G. <av...@ic...> - 2009-04-02 14:00:51
|
I have the same problem as described here: http://thread.gmane.org/gmane.linux.utilities.smartmontools/5995/focus=6000 and here: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=503439 Does anybody have any guesses? I use FreeBSD, my system is ICH9-based (DG33TL), the disk is ST3500410AS, AHCI mode is configured in BIOS. Full smartctl -a output after some attempts to run various self-tests: smartctl version 5.38 [amd64-portbld-freebsd7.1] Copyright (C) 2002-8 Bruce Allen Home page is http://smartmontools.sourceforge.net/ === START OF INFORMATION SECTION === Device Model: ST3500410AS Serial Number: 5VM0NB43 Firmware Version: CC34 User Capacity: 500,107,862,016 bytes Device is: Not in smartctl database [for details use: -P showall] ATA Version is: 8 ATA Standard is: ATA-8-ACS revision 4 Local Time is: Thu Apr 2 16:32:02 2009 EEST SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x82) Offline data collection activity was completed without error. Auto Offline Data Collection: Enabled. Self-test execution status: ( 249) Self-test routine in progress... 90% of test remaining. Total time to complete Offline data collection: ( 600) seconds. Offline data collection capabilities: (0x7b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 1) minutes. Extended self-test routine recommended polling time: ( 94) minutes. Conveyance self-test routine recommended polling time: ( 2) minutes. SCT capabilities: (0x103f) SCT Status supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 10 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000f 108 099 006 Pre-fail Always - 18711630 3 Spin_Up_Time 0x0003 100 100 000 Pre-fail Always - 0 4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 2 5 Reallocated_Sector_Ct 0x0033 100 100 036 Pre-fail Always - 0 7 Seek_Error_Rate 0x000f 068 060 030 Pre-fail Always - 6591276 9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 138 10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 2 183 Unknown_Attribute 0x0000 100 100 000 Old_age Offline - 0 184 Unknown_Attribute 0x0032 100 100 099 Old_age Always - 0 187 Reported_Uncorrect 0x0032 100 100 000 Old_age Always - 0 188 Unknown_Attribute 0x0032 100 100 000 Old_age Always - 0 189 High_Fly_Writes 0x003a 100 100 000 Old_age Always - 0 190 Airflow_Temperature_Cel 0x0022 059 055 045 Old_age Always - 41 (Lifetime Min/Max 37/45) 194 Temperature_Celsius 0x0022 041 045 000 Old_age Always - 41 (0 29 0 0) 195 Hardware_ECC_Recovered 0x001a 048 032 000 Old_age Always - 18711630 197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age Always - 0 240 Head_Flying_Hours 0x0000 100 253 000 Old_age Offline - 157522220548242 241 Unknown_Attribute 0x0000 100 253 000 Old_age Offline - 3431356807 242 Unknown_Attribute 0x0000 100 253 000 Old_age Offline - 2746629257 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Conveyance offline Self-test routine in progress 90% 138 - # 2 Short offline Aborted by host 90% 132 - # 3 Short offline Aborted by host 90% 132 - # 4 Extended offline Aborted by host 80% 116 - # 5 Extended offline Aborted by host 90% 46 - # 6 Extended offline Aborted by host 90% 39 - # 7 Extended offline Aborted by host 60% 37 - SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. -- Andriy Gapon |
From: Axel R. <gm...@me...> - 2009-04-25 20:45:15
|
Andriy Gapon <avg <at> icyb.net.ua> writes: > > > I have the same problem as described here: > http://thread.gmane.org/gmane.linux.utilities.smartmontools/5995/focus=6000 > and here: > http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=503439 > > Does anybody have any guesses? No, unfortunately I don't. For the sake of the people who understand this software more: the same happened to my on an older Seagate disk: ST3500630AS See output below Axel === START OF INFORMATION SECTION === Model Family: Seagate Barracuda 7200.10 family Device Model: ST3500630AS Serial Number: 9QG4KW2G Firmware Version: 3.AAE User Capacity: 500,107,862,016 bytes Device is: In smartctl database [for details use: -P show] ATA Version is: 7 ATA Standard is: Exact ATA specification draft version not indicated Local Time is: Sat Apr 25 22:36:03 2009 CEST SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x82) Offline data collection activity was completed without error. Auto Offline Data Collection: Enabled. Self-test execution status: ( 249) Self-test routine in progress... 90% of test remaining. Total time to complete Offline data collection: ( 430) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 1) minutes. Extended self-test routine recommended polling time: ( 163) minutes. SMART Attributes Data Structure revision number: 10 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000f 100 253 006 Pre-fail Always - 0 3 Spin_Up_Time 0x0003 098 098 000 Pre-fail Always - 0 4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 1 5 Reallocated_Sector_Ct 0x0033 100 100 036 Pre-fail Always - 0 7 Seek_Error_Rate 0x000f 100 253 030 Pre-fail Always - 9576 9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 0 10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 3 187 Reported_Uncorrect 0x0032 100 100 000 Old_age Always - 0 189 High_Fly_Writes 0x003a 100 100 000 Old_age Always - 0 190 Airflow_Temperature_Cel 0x0022 065 065 045 Old_age Always - 35 (Lifetime Min/Max 20/35) 194 Temperature_Celsius 0x0022 035 040 000 Old_age Always - 35 (0 20 0 0) 195 Hardware_ECC_Recovered 0x001a 100 253 000 Old_age Always - 535248 197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age Always - 0 200 Multi_Zone_Error_Rate 0x0000 100 253 000 Old_age Offline - 0 202 TA_Increase_Count 0x0032 100 253 000 Old_age Always - 0 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Short offline Self-test routine in progress 90% 0 - SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. |
From: steve l. <ste...@ya...> - 2009-05-31 22:27:59
|
I'm using the latest CVS version of smartctl (5.39 2009-05-24 16:30) to access drives on an Intel SRCS16 6-port SATA RAID controller (identical to LSI MegaRAID 150-6). I would like to understand why only partial SMART information is available. For example: server1:~/sm5 # ./smartctl -a -d megaraid,4 /dev/sda smartctl 5.39 2009-05-24 16:30 [x86_64-suse-linux] (local build) Copyright (C) 2002-9 by Bruce Allen, http://smartmontools.sourceforge.net Creating /dev/megadev0 = 17 Got MegaRAID inquiry.. ST3750330NS SN06 Device: ST3750330NS Version: SN06 Serial number: xxxxx Device type: disk Local Time is: Sat May 30 15:50:24 2009 PDT Device supports SMART and is Enabled Temperature Warning Disabled or Not Supported SMART Health Status: OK Error Counter logging not supported Device does not support Self Test logging The same drive connected directly to a SATA controller produces full SMART information. server1:~/sm5 # ./smartctl -a /dev/sdc smartctl 5.39 2009-05-24 16:30 [x86_64-suse-linux] (local build) Copyright (C) 2002-9 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Model Family: Seagate Barracuda ES.2 Device Model: ST3750330NS Serial Number: xxxxx Firmware Version: SN06 User Capacity: 750,156,374,016 bytes Device is: In smartctl database [for details use: -P show] ATA Version is: 8 ATA Standard is: ATA-8-ACS revision 4 Local Time is: Sat May 30 15:13:22 2009 PDT SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x82) Offline data collection activity was completed without error. Auto Offline Data Collection: Enabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: ( 642) seconds. Offline data collection capabilities: (0x7b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 1) minutes. Extended self-test routine recommended polling time: ( 177) minutes. Conveyance self-test routine recommended polling time: ( 2) minutes. SCT capabilities: (0x103d) SCT Status supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 10 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000f 100 100 044 Pre-fail Always - 8817 3 Spin_Up_Time 0x0003 100 100 000 Pre-fail Always - 0 4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 5 5 Reallocated_Sector_Ct 0x0033 100 100 036 Pre-fail Always - 0 7 Seek_Error_Rate 0x000f 100 253 030 Pre-fail Always - 14414 9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 4 10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 5 184 Unknown_Attribute 0x0032 100 100 099 Old_age Always - 0 187 Reported_Uncorrect 0x0032 100 100 000 Old_age Always - 0 188 Unknown_Attribute 0x0032 100 253 000 Old_age Always - 0 189 High_Fly_Writes 0x003a 100 100 000 Old_age Always - 0 190 Airflow_Temperature_Cel 0x0022 065 057 045 Old_age Always - 35 (Lifetime Min/Max 31/36) 194 Temperature_Celsius 0x0022 035 043 000 Old_age Always - 35 (0 24 0 0) 195 Hardware_ECC_Recovered 0x001a 100 100 000 Old_age Always - 8817 197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x003e 200 253 000 Old_age Always - 0 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Extended offline Completed without error 00% 2 - # 2 Short offline Completed without error 00% 0 - SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. The motherboard is Intel SE7520BD2 with dual Xeon processors. The OS is 64 bit Suse Linux Enterprise Server (SLES) 10 SP3 with kernel 2.6.16 server1:~ # uname -a Linux server1 2.6.16.60-0.37_f594963d-smp #1 SMP Mon Mar 23 13:39:48 UTC 2009 x86_64 x86_64 x86_64 GNU/Linux I'm using the MegaRAID driver included with SLES10SP3, specifically server1:~ # grep ega /var/log/boot.msg <6>megaraid cmm: 2.20.2.8 (Release Date: Tue Dec 18 12:12:25 PST 2007) <6>megaraid: 2.20.5.2 (Release Date: Tue Dec 18 12:12:25 PST 2007) <6>megaraid: probe new device 0x1000:0x1960:0x8086:0x0523: bus 2:slot 3:func 0 <5>megaraid: fw version:[713S] bios version:[G401] <6>scsi0 : LSI Logic MegaRAID driver <5> Vendor: MegaRAID Model: LD 0 RAID1 76G Rev: 713S <5> Vendor: MegaRAID Model: LD 1 RAID1 381G Rev: 713S Before I dig into this it would be good to know if it's a known issue with a workaround. Steve |