From: Mertens B. <bra...@li...> - 2006-06-20 21:19:42
|
Hi I posted a question about my drives earlier but perhaps I didn't include the right info so I got no response. Please let me know if I need to supply any more details. Smartd has reported the following message twice now: The following warning/error was logged by the smartd daemon: Device: /dev/sdb, Failed SMART usage Attribute: 190 Unknown_Attribute. And logwatch is reporting the following at the moment: ################### Logwatch 7.3 (03/24/06) #################### Processing Initiated: Tue Jun 20 06:25:06 2006 Date Range Processed: yesterday ( 2006-Jun-19 ) Period is day. Detail Level of Output: 5 Type of Output: unformatted Logfiles for Host: valinor /dev/sdb : Prefailure: Raw_Read_Error_Rate (1) changed to 110, 111, 112, 110, 107, 106, 107, 105, 104, 107, 106, 107, 106, 107, 105, 108, 110, 111, 106, 107, 105, 106, 105, 108, 107, 106, 108, 109, 108, 109, 103, 104, 105, 104, 105, 106, Usage: Hardware_ECC_Recovered (195) changed to 60, 61, 60, 61, 62, 63, 64, 63, 61, 62, 60, 61, 62, 64, 65, 64, 63, 62, 61, 60, 61, 62, 63, Usage: Temperature_Celsius (194) changed to 55, 54, 55, 54, 55, 54, 55, 54, 55, 54, 55, 54, 55, 54, 55, 54, 55, 54, Usage: Unknown_Attribute (190) changed to 45, 46, 45, 46, 45, 46, 45, 46, 45, 46, 45, 46, 45, 46, 45, 46, 45, 46, /dev/sdb : Failed usage attribute: Unknown_Attribute (190) 29 Time(s) So I ran the long selftest using: smartctl -t long /dev/sdb -d ata After the test completed I get the following report: smartctl version 5.36 [i686-pc-linux-gnu] Copyright (C) 2002-6 Bruce Allen Home page is http://smartmontools.sourceforge.net/ === START OF INFORMATION SECTION === Device Model: ST3200827AS Serial Number: 5ND2WL7K Firmware Version: 3.AAE User Capacity: 200,049,647,616 bytes Device is: Not in smartctl database [for details use: -P showall] ATA Version is: 7 ATA Standard is: Exact ATA specification draft version not indicated Local Time is: Tue Jun 20 23:13:36 2006 CEST SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED See vendor-specific Attribute list for marginal Attributes. General SMART Values: Offline data collection status: (0x82) Offline data collection activity was completed without error. Auto Offline Data Collection: Enabled. Self-test execution status: ( 249) Self-test routine in progress... 90% of test remaining. Total time to complete Offline data collection: ( 430) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 1) minutes. Extended self-test routine recommended polling time: ( 70) minutes. SMART Attributes Data Structure revision number: 10 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000f 105 091 006 Pre-fail Always - 64051840 3 Spin_Up_Time 0x0003 100 100 000 Pre-fail Always - 0 4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 1 5 Reallocated_Sector_Ct 0x0033 100 100 036 Pre-fail Always - 0 7 Seek_Error_Rate 0x000f 078 060 030 Pre-fail Always - 66493810 9 Power_On_Hours 0x0032 099 099 000 Old_age Always - 1158 10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 3 187 Unknown_Attribute 0x0032 100 100 000 Old_age Always - 0 189 Unknown_Attribute 0x003a 100 100 000 Old_age Always - 0 190 Unknown_Attribute 0x0022 045 041 045 Old_age Always FAILING_NOW 35967047499831 194 Temperature_Celsius 0x0022 055 059 000 Old_age Always - 55 (Lifetime Min/Max 0/23) 195 Hardware_ECC_Recovered 0x001a 059 057 000 Old_age Always - 27838158 197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age Always - 0 200 Multi_Zone_Error_Rate 0x0000 100 253 000 Old_age Offline - 0 202 TA_Increase_Count 0x0032 100 253 000 Old_age Always - 0 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Extended offline Completed without error 00% 1156 - # 2 Extended offline Completed without error 00% 257 - # 3 Extended offline Completed without error 00% 52 - # 4 Short offline Completed without error 00% 50 - # 5 Extended offline Completed without error 00% 3 - # 6 Short offline Completed without error 00% 2 - # 7 Extended offline Aborted by host 90% 2 - # 8 Extended offline Self-test routine in progress 90% 1158 - SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. Although the overall result is PASSED the unknown attribute 190 is reported "FAILING NOW". This drive is a replacement for a drive that failed so I would like to know soon if this is ok or not so I can return it if necessary. Are there other tests I can/should run? This disk is usually not mounted as I inted to use it as a backup medium for my primary disk. Thanks in advance Bram -- # Mertens Bram "M8ram" <bra...@li...> Linux User #349737 # # debian testing kernel 2.6.15-1-686 i686 1024MB RAM # # 23:08:34 up 48 days, 23:34, 6 users, load average: 0.04, 0.04, 0.00 # |
From: Manfred S. <man...@gm...> - 2006-06-20 21:50:52
|
-------- Original-Nachricht -------- Datum: Tue, 20 Jun 2006 23:18:54 +0200 Von: Mertens Bram <bra...@li...> An: sma...@li... Betreff: [smartmontools-support] What other test can I run to determine if this disk is failing? > Hi > > I posted a question about my drives earlier but perhaps I didn't > include the right info so I got no response. Please let me know if I > need to supply any more details. > > Smartd has reported the following message twice now: > The following warning/error was logged by the smartd daemon: > > Device: /dev/sdb, Failed SMART usage Attribute: 190 Unknown_Attribute. > > 190 Unknown_Attribute 0x0022 045 041 045 Old_age Always > FAILING_NOW 35967047499831 > 194 Temperature_Celsius 0x0022 055 059 000 Old_age Always > - 55 (Lifetime Min/Max 0/23) Just guessing, this could be a temperature information, altough the raw value looks a bit strange. Some Harddisks have on Attribute 190 some sort of additional temperature indication (Western Digital, Samsung), and sometimes they have a failure threshold set which corresponds to the maximum design operating temperature of the disk, which is mostly at 55 degrees. If your hard disk gets cooler (Attribute 194 less than 55 degrees), does attribute 190 change from "FAILING_NOW" to "IN_THE_PAST" ? If it's indeed a temperature indication, then this means that your disk temperature is simply over the maximum design operating temperature. Do put some ventilator in your box. regards, Manfred -- Der GMX SmartSurfer hilft bis zu 70% Ihrer Onlinekosten zu sparen! Ideal für Modem und ISDN: http://www.gmx.net/de/go/smartsurfer |
From: Mertens B. <bra...@li...> - 2006-06-21 18:44:26
|
On 2006-06-20, Manfred Schwarb wrote: > > -------- Original-Nachricht -------- > > Smartd has reported the following message twice now: > > The following warning/error was logged by the smartd daemon: > > > > Device: /dev/sdb, Failed SMART usage Attribute: 190 Unknown_Attribute. > > > > > 190 Unknown_Attribute 0x0022 045 041 045 Old_age Always > > FAILING_NOW 35967047499831 > > 194 Temperature_Celsius 0x0022 055 059 000 Old_age Always > > - 55 (Lifetime Min/Max 0/23) > > > Just guessing, this could be a temperature information, altough the > raw value looks a bit strange. Some Harddisks have on Attribute 190 some > sort of additional temperature indication (Western Digital, Samsung), > and sometimes they have a failure threshold set which corresponds to > the maximum design operating temperature of the disk, which is mostly > at 55 degrees. > If your hard disk gets cooler (Attribute 194 less than 55 degrees), > does attribute 190 change from "FAILING_NOW" to "IN_THE_PAST" ? > > If it's indeed a temperature indication, then this means that your > disk temperature is simply over the maximum design operating > temperature. > Do put some ventilator in your box. I powered down the machine yesterday evening and have hooked up the extra ventilator in the case. The machine is making a hell of a noise right now but the attribute 190 has indeed changed to "In_the_past" as the temperature (attribute 194) dropped to 51: 190 Unknown_Attribute 0x0022 049 041 045 Old_age Always In_the_past 35996995158067 194 Temperature_Celsius 0x0022 051 059 000 Old_age Always - 51 (Lifetime Min/Max 0/23) Does this mean I can safely ignore these warnings? Or is it more likely damage has already been caused? Should I worry about the Raw_Read_Error_Rate and Seek_Error_Rate reported by logwatch? 1 Raw_Read_Error_Rate 0x000f 114 091 006 Pre-fail Always - 73127485 7 Seek_Error_Rate 0x000f 078 060 030 Pre-fail Always - 67253982 My other disk (located immediately on top of this one) has a temperature reading of 57 at the moment which is probably still too hot but already much better tahn the +60 from the previous days. Someone suggested that I move the disks away from each other to improve the cooling. The only thing I can do due to the structure of the case is to move my floppy drive between both disks. Would this help to cool the disks down? Thanks for your feedback Bram -- # Mertens Bram "M8ram" <bra...@li...> Linux User #349737 # # debian testing kernel 2.6.15-1-686 i686 1024MB RAM # # 20:32:07 up 2:09, 3 users, load average: 0.00, 0.01, 0.00 # |
From: Manfred S. <man...@gm...> - 2006-06-23 10:44:32
|
-------- Original-Nachricht -------- Datum: Wed, 21 Jun 2006 20:44:34 +0200 Von: Mertens Bram <bra...@li...> An: sma...@li... Betreff: Re: [smartmontools-support] What other test can I run to determine if this disk is failing? > On 2006-06-20, Manfred Schwarb wrote: > > > > -------- Original-Nachricht -------- > > > Smartd has reported the following message twice now: > > > The following warning/error was logged by the smartd daemon: > > > > > > Device: /dev/sdb, Failed SMART usage Attribute: 190 Unknown_Attribute. > > > > > > > > 190 Unknown_Attribute 0x0022 045 041 045 Old_age > Always > > > FAILING_NOW 35967047499831 > > > 194 Temperature_Celsius 0x0022 055 059 000 Old_age > Always > > > - 55 (Lifetime Min/Max 0/23) > > > > > > Just guessing, this could be a temperature information, altough the > > raw value looks a bit strange. Some Harddisks have on Attribute 190 some > > sort of additional temperature indication (Western Digital, Samsung), > > and sometimes they have a failure threshold set which corresponds to > > the maximum design operating temperature of the disk, which is mostly > > at 55 degrees. > > If your hard disk gets cooler (Attribute 194 less than 55 degrees), > > does attribute 190 change from "FAILING_NOW" to "IN_THE_PAST" ? > > > > If it's indeed a temperature indication, then this means that your > > disk temperature is simply over the maximum design operating > > temperature. > > Do put some ventilator in your box. > > I powered down the machine yesterday evening and have hooked up the > extra ventilator in the case. The machine is making a hell of a noise > right now but the attribute 190 has indeed changed to "In_the_past" as > the temperature (attribute 194) dropped to 51: > 190 Unknown_Attribute 0x0022 049 041 045 Old_age Always > In_the_past 35996995158067 > 194 Temperature_Celsius 0x0022 051 059 000 Old_age Always > - 51 (Lifetime Min/Max 0/23) > > Does this mean I can safely ignore these warnings? Or is it more > likely damage has already been caused? > No, I would not ignore these warnings, hot disks are aging much faster than at moderate temperatures. Try at least to get below 50 degrees. And apart from this, I'm not a fortune-teller ... > Should I worry about the Raw_Read_Error_Rate and Seek_Error_Rate > reported by logwatch? > 1 Raw_Read_Error_Rate 0x000f 114 091 006 Pre-fail > Always - 73127485 > 7 Seek_Error_Rate 0x000f 078 060 030 Pre-fail > Always - 67253982 > > > My other disk (located immediately on top of this one) has a > temperature reading of 57 at the moment which is probably still too > hot but already much better tahn the +60 from the previous days. > Someone suggested that I move the disks away from each other to > improve the cooling. The only thing I can do due to the structure of > the case is to move my floppy drive between both disks. Would this > help to cool the disks down? > Case fans normally are not very effective for disk cooling. Try to attach a fan directly in front of or besides the disk. Even if you have to do some improvised botch with cord, tape or whatever ;-)) And separating disk will help. You can also put one of the disks into a 5 inch slot with appropiate adapters, e.g. if you have only one cd device. regards, Manfred > Thanks for your feedback > > Bram > > -- > # Mertens Bram "M8ram" <bra...@li...> Linux User #349737 # > # debian testing kernel 2.6.15-1-686 i686 1024MB RAM # > # 20:32:07 up 2:09, 3 users, load average: 0.00, 0.01, 0.00 # > > All the advantages of Linux Managed Hosting--Without the Cost and Risk! > Fully trained technicians. The highest number of Red Hat certifications in > the hosting industry. Fanatical Support. Click to learn more > http://sel.as-us.falkag.net/sel?cmd=lnk&kid=107521&bid=248729&dat=121642 > _______________________________________________ > Smartmontools-support mailing list > Sma...@li... > https://lists.sourceforge.net/lists/listinfo/smartmontools-support -- "Feel free" – 10 GB Mailbox, 100 FreeSMS/Monat ... Jetzt GMX TopMail testen: http://www.gmx.net/de/go/topmail |