From: Orion P. <or...@co...> - 2005-01-10 18:51:46
|
I'm getting the following at boot from one of our machines: This email was generated by the smartd daemon running on: host name: sombrero DNS domain: cora.nwra.com NIS domain: yp.colorado-research.com The following warning/error was logged by the smartd daemon: Device: /dev/hda, 1 Currently unreadable (pending) sectors For details see host's SYSLOG (default: /var/log/messages). You can also use the smartctl utility for further investigation. No additional email messages about this problem will be sent. -- It appears that this is an old condition, though I'm not sure. Is there any way to clear it? Thanks! [root@sombrero ~]# smartctl -a /dev/hda smartctl version 5.33 [i386-redhat-linux-gnu] Copyright (C) 2002-4 Bruce Allen Home page is http://smartmontools.sourceforge.net/ === START OF INFORMATION SECTION === Device Model: IBM-DARA-212000 Serial Number: AH0AH1Z6434 Firmware Version: AR4OA54A User Capacity: 12,072,517,632 bytes Device is: In smartctl database [for details use: -P show] ATA Version is: 4 ATA Standard is: ATA/ATAPI-4 T13 1153D revision 17 Local Time is: Mon Jan 10 11:50:02 2005 MST SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x00) Offline data collection activity was never started. Auto Offline Data Collection: Disabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (1950) seconds. Offline data collection capabilities: (0x1b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. No Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. No General Purpose Logging support. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 22) minutes. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000b 100 100 062 Pre-fail Always - 0 2 Throughput_Performance 0x0005 100 100 040 Pre-fail Offline - 0 3 Spin_Up_Time 0x0007 129 129 033 Pre-fail Always - 1 4 Start_Stop_Count 0x0012 100 100 000 Old_age Always - 896 5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail Always - 0 7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail Always - 0 8 Seek_Time_Performance 0x0005 100 100 040 Pre-fail Offline - 0 9 Power_On_Hours 0x0012 025 025 000 Old_age Always - 33041 10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 863 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 8 197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 1 198 Offline_Uncorrectable 0x0008 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x000a 200 200 000 Old_age Always - 0 SMART Error Log Version: 1 ATA Error Count: 122 (device log contains only the most recent five errors) CR = Command Register [HEX] FR = Features Register [HEX] SC = Sector Count Register [HEX] SN = Sector Number Register [HEX] CL = Cylinder Low Register [HEX] CH = Cylinder High Register [HEX] DH = Device/Head Register [HEX] DC = Device Command Register [HEX] ER = Error register [HEX] ST = Status register [HEX] Powered_Up_Time is measured from power on, and printed as DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes, SS=sec, and sss=millisec. It "wraps" after 49.710 days. Error 122 occurred at disk power-on lifetime: 16619 hours (692 days + 11 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 18 07 02 0c a0 Error: UNC 24 sectors at LBA = 0x000c0207 = 786951 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- c8 00 20 ff 01 0c e0 00 00:05:33.700 READ DMA c8 00 28 f7 01 0c e0 00 00:05:27.200 READ DMA c8 00 30 ef 01 0c e0 00 00:05:20.700 READ DMA c8 00 38 e7 01 0c e0 00 00:05:14.200 READ DMA c8 00 40 df 01 0c e0 00 00:05:07.800 READ DMA Error 121 occurred at disk power-on lifetime: 16619 hours (692 days + 11 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 18 07 02 0c a0 Error: UNC 24 sectors at LBA = 0x000c0207 = 786951 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- c8 00 28 f7 01 0c e0 00 00:05:27.200 READ DMA c8 00 30 ef 01 0c e0 00 00:05:20.700 READ DMA c8 00 38 e7 01 0c e0 00 00:05:14.200 READ DMA c8 00 40 df 01 0c e0 00 00:05:07.800 READ DMA c8 00 40 9f 01 0c e0 00 00:05:07.800 READ DMA Error 120 occurred at disk power-on lifetime: 16619 hours (692 days + 11 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 18 07 02 0c a0 Error: UNC 24 sectors at LBA = 0x000c0207 = 786951 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- c8 00 30 ef 01 0c e0 00 00:05:20.700 READ DMA c8 00 38 e7 01 0c e0 00 00:05:14.200 READ DMA c8 00 40 df 01 0c e0 00 00:05:07.800 READ DMA c8 00 40 9f 01 0c e0 00 00:05:07.800 READ DMA c8 00 40 5f 01 0c e0 00 00:05:07.800 READ DMA Error 119 occurred at disk power-on lifetime: 16619 hours (692 days + 11 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 18 07 02 0c a0 Error: UNC 24 sectors at LBA = 0x000c0207 = 786951 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- c8 00 38 e7 01 0c e0 00 00:05:14.200 READ DMA c8 00 40 df 01 0c e0 00 00:05:07.800 READ DMA c8 00 40 9f 01 0c e0 00 00:05:07.800 READ DMA c8 00 40 5f 01 0c e0 00 00:05:07.800 READ DMA c8 00 40 1f 01 0c e0 00 00:05:07.800 READ DMA Error 118 occurred at disk power-on lifetime: 16619 hours (692 days + 11 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 18 07 02 0c a0 Error: UNC 24 sectors at LBA = 0x000c0207 = 786951 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- c8 00 40 df 01 0c e0 00 00:05:07.800 READ DMA c8 00 40 9f 01 0c e0 00 00:05:07.800 READ DMA c8 00 40 5f 01 0c e0 00 00:05:07.800 READ DMA c8 00 40 1f 01 0c e0 00 00:05:07.800 READ DMA c8 00 08 17 01 0c e0 00 00:05:07.800 READ DMA SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Extended offline Completed without error 00% 32962 - Device does not support Selective Self Tests/Logging -- Orion Poplawski System Administrator 303-415-9701 x222 Colorado Research Associates/NWRA FAX: 303-415-9702 3380 Mitchell Lane, Boulder CO 80301 http://www.co-ra.com |
From: Bruce A. <ba...@gr...> - 2005-01-10 21:50:26
|
On some disks you can clear it: see smartmontools web page FAQ for pointers. Bruce On Mon, 10 Jan 2005, Orion Poplawski wrote: > I'm getting the following at boot from one of our machines: > > This email was generated by the smartd daemon running on: > > host name: sombrero > DNS domain: cora.nwra.com > NIS domain: yp.colorado-research.com > > The following warning/error was logged by the smartd daemon: > > Device: /dev/hda, 1 Currently unreadable (pending) sectors > > For details see host's SYSLOG (default: /var/log/messages). > > You can also use the smartctl utility for further investigation. > No additional email messages about this problem will be sent. > > > -- > > It appears that this is an old condition, though I'm not sure. Is > there any way to clear it? Thanks! > > > [root@sombrero ~]# smartctl -a /dev/hda > smartctl version 5.33 [i386-redhat-linux-gnu] Copyright (C) 2002-4 Bruce > Allen > Home page is http://smartmontools.sourceforge.net/ > > === START OF INFORMATION SECTION === > Device Model: IBM-DARA-212000 > Serial Number: AH0AH1Z6434 > Firmware Version: AR4OA54A > User Capacity: 12,072,517,632 bytes > Device is: In smartctl database [for details use: -P show] > ATA Version is: 4 > ATA Standard is: ATA/ATAPI-4 T13 1153D revision 17 > Local Time is: Mon Jan 10 11:50:02 2005 MST > SMART support is: Available - device has SMART capability. > SMART support is: Enabled > > === START OF READ SMART DATA SECTION === > SMART overall-health self-assessment test result: PASSED > > General SMART Values: > Offline data collection status: (0x00) Offline data collection activity > was never started. > Auto Offline Data Collection: > Disabled. > Self-test execution status: ( 0) The previous self-test routine > completed > without error or no self-test > has ever > been run. > Total time to complete Offline > data collection: (1950) seconds. > Offline data collection > capabilities: (0x1b) SMART execute Offline immediate. > Auto Offline data collection > on/off support. > Suspend Offline collection upon new > command. > Offline surface scan supported. > Self-test supported. > No Conveyance Self-test supported. > No Selective Self-test supported. > SMART capabilities: (0x0003) Saves SMART data before entering > power-saving mode. > Supports SMART auto save timer. > Error logging capability: (0x01) Error logging supported. > No General Purpose Logging support. > Short self-test routine > recommended polling time: ( 2) minutes. > Extended self-test routine > recommended polling time: ( 22) minutes. > > SMART Attributes Data Structure revision number: 16 > Vendor Specific SMART Attributes with Thresholds: > ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE > UPDATED WHEN_FAILED RAW_VALUE > 1 Raw_Read_Error_Rate 0x000b 100 100 062 Pre-fail > Always - 0 > 2 Throughput_Performance 0x0005 100 100 040 Pre-fail > Offline - 0 > 3 Spin_Up_Time 0x0007 129 129 033 Pre-fail > Always - 1 > 4 Start_Stop_Count 0x0012 100 100 000 Old_age > Always - 896 > 5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail > Always - 0 > 7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail > Always - 0 > 8 Seek_Time_Performance 0x0005 100 100 040 Pre-fail > Offline - 0 > 9 Power_On_Hours 0x0012 025 025 000 Old_age > Always - 33041 > 10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail > Always - 0 > 12 Power_Cycle_Count 0x0032 100 100 000 Old_age > Always - 863 > 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always > - 8 > 197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always > - 1 > 198 Offline_Uncorrectable 0x0008 100 100 000 Old_age > Offline - 0 > 199 UDMA_CRC_Error_Count 0x000a 200 200 000 Old_age Always > - 0 > > SMART Error Log Version: 1 > ATA Error Count: 122 (device log contains only the most recent five errors) > CR = Command Register [HEX] > FR = Features Register [HEX] > SC = Sector Count Register [HEX] > SN = Sector Number Register [HEX] > CL = Cylinder Low Register [HEX] > CH = Cylinder High Register [HEX] > DH = Device/Head Register [HEX] > DC = Device Command Register [HEX] > ER = Error register [HEX] > ST = Status register [HEX] > Powered_Up_Time is measured from power on, and printed as > DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes, > SS=sec, and sss=millisec. It "wraps" after 49.710 days. > > Error 122 occurred at disk power-on lifetime: 16619 hours (692 days + 11 > hours) > When the command that caused the error occurred, the device was > active or idle. > > After command completion occurred, registers were: > ER ST SC SN CL CH DH > -- -- -- -- -- -- -- > 40 51 18 07 02 0c a0 Error: UNC 24 sectors at LBA = 0x000c0207 = 786951 > > Commands leading to the command that caused the error were: > CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name > -- -- -- -- -- -- -- -- ---------------- -------------------- > c8 00 20 ff 01 0c e0 00 00:05:33.700 READ DMA > c8 00 28 f7 01 0c e0 00 00:05:27.200 READ DMA > c8 00 30 ef 01 0c e0 00 00:05:20.700 READ DMA > c8 00 38 e7 01 0c e0 00 00:05:14.200 READ DMA > c8 00 40 df 01 0c e0 00 00:05:07.800 READ DMA > > Error 121 occurred at disk power-on lifetime: 16619 hours (692 days + 11 > hours) > When the command that caused the error occurred, the device was > active or idle. > > After command completion occurred, registers were: > ER ST SC SN CL CH DH > -- -- -- -- -- -- -- > 40 51 18 07 02 0c a0 Error: UNC 24 sectors at LBA = 0x000c0207 = 786951 > > Commands leading to the command that caused the error were: > CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name > -- -- -- -- -- -- -- -- ---------------- -------------------- > c8 00 28 f7 01 0c e0 00 00:05:27.200 READ DMA > c8 00 30 ef 01 0c e0 00 00:05:20.700 READ DMA > c8 00 38 e7 01 0c e0 00 00:05:14.200 READ DMA > c8 00 40 df 01 0c e0 00 00:05:07.800 READ DMA > c8 00 40 9f 01 0c e0 00 00:05:07.800 READ DMA > > Error 120 occurred at disk power-on lifetime: 16619 hours (692 days + 11 > hours) > When the command that caused the error occurred, the device was > active or idle. > > After command completion occurred, registers were: > ER ST SC SN CL CH DH > -- -- -- -- -- -- -- > 40 51 18 07 02 0c a0 Error: UNC 24 sectors at LBA = 0x000c0207 = 786951 > > Commands leading to the command that caused the error were: > CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name > -- -- -- -- -- -- -- -- ---------------- -------------------- > c8 00 30 ef 01 0c e0 00 00:05:20.700 READ DMA > c8 00 38 e7 01 0c e0 00 00:05:14.200 READ DMA > c8 00 40 df 01 0c e0 00 00:05:07.800 READ DMA > c8 00 40 9f 01 0c e0 00 00:05:07.800 READ DMA > c8 00 40 5f 01 0c e0 00 00:05:07.800 READ DMA > > Error 119 occurred at disk power-on lifetime: 16619 hours (692 days + 11 > hours) > When the command that caused the error occurred, the device was > active or idle. > > After command completion occurred, registers were: > ER ST SC SN CL CH DH > -- -- -- -- -- -- -- > 40 51 18 07 02 0c a0 Error: UNC 24 sectors at LBA = 0x000c0207 = 786951 > > Commands leading to the command that caused the error were: > CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name > -- -- -- -- -- -- -- -- ---------------- -------------------- > c8 00 38 e7 01 0c e0 00 00:05:14.200 READ DMA > c8 00 40 df 01 0c e0 00 00:05:07.800 READ DMA > c8 00 40 9f 01 0c e0 00 00:05:07.800 READ DMA > c8 00 40 5f 01 0c e0 00 00:05:07.800 READ DMA > c8 00 40 1f 01 0c e0 00 00:05:07.800 READ DMA > > Error 118 occurred at disk power-on lifetime: 16619 hours (692 days + 11 > hours) > When the command that caused the error occurred, the device was > active or idle. > > After command completion occurred, registers were: > ER ST SC SN CL CH DH > -- -- -- -- -- -- -- > 40 51 18 07 02 0c a0 Error: UNC 24 sectors at LBA = 0x000c0207 = 786951 > > Commands leading to the command that caused the error were: > CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name > -- -- -- -- -- -- -- -- ---------------- -------------------- > c8 00 40 df 01 0c e0 00 00:05:07.800 READ DMA > c8 00 40 9f 01 0c e0 00 00:05:07.800 READ DMA > c8 00 40 5f 01 0c e0 00 00:05:07.800 READ DMA > c8 00 40 1f 01 0c e0 00 00:05:07.800 READ DMA > c8 00 08 17 01 0c e0 00 00:05:07.800 READ DMA > > SMART Self-test log structure revision number 1 > Num Test_Description Status Remaining > LifeTime(hours) LBA_of_first_error > # 1 Extended offline Completed without error 00% 32962 > - > > Device does not support Selective Self Tests/Logging > > -- > Orion Poplawski > System Administrator 303-415-9701 x222 > Colorado Research Associates/NWRA FAX: 303-415-9702 > 3380 Mitchell Lane, Boulder CO 80301 http://www.co-ra.com > > > ------------------------------------------------------- > The SF.Net email is sponsored by: Beat the post-holiday blues > Get a FREE limited edition SourceForge.net t-shirt from ThinkGeek. > It's fun and FREE -- well, almost....http://www.thinkgeek.com/sfshirt > _______________________________________________ > Smartmontools-support mailing list > Sma...@li... > https://lists.sourceforge.net/lists/listinfo/smartmontools-support > > |