From: Tudor H. <tu...@sm...> - 2013-06-21 03:33:35
|
Hi, All of my drives claim to enable smart on request using -s on and return "SMART Enabled.", however, afterwards, only one of them actually has enabled smart. Can you see why? Could this be a drive/controller card failure, or is it something more obvious that I'm doing wrong? I've tried with -T permissive but that didn't work either. Thanks! Tudor. $ sudo smartctl -a /dev/sda smartctl 5.42 2011-10-20 r3458 [x86_64-linux-2.6.30-gentoo-r4] (local build) Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Model Family: Western Digital Caviar Green (Adv. Format) Device Model: WDC WD10EARX-00PASB0 Serial Number: WD-WCAZA9117964 Firmware Version: 0953 User Capacity: 1,000,204,886,016 bytes [1.00 TB] Sector Size: 512 bytes logical/physical Device is: In smartctl database [for details use: -P show] ATA Version is: 7 ATA Standard is: Exact ATA specification draft version not indicated Local Time is: Fri Jun 21 13:07:54 2013 EST SMART support is: Available - device has SMART capability. SMART support is: Disabled SMART Disabled. Use option -s with argument 'on' to enable it. $ sudo smartctl -s on /dev/sda smartctl 5.42 2011-10-20 r3458 [x86_64-linux-2.6.30-gentoo-r4] (local build) Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net === START OF ENABLE/DISABLE COMMANDS SECTION === SMART Enabled. tudor@milhouse ~ $ sudo smartctl -a /dev/sda smartctl 5.42 2011-10-20 r3458 [x86_64-linux-2.6.30-gentoo-r4] (local build) Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Model Family: Western Digital Caviar Green (Adv. Format) Device Model: WDC WD10EARX-00PASB0 Serial Number: WD-WCAZA9117964 Firmware Version: 0953 User Capacity: 1,000,204,886,016 bytes [1.00 TB] Sector Size: 512 bytes logical/physical Device is: In smartctl database [for details use: -P show] ATA Version is: 7 ATA Standard is: Exact ATA specification draft version not indicated Local Time is: Fri Jun 21 13:08:15 2013 EST SMART support is: Available - device has SMART capability. SMART support is: Disabled SMART Disabled. Use option -s with argument 'on' to enable it. $ $ sudo smartctl -a /dev/sdb smartctl 5.42 2011-10-20 r3458 [x86_64-linux-2.6.30-gentoo-r4] (local build) Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Model Family: Seagate Barracuda 7200.12 Device Model: ST31000528AS Serial Number: 9VP0G6BC Firmware Version: 0953 User Capacity: 1,000,204,886,016 bytes [1.00 TB] Sector Size: 512 bytes logical/physical Device is: In smartctl database [for details use: -P show] ATA Version is: 7 ATA Standard is: Exact ATA specification draft version not indicated Local Time is: Fri Jun 21 13:09:30 2013 EST SMART support is: Available - device has SMART capability. SMART support is: Disabled SMART Disabled. Use option -s with argument 'on' to enable it. tudor@milhouse ~ $ sudo smartctl -s on /dev/sdb smartctl 5.42 2011-10-20 r3458 [x86_64-linux-2.6.30-gentoo-r4] (local build) Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net === START OF ENABLE/DISABLE COMMANDS SECTION === SMART Enabled. $ sudo smartctl -a /dev/sdb smartctl 5.42 2011-10-20 r3458 [x86_64-linux-2.6.30-gentoo-r4] (local build) Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Model Family: Seagate Barracuda 7200.12 Device Model: ST31000528AS Serial Number: 9VP0G6BC Firmware Version: 0953 User Capacity: 1,000,204,886,016 bytes [1.00 TB] Sector Size: 512 bytes logical/physical Device is: In smartctl database [for details use: -P show] ATA Version is: 7 ATA Standard is: Exact ATA specification draft version not indicated Local Time is: Fri Jun 21 13:09:40 2013 EST SMART support is: Available - device has SMART capability. SMART support is: Disabled SMART Disabled. Use option -s with argument 'on' to enable it. $ $ sudo smartctl -a /dev/sdc smartctl 5.42 2011-10-20 r3458 [x86_64-linux-2.6.30-gentoo-r4] (local build) Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Model Family: Western Digital Caviar Green Device Model: WDC WD10EADS-00M2B0 Serial Number: WD-WMAV50015842 LU WWN Device Id: 5 0014ee 0abe59f70 Firmware Version: 01.00A01 User Capacity: 1,000,204,886,016 bytes [1.00 TB] Sector Size: 512 bytes logical/physical Device is: In smartctl database [for details use: -P show] ATA Version is: 8 ATA Standard is: Exact ATA specification draft version not indicated Local Time is: Fri Jun 21 13:10:33 2013 EST SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x84) Offline data collection activity was suspended by an interrupting command from host. Auto Offline Data Collection: Enabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (19980) seconds. Offline data collection capabilities: (0x7b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 230) minutes. Conveyance self-test routine recommended polling time: ( 5) minutes. SCT capabilities: (0x303f) SCT Status supported. SCT Error Recovery Control supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 13 3 Spin_Up_Time 0x0027 125 115 021 Pre-fail Always - 6741 4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 37 5 Reallocated_Sector_Ct 0x0033 195 195 140 Pre-fail Always - 35 7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0 9 Power_On_Hours 0x0032 056 056 000 Old_age Always - 32425 10 Spin_Retry_Count 0x0032 100 253 000 Old_age Always - 0 11 Calibration_Retry_Count 0x0032 100 253 000 Old_age Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 35 192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 26 193 Load_Cycle_Count 0x0032 097 097 000 Old_age Always - 311894 194 Temperature_Celsius 0x0022 119 098 000 Old_age Always - 28 196 Reallocated_Event_Count 0x0032 193 193 000 Old_age Always - 7 197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0 200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 0 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 No self-tests have been logged. [To run self-tests, use: smartctl -t] SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. $ sudo lspci 00:00.0 Host bridge: Intel Corporation X58 I/O Hub to ESI Port (rev 13) 00:01.0 PCI bridge: Intel Corporation 5520/5500/X58 I/O Hub PCI Express Root Port 1 (rev 13) 00:03.0 PCI bridge: Intel Corporation 5520/5500/X58 I/O Hub PCI Express Root Port 3 (rev 13) 00:05.0 PCI bridge: Intel Corporation 5520/X58 I/O Hub PCI Express Root Port 5 (rev 13) 00:07.0 PCI bridge: Intel Corporation 5520/5500/X58 I/O Hub PCI Express Root Port 7 (rev 13) 00:09.0 PCI bridge: Intel Corporation 5520/5500/X58 I/O Hub PCI Express Root Port 9 (rev 13) 00:10.0 PIC: Intel Corporation 5520/5500/X58 Physical and Link Layer Registers Port 0 (rev 13) 00:10.1 PIC: Intel Corporation 5520/5500/X58 Routing and Protocol Layer Registers Port 0 (rev 13) 00:11.0 PIC: Intel Corporation 5520/5500 Physical and Link Layer Registers Port 1 (rev 13) 00:11.1 PIC: Intel Corporation 5520/5500 Routing & Protocol Layer Register Port 1 (rev 13) 00:13.0 PIC: Intel Corporation 5520/5500/X58 I/O Hub I/OxAPIC Interrupt Controller (rev 13) 00:14.0 PIC: Intel Corporation 5520/5500/X58 I/O Hub System Management Registers (rev 13) 00:14.1 PIC: Intel Corporation 5520/5500/X58 I/O Hub GPIO and Scratch Pad Registers (rev 13) 00:14.2 PIC: Intel Corporation 5520/5500/X58 I/O Hub Control Status and RAS Registers (rev 13) 00:15.0 PIC: Intel Corporation 5520/5500/X58 Trusted Execution Technology Registers (rev 13) 00:1a.0 USB Controller: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #4 00:1a.1 USB Controller: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #5 00:1a.2 USB Controller: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #6 00:1a.7 USB Controller: Intel Corporation 82801JI (ICH10 Family) USB2 EHCI Controller #2 00:1b.0 Audio device: Intel Corporation 82801JI (ICH10 Family) HD Audio Controller 00:1c.0 PCI bridge: Intel Corporation 82801JI (ICH10 Family) PCI Express Port 1 00:1c.1 PCI bridge: Intel Corporation 82801JI (ICH10 Family) PCI Express Port 2 00:1c.4 PCI bridge: Intel Corporation 82801JI (ICH10 Family) PCI Express Port 5 00:1c.5 PCI bridge: Intel Corporation 82801JI (ICH10 Family) PCI Express Port 6 00:1d.0 USB Controller: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #1 00:1d.1 USB Controller: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #2 00:1d.2 USB Controller: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #3 00:1d.7 USB Controller: Intel Corporation 82801JI (ICH10 Family) USB2 EHCI Controller #1 00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev 90) 00:1f.0 ISA bridge: Intel Corporation 82801JIR (ICH10R) LPC Interface Controller 00:1f.2 IDE interface: Intel Corporation 82801JI (ICH10 Family) 4 port SATA IDE Controller 00:1f.3 SMBus: Intel Corporation 82801JI (ICH10 Family) SMBus Controller 00:1f.5 IDE interface: Intel Corporation 82801JI (ICH10 Family) 2 port SATA IDE Controller 01:00.0 Ethernet controller: Intel Corporation 82574L Gigabit Network Connection 04:00.0 Ethernet controller: Intel Corporation 82574L Gigabit Network Connection 05:00.0 VGA compatible controller: nVidia Corporation G98 [GeForce 8400 GS] (rev a1) 07:00.0 SATA controller: JMicron Technology Corp. 20360/20363 Serial ATA Controller (rev 02) 07:00.1 IDE interface: JMicron Technology Corp. 20360/20363 Serial ATA Controller (rev 02) 08:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168B PCI Express Gigabit Ethernet controller (rev 03) 09:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168B PCI Express Gigabit Ethernet controller (rev 03) 0a:01.0 Communication controller: NetMos Technology PCI 9835 Multi-I/O Controller (rev 01) $ $ |
From: Christian F. <Chr...@t-...> - 2013-06-21 10:24:38
|
Tudor Holton wrote: > Hi, > > All of my drives claim to enable smart on request using -s on and return > "SMART Enabled.", however, afterwards, only one of them actually has > enabled smart. > > Can you see why? Could this be a drive/controller card failure, or is > it something more obvious that I'm doing wrong? > > I've tried with -T permissive but that didn't work either. > ... > $ sudo smartctl -a /dev/sda > smartctl 5.42 2011-10-20 r3458 [x86_64-linux-2.6.30-gentoo-r4] (local build) > ... > SMART support is: Disabled > > SMART Disabled. Use option -s with argument 'on' to enable it. The following should work to skip the SMART enabled check: smartctl -s on -a /dev/sda Option -T permissive could also be used for this purpose since smartctl 6.0. Do other non-data ATA commands work for an affected disk? For example try changing APM with: smartctl -s apm,N /dev/sda and check the result with: smartctl -g apm /dev/sda (requires smartctl >= 5.43) > ... > $ sudo lspci > ... > 00:1f.2 IDE interface: Intel Corporation 82801JI (ICH10 Family) 4 port > SATA IDE Controller > ... > 00:1f.5 IDE interface: Intel Corporation 82801JI (ICH10 Family) 2 port > SATA IDE Controller > ... > 07:00.0 SATA controller: JMicron Technology Corp. 20360/20363 Serial ATA > Controller (rev 02) > 07:00.1 IDE interface: JMicron Technology Corp. 20360/20363 Serial ATA > Controller (rev 02) > Which controller is used for which disk? Is there any correlation between "controller" and "SMART enable works/fails" ? Does SMART enable work for an affected disk if connected to a different controller or machine? Thanks, Christian |
From: Tudor H. <tu...@sm...> - 2013-06-25 02:18:12
|
Hi, Thanks for the help. Sorry it's taken so long to reply. This is actually a well-used server, so I had to allocate some downtime to look at it properly. On 21/06/13 20:24, Christian Franke wrote: > Tudor Holton wrote: >> Hi, >> >> All of my drives claim to enable smart on request using -s on and return >> "SMART Enabled.", however, afterwards, only one of them actually has >> enabled smart. >> >> Can you see why? Could this be a drive/controller card failure, or is >> it something more obvious that I'm doing wrong? >> >> I've tried with -T permissive but that didn't work either. >> ... >> $ sudo smartctl -a /dev/sda >> smartctl 5.42 2011-10-20 r3458 [x86_64-linux-2.6.30-gentoo-r4] (local build) >> ... >> SMART support is: Disabled >> >> SMART Disabled. Use option -s with argument 'on' to enable it. > The following should work to skip the SMART enabled check: smartctl -s > on -a /dev/sda > Option -T permissive could also be used for this purpose since smartctl 6.0. Yeah. As mentioned, I tried -T permissive but it didn't change anything. Mind you, my version is before 6.0 and it didn't complain about the option being wrong, which seems a little odd. > Do other non-data ATA commands work for an affected disk? > For example try changing APM with: smartctl -s apm,N /dev/sda > and check the result with: smartctl -g apm /dev/sda > (requires smartctl >= 5.43) Yeah. My version of Gentoo only has 5.42 in the repository. (This also seems odd, since Gentoo's packaging is generally closer to bleeding-edge.) >> ... >> $ sudo lspci >> ... >> 00:1f.2 IDE interface: Intel Corporation 82801JI (ICH10 Family) 4 port >> SATA IDE Controller >> ... >> 00:1f.5 IDE interface: Intel Corporation 82801JI (ICH10 Family) 2 port >> SATA IDE Controller >> ... >> 07:00.0 SATA controller: JMicron Technology Corp. 20360/20363 Serial ATA >> Controller (rev 02) >> 07:00.1 IDE interface: JMicron Technology Corp. 20360/20363 Serial ATA >> Controller (rev 02) >> > Which controller is used for which disk? All appear to be on the same controller, since they're all SATA. To recap from the previous email, sda & sdb failed, where sdc succeeded. $ ls -la /dev/disk/by-path/ | grep -v part total 0 drwxr-xr-x 2 root root 460 Jun 21 12:49 . drwxr-xr-x 5 root root 100 Jun 21 12:49 .. lrwxrwxrwx 1 root root 9 Jun 21 12:49 pci-0000:00:1f.5-scsi-1:0:0:0 -> ../../sdd lrwxrwxrwx 1 root root 9 Jun 21 12:49 pci-0000:07:00.0-scsi-0:0:0:0 -> ../../sr0 lrwxrwxrwx 1 root root 9 Jun 21 12:49 pci-0000:07:00.0-scsi-0:1:0:0 -> ../../sda lrwxrwxrwx 1 root root 9 Jun 21 12:49 pci-0000:07:00.0-scsi-1:0:0:0 -> ../../sdb lrwxrwxrwx 1 root root 9 Jun 21 12:49 pci-0000:07:00.0-scsi-1:1:0:0 -> ../../sdc > Is there any correlation between "controller" and "SMART enable > works/fails" ? Unfortunately not, since they are all on the same controller. > Does SMART enable work for an affected disk if connected to a different > controller or machine? I removed the drives and tested them all on my desktop machine and smartctl -a worked on all of them. Subsequently, I discovered that not all the lines on the controller were working since it didn't pick up all the disks all the time. After some shuffling I managed to get them all running again with this configuration: $ ls -la /dev/disk/by-path/ | grep -v part total 0 drwxr-xr-x 2 root root 460 Jun 24 22:03 . drwxr-xr-x 5 root root 100 Jun 24 22:03 .. lrwxrwxrwx 1 root root 9 Jun 24 22:03 pci-0000:00:1f.2-scsi-0:0:0:0 -> ../../sr0 lrwxrwxrwx 1 root root 9 Jun 24 22:03 pci-0000:00:1f.2-scsi-0:0:1:0 -> ../../sdb lrwxrwxrwx 1 root root 9 Jun 24 22:03 pci-0000:00:1f.2-scsi-1:0:1:0 -> ../../sdc lrwxrwxrwx 1 root root 9 Jun 24 22:03 pci-0000:00:1f.5-scsi-1:0:0:0 -> ../../sdd lrwxrwxrwx 1 root root 9 Jun 24 22:03 pci-0000:07:00.0-scsi-0:0:0:0 -> ../../sda Surprise surprise, now sda fails whereas sdb, sdc, and sdd all succeed. Therefore, I conclude that the JMicron Technology Corp. 20360/20363 Serial ATA Controller is either faulty or doesn't handle smart properly by design. Have you come across this chipset before? Cheers, Tudor. |
From: Christian F. <Chr...@t-...> - 2013-06-25 19:56:29
|
Tudor Holton wrote: > On 21/06/13 20:24, Christian Franke wrote: >> Tudor Holton wrote: >>> ... >>> SMART support is: Disabled >>> >>> SMART Disabled. Use option -s with argument 'on' to enable it. >> The following should work to skip the SMART enabled check: smartctl -s >> on -a /dev/sda >> Option -T permissive could also be used for this purpose since smartctl 6.0. > Yeah. As mentioned, I tried -T permissive but it didn't change > anything. Mind you, my version is before 6.0 and it didn't complain > about the option being wrong, which seems a little odd. The option itself is supported since the early days of smartmontools (~5.1.1, 2003). But it does not skip this specific check before 6.0. >> Do other non-data ATA commands work for an affected disk? >> For example try changing APM with: smartctl -s apm,N /dev/sda >> and check the result with: smartctl -g apm /dev/sda >> (requires smartctl >= 5.43) > Yeah. My version of Gentoo only has 5.42 in the repository. (This also > seems odd, since Gentoo's packaging is generally closer to bleeding-edge.) For some (unknown) reason, newer versions are still tagged as "unstable": http://packages.gentoo.org/package/sys-apps/smartmontools > ... > I removed the drives and tested them all on my desktop machine and > smartctl -a worked on all of them. Subsequently, I discovered that not > all the lines on the controller were working since it didn't pick up all > the disks all the time. After some shuffling I managed to get them all > running again with this configuration: > $ ls -la /dev/disk/by-path/ | grep -v part > total 0 > drwxr-xr-x 2 root root 460 Jun 24 22:03 . > drwxr-xr-x 5 root root 100 Jun 24 22:03 .. > lrwxrwxrwx 1 root root 9 Jun 24 22:03 pci-0000:00:1f.2-scsi-0:0:0:0 -> > ../../sr0 > lrwxrwxrwx 1 root root 9 Jun 24 22:03 pci-0000:00:1f.2-scsi-0:0:1:0 -> > ../../sdb > lrwxrwxrwx 1 root root 9 Jun 24 22:03 pci-0000:00:1f.2-scsi-1:0:1:0 -> > ../../sdc > lrwxrwxrwx 1 root root 9 Jun 24 22:03 pci-0000:00:1f.5-scsi-1:0:0:0 -> > ../../sdd > lrwxrwxrwx 1 root root 9 Jun 24 22:03 pci-0000:07:00.0-scsi-0:0:0:0 -> > ../../sda > > > Surprise surprise, now sda fails whereas sdb, sdc, and sdd all succeed. > > Therefore, I conclude that the JMicron Technology Corp. 20360/20363 > Serial ATA Controller is either faulty or doesn't handle smart properly > by design. Likely a bug in the ATA pass-through implementation of the Linux driver for this chipset. Does e.g. "hdparm -B ..." work to change APM of any affected device? > Have you come across this chipset before? No, I don't remember any (positive or negative) reports regarding this chipset. Thanks, Christian |