SnapRAID 8 - Specific WD 2TB Drives Showing 100% Chance of Failure

Help
2015-04-28
2015-04-29
  • Gautam Desai

    Gautam Desai - 2015-04-28

    First of all, thanks Andrea for another fabulous release. It's like Christmas every few months (vs. many days of multi-year agony with Unraid). Anyway, to my question.

    I ran the "snapraid smart" command and got the following results, which are disturbing and also highly suspect to me, since all of the affected drives are 2TB WD drives. Could something odd be affecting the SMART results so that they show such a high probability of failure? I'll run a manual SMART test on each of those drives, but I'm pretty sure there weren't any pre-fail flags or warnings on any of them as recently as last week...

    SnapRAID SMART report:
    
       Temp  Power   Error   FP Size
          C OnDays   Count        TB  Serial           Device    Disk
     -----------------------------------------------------------------------
         32    982       0   5%  4.0  PL1311LAG23ZSA   /dev/sdm  d1
         37   1103       0   5%  4.0  PL1311LAG0G6UA   /dev/sdl  d2
         31    122       0   5%  4.0  PL2321LAG2MM1J   /dev/sdk  d3
         34    148       0  SSD  0.5  14290D9157BD     /dev/sdj  d4
         32    916       0   5%  4.0  PL1311LAG2333A   /dev/sdq  d5
         32    879       0   5%  4.0  PL1311LAG28YKH   /dev/sdp  d6
         31    514       0   5%  4.0  PL1311LAG1YGMA   /dev/sdo  d7
         29   1533       0   5%  2.0  JK1101YAJW251V   /dev/sdn  d8
         30   1609       0 100%  2.0  WD-WCAZA1003081  /dev/sdi  d9
         31    146       0   5%  4.0  PL1311LAG1SWLA   /dev/sdh  d10
         31   1219       0   5%  3.0  MJ1323YNG1HJ0C   /dev/sdg  d11
         30    150       0   5%  4.0  PL1311LAG24AKA   /dev/sdf  d12
         30    147       0   5%  4.0  PL1331LAGUX15H   /dev/sdu  d13
         29   1610       0 100%  2.0  WD-WCAZA1002058  /dev/sdt  d14
         30   1594       0  98%  2.0  WD-WCAZA1030094  /dev/sds  d15
         30   1620       0 100%  2.0  WD-WCAZA0617370  /dev/sdr  d16
         30    146       0   5%  4.0  PL1311LAG2XA1A   /dev/sdy  d17
         29   1405       0 100%  2.0  WD-WMAZA3706377  /dev/sdx  d18
         30    147       0   5%  4.0  PL1331LAGUXDKH   /dev/sdw  d19
         29   1614       0 100%  2.0  WD-WCAZA1014816  /dev/sdv  d20
         30   1313       0   5%  3.0  MJ1323YNG1J2KC   /dev/sdb  d21
         30    953       0   5%  4.0  PL1311LAG0G1UA   /dev/sdc  parity
         30   1116       0   5%  4.0  PL2310LAG0JU6C   /dev/sde  2-parity
         29    146       0   5%  6.0  WD-WX31D944CSRN  /dev/sdd  3-parity
    
          -      -       -  n/a    -  -                /dev/fd0  -
          -      -       -  SSD  0.1  -                /dev/sda  -
          -      -       -  n/a    -  -                /dev/sr0  -
    
    The FP column is the estimated probability (in percentage) that the disk
    is going to fail in the next year.
    
    Probability that at least one disk is going to fail in the next year is 100%.
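
For anyone wondering how the last line of the report relates to the FP column, here is a minimal sketch. It assumes the per-disk estimates are treated as independent and combined as 1 - prod(1 - p); the thread does not say that this is exactly what SnapRAID does, so treat the formula as an assumption. The function name `prob_any_failure` is made up for illustration.

```python
from math import prod

def prob_any_failure(fp_percent):
    """Probability (0..1) that at least one disk fails within the year.

    Assumes the per-disk failure estimates are independent (an assumption,
    not something confirmed in this thread).
    """
    return 1.0 - prod(1.0 - p / 100.0 for p in fp_percent)

# FP values (percent) copied from the report above; the SSD rows carry
# no estimate and are left out.
reported = [5, 5, 5, 5, 5, 5, 5, 100, 5, 5, 5, 5,
            100, 98, 100, 5, 100, 5, 100, 5, 5, 5, 5]

# Same array with the six WD 2TB drives removed.
without_wd_2tb = [p for p in reported if p == 5]

print(f"all disks:      {prob_any_failure(reported):.0%}")
print(f"without WD 2TB: {prob_any_failure(without_wd_2tb):.0%}")
```

A single disk at 100% forces the combined figure to 100% no matter what the others report. Even with only the seventeen 5% disks, the combined figure under this assumption is still roughly 58%, simply because there are so many drives.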
    
     
  • Quaraxkad

    Quaraxkad - 2015-04-29

    Post the full table of SMART attributes for each of the drives.

     
    • Gautam Desai

      Gautam Desai - 2015-04-29

      Thanks for looking into this. These are all the WD drives showing a 100% chance of failure; one of them, /dev/sds, is at 98%...

      Ughhh. I just saw it. The Raw Read Error rate and Spin Up Time both exceed thresholds... Maybe that's why?
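
A note on reading the attribute tables below: in smartctl output an attribute counts as failed when its normalized VALUE drops to or below THRESH, so a VALUE of 200 against a THRESH of 051 is healthy, not over the limit. The sketch below parses a few rows and applies that rule. The helper names are hypothetical, the sample rows are copied from the /dev/sdr output further down, and the choice of raw counters to watch (IDs 5, 197, 198) is my own assumption. It does not try to reproduce SnapRAID's FP estimate.

```python
def parse_attributes(text):
    """Parse rows of the smartctl 'Attributes with Thresholds' table."""
    rows = []
    for line in text.splitlines():
        parts = line.split()
        # Attribute rows have at least 10 columns and start with a numeric ID.
        if len(parts) < 10 or not parts[0].isdigit():
            continue
        try:
            rows.append({
                "id": int(parts[0]),
                "name": parts[1],
                "value": int(parts[3]),
                "thresh": int(parts[5]),
                "raw": int(parts[9]),
            })
        except ValueError:
            continue  # not an attribute row after all
    return rows

def tripped(rows):
    """Attributes whose normalized VALUE is at or below a non-zero THRESH."""
    return [r["name"] for r in rows
            if r["thresh"] > 0 and r["value"] <= r["thresh"]]

def nonzero_raw(rows, watch_ids=(5, 197, 198)):
    """Watched sector counters with a non-zero raw value (IDs are an assumption)."""
    return [r["name"] for r in rows if r["id"] in watch_ids and r["raw"] > 0]

sample = """
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0027   171   170   021    Pre-fail  Always       -       6433
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       4
198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       1
"""

rows = parse_attributes(sample)
print("tripped thresholds:", tripped(rows))
print("non-zero sector counters:", nonzero_raw(rows))
```

On these rows nothing has tripped a threshold, which matches the PASSED overall-health result, while the pending and uncorrectable sector counters on /dev/sdr are non-zero and worth watching.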

      user@tower:~$ sudo smartctl -a /dev/sdi
      smartctl 6.2 2013-07-26 r3841 [x86_64-linux-3.13.0-49-generic] (local build)
      Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org
      
      === START OF INFORMATION SECTION ===
      Model Family:     Western Digital Caviar Green (AF)
      Device Model:     WDC WD20EARS-00MVWB0
      Serial Number:    WD-WCAZA1003081
      LU WWN Device Id: 5 0014ee 2afa68ee5
      Firmware Version: 51.0AB51
      User Capacity:    2,000,398,934,016 bytes [2.00 TB]
      Sector Size:      512 bytes logical/physical
      Device is:        In smartctl database [for details use: -P show]
      ATA Version is:   ATA8-ACS (minor revision not indicated)
      SATA Version is:  SATA 2.6, 3.0 Gb/s
      Local Time is:    Tue Apr 28 21:37:31 2015 PDT
      SMART support is: Available - device has SMART capability.
      SMART support is: Enabled
      
      === START OF READ SMART DATA SECTION ===
      SMART overall-health self-assessment test result: PASSED
      
      General SMART Values:
      Offline data collection status:  (0x82) Offline data collection activity
                                              was completed without error.
                                              Auto Offline Data Collection: Enabled.
      Self-test execution status:      ( 241) Self-test routine in progress...
                                              10% of test remaining.
      Total time to complete Offline
      data collection:                (37980) seconds.
      Offline data collection
      capabilities:                    (0x7b) SMART execute Offline immediate.
                                              Auto Offline data collection on/off support.
                                              Suspend Offline collection upon new
                                              command.
                                              Offline surface scan supported.
                                              Self-test supported.
                                              Conveyance Self-test supported.
                                              Selective Self-test supported.
      SMART capabilities:            (0x0003) Saves SMART data before entering
                                              power-saving mode.
                                              Supports SMART auto save timer.
      Error logging capability:        (0x01) Error logging supported.
                                              General Purpose Logging supported.
      Short self-test routine
      recommended polling time:        (   2) minutes.
      Extended self-test routine
      recommended polling time:        ( 366) minutes.
      Conveyance self-test routine
      recommended polling time:        (   5) minutes.
      SCT capabilities:              (0x3035) SCT Status supported.
                                              SCT Feature Control supported.
                                              SCT Data Table supported.
      
      SMART Attributes Data Structure revision number: 16
      Vendor Specific SMART Attributes with Thresholds:
      ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
        1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
        3 Spin_Up_Time            0x0027   170   168   021    Pre-fail  Always       -       6475
        4 Start_Stop_Count        0x0032   095   095   000    Old_age   Always       -       5750
        5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
        7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
        9 Power_On_Hours          0x0032   048   048   000    Old_age   Always       -       38641
       10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
       11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0
       12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       206
      192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       58
      193 Load_Cycle_Count        0x0032   103   103   000    Old_age   Always       -       293356
      194 Temperature_Celsius     0x0022   118   100   000    Old_age   Always       -       32
      196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
      197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
      198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       0
      199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
      200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0
      
      SMART Error Log Version: 1
      No Errors Logged
      
      SMART Self-test log structure revision number 1
      No self-tests have been logged.  [To run self-tests, use: smartctl -t]
      
      
      SMART Selective self-test log data structure revision number 1
       SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
          1        0        0  Not_testing
          2        0        0  Not_testing
          3        0        0  Not_testing
          4        0        0  Not_testing
          5        0        0  Not_testing
      Selective self-test flags (0x0):
        After scanning selected spans, do NOT read-scan remainder of disk.
      If Selective self-test is pending on power-up, resume after 0 minute delay.
      
      user@tower:~$ sudo smartctl -a /dev/sdt
      smartctl 6.2 2013-07-26 r3841 [x86_64-linux-3.13.0-49-generic] (local build)
      Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org
      
      === START OF INFORMATION SECTION ===
      Model Family:     Western Digital Caviar Green (AF)
      Device Model:     WDC WD20EARS-00MVWB0
      Serial Number:    WD-WCAZA1002058
      LU WWN Device Id: 5 0014ee 2afa69114
      Firmware Version: 51.0AB51
      User Capacity:    2,000,398,934,016 bytes [2.00 TB]
      Sector Size:      512 bytes logical/physical
      Device is:        In smartctl database [for details use: -P show]
      ATA Version is:   ATA8-ACS (minor revision not indicated)
      SATA Version is:  SATA 2.6, 3.0 Gb/s
      Local Time is:    Tue Apr 28 21:38:25 2015 PDT
      SMART support is: Available - device has SMART capability.
      SMART support is: Enabled
      
      === START OF READ SMART DATA SECTION ===
      SMART overall-health self-assessment test result: PASSED
      
      General SMART Values:
      Offline data collection status:  (0x84) Offline data collection activity
                                              was suspended by an interrupting command from host.
                                              Auto Offline Data Collection: Enabled.
      Self-test execution status:      (   0) The previous self-test routine completed
                                              without error or no self-test has ever
                                              been run.
      Total time to complete Offline
      data collection:                (37500) seconds.
      Offline data collection
      capabilities:                    (0x7b) SMART execute Offline immediate.
                                              Auto Offline data collection on/off support.
                                              Suspend Offline collection upon new
                                              command.
                                              Offline surface scan supported.
                                              Self-test supported.
                                              Conveyance Self-test supported.
                                              Selective Self-test supported.
      SMART capabilities:            (0x0003) Saves SMART data before entering
                                              power-saving mode.
                                              Supports SMART auto save timer.
      Error logging capability:        (0x01) Error logging supported.
                                              General Purpose Logging supported.
      Short self-test routine
      recommended polling time:        (   2) minutes.
      Extended self-test routine
      recommended polling time:        ( 361) minutes.
      Conveyance self-test routine
      recommended polling time:        (   5) minutes.
      SCT capabilities:              (0x3035) SCT Status supported.
                                              SCT Feature Control supported.
                                              SCT Data Table supported.
      
      SMART Attributes Data Structure revision number: 16
      Vendor Specific SMART Attributes with Thresholds:
      ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
        1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
        3 Spin_Up_Time            0x0027   174   172   021    Pre-fail  Always       -       6283
        4 Start_Stop_Count        0x0032   094   094   000    Old_age   Always       -       6468
        5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
        7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
        9 Power_On_Hours          0x0032   048   048   000    Old_age   Always       -       38647
       10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
       11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0
       12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       205
      192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       59
      193 Load_Cycle_Count        0x0032   083   083   000    Old_age   Always       -       353434
      194 Temperature_Celsius     0x0022   119   098   000    Old_age   Always       -       31
      196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
      197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
      198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       0
      199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
      200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0
      
      SMART Error Log Version: 1
      No Errors Logged
      
      SMART Self-test log structure revision number 1
      Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
      # 1  Short offline       Completed without error       00%     38647         -
      
      SMART Selective self-test log data structure revision number 1
       SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
          1        0        0  Not_testing
          2        0        0  Not_testing
          3        0        0  Not_testing
          4        0        0  Not_testing
          5        0        0  Not_testing
      Selective self-test flags (0x0):
        After scanning selected spans, do NOT read-scan remainder of disk.
      If Selective self-test is pending on power-up, resume after 0 minute delay.
      
      user@tower:~$ sudo smartctl -a /dev/sds
      smartctl 6.2 2013-07-26 r3841 [x86_64-linux-3.13.0-49-generic] (local build)
      Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org
      
      === START OF INFORMATION SECTION ===
      Model Family:     Western Digital Caviar Green (AF)
      Device Model:     WDC WD20EARS-00MVWB0
      Serial Number:    WD-WCAZA1030094
      LU WWN Device Id: 5 0014ee 25a50be91
      Firmware Version: 51.0AB51
      User Capacity:    2,000,398,934,016 bytes [2.00 TB]
      Sector Size:      512 bytes logical/physical
      Device is:        In smartctl database [for details use: -P show]
      ATA Version is:   ATA8-ACS (minor revision not indicated)
      SATA Version is:  SATA 2.6, 3.0 Gb/s
      Local Time is:    Tue Apr 28 21:38:48 2015 PDT
      SMART support is: Available - device has SMART capability.
      SMART support is: Enabled
      
      === START OF READ SMART DATA SECTION ===
      SMART overall-health self-assessment test result: PASSED
      
      General SMART Values:
      Offline data collection status:  (0x84) Offline data collection activity
                                              was suspended by an interrupting command from host.
                                              Auto Offline Data Collection: Enabled.
      Self-test execution status:      (   0) The previous self-test routine completed
                                              without error or no self-test has ever
                                              been run.
      Total time to complete Offline
      data collection:                (37560) seconds.
      Offline data collection
      capabilities:                    (0x7b) SMART execute Offline immediate.
                                              Auto Offline data collection on/off support.
                                              Suspend Offline collection upon new
                                              command.
                                              Offline surface scan supported.
                                              Self-test supported.
                                              Conveyance Self-test supported.
                                              Selective Self-test supported.
      SMART capabilities:            (0x0003) Saves SMART data before entering
                                              power-saving mode.
                                              Supports SMART auto save timer.
      Error logging capability:        (0x01) Error logging supported.
                                              General Purpose Logging supported.
      Short self-test routine
      recommended polling time:        (   2) minutes.
      Extended self-test routine
      recommended polling time:        ( 362) minutes.
      Conveyance self-test routine
      recommended polling time:        (   5) minutes.
      SCT capabilities:              (0x3035) SCT Status supported.
                                              SCT Feature Control supported.
                                              SCT Data Table supported.
      
      SMART Attributes Data Structure revision number: 16
      Vendor Specific SMART Attributes with Thresholds:
      ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
        1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
        3 Spin_Up_Time            0x0027   168   167   021    Pre-fail  Always       -       6575
        4 Start_Stop_Count        0x0032   096   096   000    Old_age   Always       -       4363
        5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
        7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
        9 Power_On_Hours          0x0032   048   048   000    Old_age   Always       -       38285
       10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
       11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0
       12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       195
      192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       48
      193 Load_Cycle_Count        0x0032   131   131   000    Old_age   Always       -       209081
      194 Temperature_Celsius     0x0022   119   097   000    Old_age   Always       -       31
      196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
      197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
      198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       0
      199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
      200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0
      
      SMART Error Log Version: 1
      No Errors Logged
      
      SMART Self-test log structure revision number 1
      Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
      # 1  Short offline       Completed without error       00%     38285         -
      
      SMART Selective self-test log data structure revision number 1
       SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
          1        0        0  Not_testing
          2        0        0  Not_testing
          3        0        0  Not_testing
          4        0        0  Not_testing
          5        0        0  Not_testing
      Selective self-test flags (0x0):
        After scanning selected spans, do NOT read-scan remainder of disk.
      If Selective self-test is pending on power-up, resume after 0 minute delay.
      
      user@tower:~$ sudo smartctl -a /dev/sdr
      smartctl 6.2 2013-07-26 r3841 [x86_64-linux-3.13.0-49-generic] (local build)
      Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org
      
      === START OF INFORMATION SECTION ===
      Model Family:     Western Digital Caviar Green (AF)
      Device Model:     WDC WD20EARS-00MVWB0
      Serial Number:    WD-WCAZA0617370
      LU WWN Device Id: 5 0014ee 2af91e231
      Firmware Version: 51.0AB51
      User Capacity:    2,000,398,934,016 bytes [2.00 TB]
      Sector Size:      512 bytes logical/physical
      Device is:        In smartctl database [for details use: -P show]
      ATA Version is:   ATA8-ACS (minor revision not indicated)
      SATA Version is:  SATA 2.6, 3.0 Gb/s
      Local Time is:    Tue Apr 28 21:38:51 2015 PDT
      SMART support is: Available - device has SMART capability.
      SMART support is: Enabled
      
      === START OF READ SMART DATA SECTION ===
      SMART overall-health self-assessment test result: PASSED
      
      General SMART Values:
      Offline data collection status:  (0x84) Offline data collection activity
                                              was suspended by an interrupting command from host.
                                              Auto Offline Data Collection: Enabled.
      Self-test execution status:      ( 121) The previous self-test completed having
                                              the read element of the test failed.
      Total time to complete Offline
      data collection:                (35460) seconds.
      Offline data collection
      capabilities:                    (0x7b) SMART execute Offline immediate.
                                              Auto Offline data collection on/off support.
                                              Suspend Offline collection upon new
                                              command.
                                              Offline surface scan supported.
                                              Self-test supported.
                                              Conveyance Self-test supported.
                                              Selective Self-test supported.
      SMART capabilities:            (0x0003) Saves SMART data before entering
                                              power-saving mode.
                                              Supports SMART auto save timer.
      Error logging capability:        (0x01) Error logging supported.
                                              General Purpose Logging supported.
      Short self-test routine
      recommended polling time:        (   2) minutes.
      Extended self-test routine
      recommended polling time:        ( 342) minutes.
      Conveyance self-test routine
      recommended polling time:        (   5) minutes.
      SCT capabilities:              (0x3035) SCT Status supported.
                                              SCT Feature Control supported.
                                              SCT Data Table supported.
      
      SMART Attributes Data Structure revision number: 16
      Vendor Specific SMART Attributes with Thresholds:
      ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
        1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
        3 Spin_Up_Time            0x0027   171   170   021    Pre-fail  Always       -       6433
        4 Start_Stop_Count        0x0032   094   094   000    Old_age   Always       -       6395
        5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
        7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
        9 Power_On_Hours          0x0032   047   047   000    Old_age   Always       -       38891
       10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
       11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0
       12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       229
      192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       69
      193 Load_Cycle_Count        0x0032   048   048   000    Old_age   Always       -       456433
      194 Temperature_Celsius     0x0022   119   101   000    Old_age   Always       -       31
      196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
      197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       4
      198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       1
      199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
      200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       2
      
      SMART Error Log Version: 1
      No Errors Logged
      
      SMART Self-test log structure revision number 1
      Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
      # 1  Short offline       Completed: read failure       90%     38891         3780937596
      # 2  Short offline       Completed without error       00%      6862         -
      
      SMART Selective self-test log data structure revision number 1
       SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
          1        0        0  Not_testing
          2        0        0  Not_testing
          3        0        0  Not_testing
          4        0        0  Not_testing
          5        0        0  Not_testing
      Selective self-test flags (0x0):
        After scanning selected spans, do NOT read-scan remainder of disk.
      If Selective self-test is pending on power-up, resume after 0 minute delay.
      
      user@tower:~$ sudo smartctl -a /dev/sdx
      smartctl 6.2 2013-07-26 r3841 [x86_64-linux-3.13.0-49-generic] (local build)
      Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org
      
      === START OF INFORMATION SECTION ===
      Model Family:     Western Digital Caviar Green (AF)
      Device Model:     WDC WD20EARS-00MVWB0
      Serial Number:    WD-WMAZA3706377
      LU WWN Device Id: 5 0014ee 0ad540bc4
      Firmware Version: 51.0AB51
      User Capacity:    2,000,398,934,016 bytes [2.00 TB]
      Sector Size:      512 bytes logical/physical
      Device is:        In smartctl database [for details use: -P show]
      ATA Version is:   ATA8-ACS (minor revision not indicated)
      SATA Version is:  SATA 2.6, 3.0 Gb/s
      Local Time is:    Tue Apr 28 21:39:04 2015 PDT
      SMART support is: Available - device has SMART capability.
      SMART support is: Enabled
      
      === START OF READ SMART DATA SECTION ===
      SMART overall-health self-assessment test result: PASSED
      
      General SMART Values:
      Offline data collection status:  (0x82) Offline data collection activity
                                              was completed without error.
                                              Auto Offline Data Collection: Enabled.
      Self-test execution status:      (   0) The previous self-test routine completed
                                              without error or no self-test has ever
                                              been run.
      Total time to complete Offline
      data collection:                (36480) seconds.
      Offline data collection
      capabilities:                    (0x7b) SMART execute Offline immediate.
                                              Auto Offline data collection on/off support.
                                              Suspend Offline collection upon new
                                              command.
                                              Offline surface scan supported.
                                              Self-test supported.
                                              Conveyance Self-test supported.
                                              Selective Self-test supported.
      SMART capabilities:            (0x0003) Saves SMART data before entering
                                              power-saving mode.
                                              Supports SMART auto save timer.
      Error logging capability:        (0x01) Error logging supported.
                                              General Purpose Logging supported.
      Short self-test routine
      recommended polling time:        (   2) minutes.
      Extended self-test routine
      recommended polling time:        ( 352) minutes.
      Conveyance self-test routine
      recommended polling time:        (   5) minutes.
      SCT capabilities:              (0x3035) SCT Status supported.
                                              SCT Feature Control supported.
                                              SCT Data Table supported.
      
      SMART Attributes Data Structure revision number: 16
      Vendor Specific SMART Attributes with Thresholds:
      ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
        1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
        3 Spin_Up_Time            0x0027   169   167   021    Pre-fail  Always       -       6541
        4 Start_Stop_Count        0x0032   096   096   000    Old_age   Always       -       4554
        5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
        7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
        9 Power_On_Hours          0x0032   054   054   000    Old_age   Always       -       33732
       10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
       11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0
       12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       151
      192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       39
      193 Load_Cycle_Count        0x0032   116   116   000    Old_age   Always       -       253141
      194 Temperature_Celsius     0x0022   119   098   000    Old_age   Always       -       31
      196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
      197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
      198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       0
      199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
      200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0
      
      SMART Error Log Version: 1
      No Errors Logged
      
      SMART Self-test log structure revision number 1
      Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
      # 1  Short offline       Completed without error       00%     33732         -
      
      SMART Selective self-test log data structure revision number 1
       SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
          1        0        0  Not_testing
          2        0        0  Not_testing
          3        0        0  Not_testing
          4        0        0  Not_testing
          5        0        0  Not_testing
      Selective self-test flags (0x0):
        After scanning selected spans, do NOT read-scan remainder of disk.
      If Selective self-test is pending on power-up, resume after 0 minute delay.
      
      user@tower:~$ sudo smartctl -a /dev/sdv
      smartctl 6.2 2013-07-26 r3841 [x86_64-linux-3.13.0-49-generic] (local build)
      Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org
      
      === START OF INFORMATION SECTION ===
      Model Family:     Western Digital Caviar Green (AF)
      Device Model:     WDC WD20EARS-00MVWB0
      Serial Number:    WD-WCAZA1014816
      LU WWN Device Id: 5 0014ee 25a50e598
      Firmware Version: 51.0AB51
      User Capacity:    2,000,398,934,016 bytes [2.00 TB]
      Sector Size:      512 bytes logical/physical
      Device is:        In smartctl database [for details use: -P show]
      ATA Version is:   ATA8-ACS (minor revision not indicated)
      SATA Version is:  SATA 2.6, 3.0 Gb/s
      Local Time is:    Tue Apr 28 21:39:17 2015 PDT
      SMART support is: Available - device has SMART capability.
      SMART support is: Enabled
      
      === START OF READ SMART DATA SECTION ===
      SMART overall-health self-assessment test result: PASSED
      
      General SMART Values:
      Offline data collection status:  (0x84) Offline data collection activity
                                              was suspended by an interrupting command from host.
                                              Auto Offline Data Collection: Enabled.
      Self-test execution status:      (   0) The previous self-test routine completed
                                              without error or no self-test has ever
                                              been run.
      Total time to complete Offline
      data collection:                (38580) seconds.
      Offline data collection
      capabilities:                    (0x7b) SMART execute Offline immediate.
                                              Auto Offline data collection on/off support.
                                              Suspend Offline collection upon new
                                              command.
                                              Offline surface scan supported.
                                              Self-test supported.
                                              Conveyance Self-test supported.
                                              Selective Self-test supported.
      SMART capabilities:            (0x0003) Saves SMART data before entering
                                              power-saving mode.
                                              Supports SMART auto save timer.
      Error logging capability:        (0x01) Error logging supported.
                                              General Purpose Logging supported.
      Short self-test routine
      recommended polling time:        (   2) minutes.
      Extended self-test routine
      recommended polling time:        ( 372) minutes.
      Conveyance self-test routine
      recommended polling time:        (   5) minutes.
      SCT capabilities:              (0x3035) SCT Status supported.
                                              SCT Feature Control supported.
                                              SCT Data Table supported.
      
      SMART Attributes Data Structure revision number: 16
      Vendor Specific SMART Attributes with Thresholds:
      ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
        1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
        3 Spin_Up_Time            0x0027   168   165   021    Pre-fail  Always       -       6558
        4 Start_Stop_Count        0x0032   095   095   000    Old_age   Always       -       5452
        5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
        7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
        9 Power_On_Hours          0x0032   047   047   000    Old_age   Always       -       38761
       10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
       11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0
       12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       219
      192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       58
      193 Load_Cycle_Count        0x0032   066   066   000    Old_age   Always       -       403095
      194 Temperature_Celsius     0x0022   120   103   000    Old_age   Always       -       30
      196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
      197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
      198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       0
      199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
      200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0
      
      SMART Error Log Version: 1
      No Errors Logged
      
      SMART Self-test log structure revision number 1
      Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
      # 1  Short offline       Completed without error       00%     38761         -
      
      SMART Selective self-test log data structure revision number 1
       SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
          1        0        0  Not_testing
          2        0        0  Not_testing
          3        0        0  Not_testing
          4        0        0  Not_testing
          5        0        0  Not_testing
      Selective self-test flags (0x0):
        After scanning selected spans, do NOT read-scan remainder of disk.
      If Selective self-test is pending on power-up, resume after 0 minute delay.
      
       

      Last edit: Gautam Desai 2015-04-29
  • xad

    xad - 2015-04-29

    Hmm, you have very high values on #193, "Load_Cycle_Count" (head parking).
    You might want to read up on "WD green load cycle" before using them in a NAS. (The same issue can affect "WD Red" drives.)

    /X

     
    • Gautam Desai

      Gautam Desai - 2015-04-29

      Question, is it really the same for the WD Red NAS drives? I have a 6TB as parity and it doesn't react to the idle3-tools command. Maybe the tool doesn't support WD Red's command set?

       
  • Jessie Taylor

    Jessie Taylor - 2015-04-29

    In addition to the very high load cycle count, those HDDs have been powered on for nearly 4.5 years.

    While I think 100% chance of failure in the next year is certainly a significant overestimate, I would still expect a very high failure rate on those HDDs.
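
    As a rough check of the "nearly 4.5 years" figure, the Power_On_Hours raw values from the two smartctl reports above can be converted to years. This is only a sketch: the helper name is made up for illustration, and it assumes the raw value is a plain hour count and ignores leap days.

```python
# Convert SMART Power_On_Hours (attribute 9, raw value) to years.
# Assumption: the raw value is a plain count of hours, as it appears
# to be in the reports posted above.
HOURS_PER_YEAR = 24 * 365  # leap days ignored


def power_on_years(power_on_hours: int) -> float:
    """Return the powered-on time in years (hypothetical helper)."""
    return power_on_hours / HOURS_PER_YEAR


# Raw values copied from the two attribute tables above.
for label, hours in [("/dev/sdv", 38761), ("drive listed before it", 33732)]:
    print(f"{label}: {power_on_years(hours):.2f} years powered on")
```

    With those numbers, the two drives come out at roughly 4.4 and 3.9 years.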

    By the way, you can use a DOS utility called WDIDLE3 to set the idle timer to 255 (rather than the default 8 seconds), or disable it entirely (my preference).

    There is also a Linux tool called idle3-tools that can do the same thing:

    http://idle3-tools.sourceforge.net/

    You could use that tool on your current drives, but most of the damage is already done. Be sure to use it early on any new WD Green drives that you get.

     

    Last edit: Jessie Taylor 2015-04-29
    • Gautam Desai

      Gautam Desai - 2015-04-29

      Super useful information! Thanks!

       
  • Jessie Taylor

    Jessie Taylor - 2015-04-29

    I don't think it is a problem for Reds. At least, I have only heard about it being an issue for Greens.

     
    • xad

      xad - 2015-04-29

      I had to fix my last 3TB Red (setting it to the maximum time). I'm not sure whether I did it with my others, but they have it set to "never". I think the default setting is not consistent, so better check. The easiest way is to read the "Load_Cycle_Count" counter, which should give you a good indication of what the setting is.
      NOTE! I have read that it is not recommended to set it to "never"/deactivated, so look it up before choosing that option.
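
      One way to read that counter without eyeballing the whole report is to pull the raw value out of the attribute table that smartctl prints. The following is a minimal sketch: the function name is hypothetical, and it assumes the ten-column table layout shown in the reports earlier in this thread (other drives or smartctl versions may format it differently).

```python
from typing import Optional


def raw_value(smart_output: str, attribute: str) -> Optional[int]:
    """Return the RAW_VALUE of a named attribute, or None if absent.

    Assumes the column layout seen in this thread:
    ID# NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
    """
    for line in smart_output.splitlines():
        fields = line.split()
        if len(fields) >= 10 and fields[1] == attribute:
            return int(fields[9])
    return None


# Two lines copied from the /dev/sdv report above.
SAMPLE = """\
  9 Power_On_Hours          0x0032   047   047   000    Old_age   Always       -       38761
193 Load_Cycle_Count        0x0032   066   066   000    Old_age   Always       -       403095
"""

print(raw_value(SAMPLE, "Load_Cycle_Count"))
```

      In practice the text would come from saving the smartctl report for the drive in question; comparing the counter on two different days shows whether the heads are still parking frequently.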

      For the Reds there is a different approved tool than "WDIDLE3". "WDIDLE3" is for the Greens and has been reported not to always work for the Reds (example of a thread discussing this issue: https://forums.freenas.org/index.php?threads/wd-utility-for-red-drives-with-high-load-cycle-counts.18095/)

      /X

       
      • Gautam Desai

        Gautam Desai - 2015-04-29

        Actually I checked my 6TB Red, which has been running for 147 days, and I was already at 477, so I think I was having the problem. I used idle3-tools, and though it wouldn't print the current value for that drive, I was able to disable the parking successfully with it. Glad it worked!

         
        • Jessie Taylor

          Jessie Taylor - 2015-04-29

          3 load cycles per day is not bad. The Greens with the problem typically accumulate more than 100 cycles per day.

          The Green drives are specified for more than 100,000 load cycles (I think I saw one specified at 200K or 300K).
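
          For comparison, the per-day rates can be worked out from the numbers posted in this thread. This is a sketch only: the function name is made up, and it assumes the drives were spinning for the whole power-on period, so the results are averages.

```python
def load_cycles_per_day(load_cycle_count: int, power_on_days: float) -> float:
    """Average head load/unload cycles per powered-on day (hypothetical helper)."""
    return load_cycle_count / power_on_days


# (Load_Cycle_Count, powered-on days) taken from the posts above.
# For the two Greens, days are derived from Power_On_Hours / 24.
drives = {
    "6TB Red": (477, 147),
    "Green /dev/sdv": (403095, 38761 / 24),
    "Green listed before it": (253141, 33732 / 24),
}

for name, (cycles, days) in drives.items():
    print(f"{name}: {load_cycles_per_day(cycles, days):.1f} cycles/day")
```

          On those figures the Red averages about 3 cycles per day, while the two Greens average roughly 250 and 180 cycles per day.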

           
