Hi,

The failing disk was my first SATA disk (sda), which I booted from.
I have replaced it and put the faulty one back as /dev/sdc.
What actually happened was that a 300 GB ATA disk (/dev/hda)
was breaking down in my machine:

Disks in my machine when the failure happened:

hda : Seagate ST3300622A 300 GB ATA-100
          Serial Number: 5NF21XQ8
sda : Seagate ST32000641AS 2TB SATA 6.0Gb/s
          Serial Number: 9WM13KYT
sdb : Seagate ST32000641AS 2TB SATA 6.0Gb/s
          Serial Number: 9WM13RCC

After replacing sda and adding the old sda as sdc:

sda : Seagate ST32000641AS 2TB SATA 6.0Gb/s
          Serial Number: 9WM54R31
sdb : Seagate ST32000641AS 2TB SATA 6.0Gb/s
          Serial Number: 9WM13RCC
sdc : Seagate ST32000641AS 2TB SATA 6.0Gb/s
          Serial Number: 9WM13KYT

I ran a couple of self-tests on sdc, with unclear results:

[hubble:root]:(~)# smartctl -l selftest /dev/sdc
smartctl 6.2 2013-07-26 r3841 [x86_64-linux-2.6.32.12] (local build)
Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF READ SMART DATA SECTION ===
SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed: unknown failure    90%     15556         0
# 2  Short offline       Completed: unknown failure    90%     15521         0

[hubble:root]:(~)# smartctl -l selftest /dev/sdc
smartctl 6.2 2013-07-26 r3841 [x86_64-linux-2.6.32.12] (local build)
Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF READ SMART DATA SECTION ===
SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed: unknown failure    90%     15556         0
# 2  Short offline       Completed: unknown failure    90%     15521         0

[hubble:root]:(~)# man smartctl        
[hubble:root]:(~)# smartctl -t short /dev/sdc
smartctl 6.2 2013-07-26 r3841 [x86_64-linux-2.6.32.12] (local build)
Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF OFFLINE IMMEDIATE AND SELF-TEST SECTION ===
Sending command: "Execute SMART Short self-test routine immediately in off-line mode".
Drive command "Execute SMART Short self-test routine immediately in off-line mode" successful.
Testing has begun.
Please wait 1 minutes for test to complete.
Test will complete after Wed Oct 23 12:41:04 2013

Use smartctl -X to abort test.
[hubble:root]:(~)# smartctl -l selftest /dev/sdc
smartctl 6.2 2013-07-26 r3841 [x86_64-linux-2.6.32.12] (local build)
Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF READ SMART DATA SECTION ===
SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed: unknown failure    90%     15576         0
# 2  Extended offline    Completed: unknown failure    90%     15556         0
# 3  Short offline       Completed: unknown failure    90%     15521         0

[hubble:root]:(~)# smartctl -x /dev/sdc
smartctl 6.2 2013-07-26 r3841 [x86_64-linux-2.6.32.12] (local build)
Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Seagate Barracuda XT
Device Model:     ST32000641AS
Serial Number:    9WM13KYT
LU WWN Device Id: 5 000c50 0269f88b4
Firmware Version: CC13
User Capacity:    2,000,398,934,016 bytes [2.00 TB]
Sector Size:      512 bytes logical/physical
Rotation Rate:    7200 rpm
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ATA8-ACS T13/1699-D revision 4
SATA Version is:  SATA 2.6, 6.0 Gb/s
Local Time is:    Wed Oct 23 13:07:18 2013 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
AAM feature is:   Unavailable
APM feature is:   Unavailable
Rd look-ahead is: Enabled
Write cache is:   Enabled
ATA Security is:  Disabled, NOT FROZEN [SEC1]
Wt Cache Reorder: Unavailable

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: FAILED!
Drive failure expected in less than 24 hours. SAVE ALL DATA.
See vendor-specific Attribute list for failed Attributes.

General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity
                                        was completed without error.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (  73) The previous self-test completed having
                                        a test element that failed and the test
                                        element that failed is not known.
Total time to complete Offline
data collection:                (  609) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        ( 344) minutes.
Conveyance self-test routine
recommended polling time:        (   2) minutes.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAGS    VALUE WORST THRESH FAIL RAW_VALUE
  1 Raw_Read_Error_Rate     POSR--   118   090   006    -    188112650
  3 Spin_Up_Time            PO----   100   100   000    -    0
  4 Start_Stop_Count        -O--CK   100   100   020    -    72
  5 Reallocated_Sector_Ct   PO--CK   029   029   036    NOW  2909
  7 Seek_Error_Rate         POSR--   078   060   030    -    64860461
  9 Power_On_Hours          -O--CK   083   083   000    -    15576
 10 Spin_Retry_Count        PO--C-   100   100   097    -    0
 12 Power_Cycle_Count       -O--CK   100   100   020    -    81
183 Runtime_Bad_Block       -O--CK   100   100   000    -    0
184 End-to-End_Error        -O--CK   100   100   099    -    0
187 Reported_Uncorrect      -O--CK   001   001   000    -    1008
188 Command_Timeout         -O--CK   100   098   000    -    21475164165
189 High_Fly_Writes         -O-RCK   001   001   000    -    248
190 Airflow_Temperature_Cel -O---K   066   053   045    -    34 (Min/Max 27/36)
191 G-Sense_Error_Rate      -O--CK   100   100   000    -    1
192 Power-Off_Retract_Count -O--CK   100   100   000    -    71
193 Load_Cycle_Count        -O--CK   100   100   000    -    85
194 Temperature_Celsius     -O---K   034   047   000    -    34 (0 19 0 0 0)
195 Hardware_ECC_Recovered  -O-RC-   035   018   000    -    188112650
197 Current_Pending_Sector  -O--C-   100   094   000    -    0
198 Offline_Uncorrectable   ----C-   100   094   000    -    0
199 UDMA_CRC_Error_Count    -OSRCK   200   200   000    -    0
240 Head_Flying_Hours       ------   100   253   000    -    241883968191857
241 Total_LBAs_Written      ------   100   253   000    -    2725068191
242 Total_LBAs_Read         ------   100   253   000    -    3189000094
                            ||||||_ K auto-keep
                            |||||__ C event count
                            ||||___ R error rate
                            |||____ S speed/performance
                            ||_____ O updated online
                            |______ P prefailure warning

General Purpose Log Directory Version 1
SMART           Log Directory Version 1 [multi-sector log support]
Address    Access  R/W   Size  Description
0x00       GPL,SL  R/O      1  Log Directory
0x01       GPL,SL  R/O      1  Summary SMART error log
0x02       GPL,SL  R/O      5  Comprehensive SMART error log
0x03       GPL     R/O      5  Ext. Comprehensive SMART error log
0x06       GPL,SL  R/O      1  SMART self-test log
0x07       GPL     R/O      1  Extended self-test log
0x09       GPL,SL  R/W      1  Selective self-test log
0x10       GPL     R/O      1  NCQ Command Error log
0x11       GPL     R/O      1  SATA Phy Event Counters
0x21       GPL     R/O      1  Write stream error log
0x22       GPL     R/O      1  Read stream error log
0x80-0x9f  GPL,SL  R/W     16  Host vendor specific log
0xa1       GPL,SL  VS      20  Device vendor specific log
0xa2       GPL     VS    2248  Device vendor specific log
0xa8       GPL,SL  VS     129  Device vendor specific log
0xa9       GPL,SL  VS       1  Device vendor specific log
0xb0       GPL     VS    2928  Device vendor specific log
0xbd       GPL     VS     252  Device vendor specific log
0xbe-0xbf  GPL     VS   65535  Device vendor specific log
0xc0       GPL,SL  VS       1  Device vendor specific log
0xe0       GPL,SL  R/W      1  SCT Command/Status
0xe1       GPL,SL  R/W      1  SCT Data Transfer

SMART Extended Comprehensive Error Log Version: 1 (5 sectors)
Device Error Count: 1132 (device log contains only the most recent 20 errors)
        CR     = Command Register
        FEATR  = Features Register
        COUNT  = Count (was: Sector Count) Register
        LBA_48 = Upper bytes of LBA High/Mid/Low Registers ]  ATA-8
        LH     = LBA High (was: Cylinder High) Register    ]   LBA
        LM     = LBA Mid (was: Cylinder Low) Register      ] Register
        LL     = LBA Low (was: Sector Number) Register     ]
        DV     = Device (was: Device/Head) Register
        DC     = Device Control Register
        ER     = Error register
        ST     = Status register
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 1132 [11] occurred at disk power-on lifetime: 2532 hours (105 days + 12 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 00 73 00 db de 00 00  Error: UNC at LBA = 0x7300dbde = 1929436126

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  25 03 03 00 40 00 00 73 00 db ba 40 00     02:25:34.340  READ DMA EXT
  00 00 00 00 00 00 00 00 00 00 00 00 04     02:25:34.251  NOP [Abort queued commands]
  00 00 00 00 00 00 00 00 00 00 00 00 04     02:25:34.178  NOP [Abort queued commands]
  25 03 03 00 40 00 00 73 00 db ba 40 00     02:25:31.221  READ DMA EXT
  00 00 00 00 00 00 00 00 00 00 00 00 04     02:25:30.514  NOP [Abort queued commands]

Error 1131 [10] occurred at disk power-on lifetime: 2532 hours (105 days + 12 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 00 73 00 db de 00 00  Error: UNC at LBA = 0x7300dbde = 1929436126

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  25 03 03 00 40 00 00 73 00 db ba 40 00     02:25:31.221  READ DMA EXT
  00 00 00 00 00 00 00 00 00 00 00 00 04     02:25:30.514  NOP [Abort queued commands]
  25 03 03 00 40 00 00 73 00 db ba 40 00     02:25:27.435  READ DMA EXT
  00 00 00 00 00 00 00 00 00 00 00 00 04     02:25:27.346  NOP [Abort queued commands]
  00 00 00 00 00 00 00 00 00 00 00 00 04     02:25:27.274  NOP [Abort queued commands]

Error 1130 [9] occurred at disk power-on lifetime: 2532 hours (105 days + 12 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 00 73 00 db de 00 00  Error: UNC at LBA = 0x7300dbde = 1929436126

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  25 03 03 00 40 00 00 73 00 db ba 40 00     02:25:27.435  READ DMA EXT
  00 00 00 00 00 00 00 00 00 00 00 00 04     02:25:27.346  NOP [Abort queued commands]
  00 00 00 00 00 00 00 00 00 00 00 00 04     02:25:27.274  NOP [Abort queued commands]
  25 03 03 00 40 00 00 73 00 db ba 40 00     02:25:24.281  READ DMA EXT
  25 03 03 00 40 00 00 73 00 db 7a 40 00     02:25:24.279  READ DMA EXT

Error 1129 [8] occurred at disk power-on lifetime: 2532 hours (105 days + 12 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 00 73 00 db de 00 00  Error: UNC at LBA = 0x7300dbde = 1929436126

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  25 03 03 00 40 00 00 73 00 db ba 40 00     02:25:24.281  READ DMA EXT
  25 03 03 00 40 00 00 73 00 db 7a 40 00     02:25:24.279  READ DMA EXT
  25 03 03 00 40 00 00 73 00 db 3a 40 00     02:25:24.277  READ DMA EXT
  25 03 03 00 40 00 00 72 00 db fa 40 00     02:25:24.276  READ DMA EXT
  25 03 03 00 40 00 00 72 00 db ba 40 00     02:25:24.274  READ DMA EXT

Error 1128 [7] occurred at disk power-on lifetime: 2532 hours (105 days + 12 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 00 6b 00 db 35 00 00  Error: UNC at LBA = 0x6b00db35 = 1795218229

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  25 03 03 00 40 00 00 6a 00 db fa 40 00     02:25:18.679  READ DMA EXT
  25 03 03 00 40 00 00 6a 00 db ba 40 00     02:25:18.676  READ DMA EXT
  25 03 03 00 40 00 00 6a 00 db 7a 40 00     02:25:18.675  READ DMA EXT
  25 03 03 00 40 00 00 6a 00 db 3a 40 00     02:25:18.674  READ DMA EXT
  25 03 03 00 40 00 00 69 00 db fa 40 00     02:25:18.672  READ DMA EXT

Error 1127 [6] occurred at disk power-on lifetime: 2532 hours (105 days + 12 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 00 58 00 db ad 00 00  Error: UNC at LBA = 0x5800dbad = 1476451245

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  25 03 03 00 08 00 00 58 00 db aa 40 00     02:25:14.693  READ DMA EXT
  00 00 00 00 00 00 00 00 00 00 00 00 04     02:25:14.600  NOP [Abort queued commands]
  00 00 00 00 00 00 00 00 00 00 00 00 04     02:25:14.528  NOP [Abort queued commands]
  25 03 03 00 08 00 00 58 00 db aa 40 00     02:25:11.544  READ DMA EXT
  00 00 00 00 00 00 00 00 00 00 00 00 04     02:25:10.837  NOP [Abort queued commands]

Error 1126 [5] occurred at disk power-on lifetime: 2532 hours (105 days + 12 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 00 58 00 db ad 00 00  Error: UNC at LBA = 0x5800dbad = 1476451245

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  25 03 03 00 08 00 00 58 00 db aa 40 00     02:25:11.544  READ DMA EXT
  00 00 00 00 00 00 00 00 00 00 00 00 04     02:25:10.837  NOP [Abort queued commands]
  25 03 03 00 08 00 00 58 00 db aa 40 00     02:25:07.788  READ DMA EXT
  00 00 00 00 00 00 00 00 00 00 00 00 04     02:25:07.696  NOP [Abort queued commands]
  00 00 00 00 00 00 00 00 00 00 00 00 04     02:25:07.623  NOP [Abort queued commands]

Error 1125 [4] occurred at disk power-on lifetime: 2532 hours (105 days + 12 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 00 58 00 db ad 00 00  Error: UNC at LBA = 0x5800dbad = 1476451245

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  25 03 03 00 08 00 00 58 00 db aa 40 00     02:25:07.788  READ DMA EXT
  00 00 00 00 00 00 00 00 00 00 00 00 04     02:25:07.696  NOP [Abort queued commands]
  00 00 00 00 00 00 00 00 00 00 00 00 04     02:25:07.623  NOP [Abort queued commands]
  25 03 03 00 08 00 00 58 00 db aa 40 00     02:25:04.579  READ DMA EXT
  00 00 00 00 00 00 00 00 00 00 00 00 04     02:25:03.875  NOP [Abort queued commands]

SMART Extended Self-test Log Version: 1 (1 sectors)
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed: unknown failure    90%     15576         0
# 2  Extended offline    Completed: unknown failure    90%     15556         0
# 3  Short offline       Completed: unknown failure    90%     15521         0

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

SCT Commands not supported

Device Statistics (GP Log 0x04) not supported

SATA Phy Event Counters (GP Log 0x11)
ID      Size     Value  Description
0x000a  2            9  Device-to-host register FISes sent due to a COMRESET
0x0001  2            0  Command failed due to ICRC error
0x0003  2            0  R_ERR response for device-to-host data FIS
0x0004  2            0  R_ERR response for host-to-device data FIS
0x0006  2            0  R_ERR response for device-to-host non-data FIS
0x0007  2            0  R_ERR response for host-to-device non-data FIS

[hubble:root]:(~)#
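
For anyone cross-checking the error log entries above: the LBA that smartctl prints after "Error: UNC at LBA =" appears to be the six LBA register bytes (the three LBA_48 bytes, then LH, LM, LL) joined in the order they are printed. The Python sketch below assumes exactly that layout, inferred from this output rather than from the ATA specification, and the function name lba_from_registers is made up for illustration.

```python
def lba_from_registers(lba_48, lh, lm, ll):
    """Join the register bytes into one LBA, most significant byte first.

    lba_48     -- the three upper bytes, in the order smartctl prints them
    lh, lm, ll -- LBA High, LBA Mid and LBA Low register bytes
    (Assumed layout: bytes are concatenated in printed order.)
    """
    lba = 0
    for byte in (*lba_48, lh, lm, ll):
        lba = (lba << 8) | byte
    return lba


# LBA bytes from the register dump of Error 1132: 00 00 73 00 db de
lba = lba_from_registers((0x00, 0x00, 0x73), 0x00, 0xDB, 0xDE)
print(hex(lba), lba)  # 0x7300dbde 1929436126
```

The result agrees with the decimal value smartctl reports for that entry, and the same holds for Error 1128 (00 00 6b 00 db 35).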

A long report of errors, but it is computer generated, so it may be useful to
the folks maintaining this piece of software. Should I replace sdc?
Maybe, but for non-critical stuff, sdc still offers a lot of space.
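
To make the "FAILED!" verdict easier to trace, here is a rough Python sketch that picks out attributes whose normalized VALUE is at or below THRESH, which is how the smartctl man page describes a failed attribute. The sample rows are copied from the table above. The regular expression and the function name failing_attributes are my own assumptions about this column layout and are not part of smartmontools.

```python
import re

# A few rows copied from the "smartctl -x /dev/sdc" attribute table.
SAMPLE = """\
  1 Raw_Read_Error_Rate     POSR--   118   090   006    -    188112650
  5 Reallocated_Sector_Ct   PO--CK   029   029   036    NOW  2909
187 Reported_Uncorrect      -O--CK   001   001   000    -    1008
197 Current_Pending_Sector  -O--C-   100   094   000    -    0
"""

# Columns: ID# ATTRIBUTE_NAME FLAGS VALUE WORST THRESH FAIL RAW_VALUE
ROW = re.compile(
    r"^\s*(?P<id>\d+)\s+(?P<name>\S+)\s+(?P<flags>[A-Z-]{6})\s+"
    r"(?P<value>\d+)\s+(?P<worst>\d+)\s+(?P<thresh>\d+)\s+"
    r"(?P<fail>\S+)\s+(?P<raw>.+)$"
)


def failing_attributes(text):
    """Return (id, name, value, thresh) for each row with VALUE <= THRESH."""
    failed = []
    for line in text.splitlines():
        m = ROW.match(line)
        if m is None:
            continue  # skip anything that is not an attribute row
        value, thresh = int(m["value"]), int(m["thresh"])
        if value <= thresh:
            failed.append((int(m["id"]), m["name"], value, thresh))
    return failed


print(failing_attributes(SAMPLE))
# [(5, 'Reallocated_Sector_Ct', 29, 36)]
```

On these rows only Reallocated_Sector_Ct trips its threshold (029 against 036), which matches the NOW entry in the FAIL column.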

Best Regards,

Robert
--
Robert M. Stockmann - RHCE
Network Engineer - UNIX/Linux Specialist
crashrecovery.org  stock@stokkie.net