Hello,

I recently installed smartmontools on some of my servers on a test basis and have been verifying the output. It seems to be working fine except a few issues that I have noticed. I would like to know if anyone explain me about those issue. I'm listing below for your reference.

1. The lifetime on one drive does not seem to be showing correctly as it varies up and down significantly. Why is it showing like that?

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed without error       00%        40         -
# 2  Short offline       Aborted by host               80%        39         -
# 3  Extended offline    Completed without error       00%       965         -
# 4  Short offline       Aborted by host               80%       964         -
# 5  Short offline       Completed without error       00%       958         -
# 6  Short offline       Completed without error       00%       952         -
# 7  Short offline       Completed without error       00%       946         -
# 8  Extended offline    Completed without error       00%       944         -
# 9  Short offline       Completed without error       00%       940         -
#10  Short offline       Completed without error       00%       934         -
#11  Short offline       Completed without error       00%       928         -
#12  Extended offline    Completed without error       00%       797         -
#13  Extended offline    Completed without error       00%       629         -
#14  Extended offline    Completed without error       00%       462         -
#15  Extended offline    Completed without error       00%       294         -
#16  Extended offline    Completed without error       00%       126         -
#17  Extended offline    Completed without error       00%      1051         -
#18  Extended offline    Completed without error       00%       883         -
#19  Extended offline    Completed without error       00%       715         -
#20  Extended offline    Completed without error       00%       547         -
#21  Extended offline    Completed without error       00%       380         -


2. On another drive, short offline test failed.

Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed: read failure       90%       744         1755039

When I checked the smartctl output both Reallocated_Event_Count  and Current_Pending_Sector are being shown as zero.

196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0012   200   200   000    Old_age   Always       -       0

My question here is, if the drive has got any bad sectors, would not the above attributes change?

I could also see the following log in smartctl output.

Error 4 occurred at disk power-on lifetime: 568 hours (23 days + 16 hours)
  When the command that caused the error occurred, the device was doing SMART Offline or Self-test.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 08 dd ce 1a e0  Error:

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  00 00 25 00 00 08 00 00  29d+13:32:49.608  NOP [Abort queued commands]
  0d 00 5b 00 00 fb 20 00  29d+13:32:49.608  [RESERVED]
  00 00 25 00 00 08 00 00  29d+13:32:49.608  NOP [Abort queued commands]
  00 00 1a 00 00 b5 ce 00  29d+13:32:49.608  NOP [Abort queued commands]
  00 00 25 00 00 08 00 00  29d+13:32:49.608  NOP [Abort queued commands]

Could anyone tell me what does "NOP [Abort queued commands]" and "[RESERVED]" mean? Also I guess this is an old error since the time it logged was at 568 hours. Am I right here?

I appreciate if anyone can explain to my questions(They have been marked in red coulour). Thanks!

--
Deepak