Thread: [smartmontools-support] exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen

From: Jonas P. <jon...@xm...> - 2008-08-30 09:23:38

[ I wasn't a list member when I first submitted this, and moderation
seems to be somewhat slow, so I am resending it now ]

Hi experts,

I've tried searching every archive I can find, and my preliminary
conclusion is that my drive must have an odd fault, but I'd love to hear
your confirmation.

On my MacBookPro 4,1 (penryn) - which is less than a month old - I
started to have strange random hangs after a couple of weeks. It often
happens after about 5 minutes of idle time (either after resume or
reboot), but also long after that when a burst of disk activity starts
(say an apt-get install). My point is that I never see this problem just
after boot, nor at the point of resuming from suspend-to-RAM. Suspending
to RAM seems unrelated, as the hang is just as likely to happen over the
weekend after a fresh boot on Friday night with no suspending at all.

When I'm lucky, things revert to normal after a while; when I'm unlucky,
the disk is marked read-only and all I can do is reboot.

Here is a snippet from this morning - this time it came back to life:

[46874.898690] ata3.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
[46874.898703] ata3.00: cmd c8/00:08:90:3c:59/00:00:00:00:00/ef tag 0 dma 4096 in
[46874.898705]          res 40/00:01:01:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout)
[46874.898709] ata3.00: status: { DRDY }
[46879.643962] ata3: port is slow to respond, please be patient (Status 0xd0)
[46884.473195] ata3: device not ready (errno=-16), forcing hardreset
[46884.473202] ata3: soft resetting link
[46912.740010] ata3.00: qc timeout (cmd 0xec)
[46912.740020] ata3.00: failed to IDENTIFY (I/O error, err_mask=0x4)
[46912.740023] ata3.00: revalidation failed (errno=-5)
[46912.740028] ata3: failed to recover some devices, retrying in 5 secs
[46917.458070] ata3: soft resetting link
[46917.636464] ata3.00: configured for UDMA/100
[46917.636482] ata3: EH complete
[46917.699224] sd 2:0:0:0: [sda] 488397168 512-byte hardware sectors (250059 MB)
[46917.699257] sd 2:0:0:0: [sda] Write Protect is off
[46917.699263] sd 2:0:0:0: [sda] Mode Sense: 00 3a 00 00
[46917.699300] sd 2:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
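
For reference, the "cmd" and "res" lines in the log above can be decoded
by hand. Below is a minimal Python sketch, assuming the field order that
libata prints (command/feature:count:lbal:lbam:lbah, then the five
high-order bytes, then the device register) and a 28-bit LBA with a
non-zero sector count. The helper names (status_names, parse_taskfile,
decode_cmd) are made up for illustration, and the opcode table only
covers the two commands seen in this log.

```python
import re

# ATA status register bits (DSC is obsolete in later ATA revisions).
STATUS_BITS = [
    (0x80, "BSY"), (0x40, "DRDY"), (0x20, "DF"), (0x10, "DSC"),
    (0x08, "DRQ"), (0x04, "CORR"), (0x02, "IDX"), (0x01, "ERR"),
]

# Only the opcodes that occur in the log above; not a complete table.
COMMANDS = {0xC8: "READ DMA", 0xEC: "IDENTIFY DEVICE"}

# Matches the slash/colon separated hex bytes after "cmd" or "res".
TASKFILE = re.compile(r"(?:cmd|res) ([0-9a-f]{2}(?:[/:][0-9a-f]{2})+)")


def status_names(status):
    """Return the names of the bits set in an ATA status byte."""
    return [name for bit, name in STATUS_BITS if status & bit]


def parse_taskfile(line):
    """Extract the taskfile bytes from a libata cmd/res log line."""
    match = TASKFILE.search(line)
    if match is None:
        raise ValueError("no cmd/res taskfile in line")
    return [int(byte, 16) for byte in re.split(r"[/:]", match.group(1))]


def decode_cmd(line):
    """Decode a 'cmd' line, assuming LBA28 addressing (assumption)."""
    tf = parse_taskfile(line)
    command, _feature, count, lbal, lbam, lbah = tf[:6]
    device = tf[-1]
    # In LBA28 the low nibble of the device register holds LBA bits 27:24.
    lba = ((device & 0x0F) << 24) | (lbah << 16) | (lbam << 8) | lbal
    return {
        "command": COMMANDS.get(command, hex(command)),
        "sectors": count,
        "lba": lba,
    }
```

Under those assumptions, the failing command is a READ DMA of 8 sectors
(8 x 512 = 4096 bytes, matching "dma 4096 in"), the result status 0x40 is
DRDY alone, and the later "Status 0xd0" has BSY set as well.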


I've seen some references suggesting that this COULD be a problem with
smartmontools before 5.38 (though with no explanation of how), but I am
using 5.38-1ubuntu1~hardy1, and in fact I initially didn't have smartd
running, so I doubt that is the problem.

My drive is a:
Device Model:     FUJITSU MHY2250BH
Serial Number:    K43CT862GMK9

I have run self tests that seem OK:

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Conveyance offline  Completed without error       00%       837         -
# 2  Extended offline    Completed without error       00%       827         -
# 3  Short offline       Completed without error       00%       825         -
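
A self-test log like the one above can also be checked by a script
rather than by eye. This is only a sketch: the function names are
hypothetical, and the regular expression assumes the column layout shown
here (it tolerates the final column being wrapped onto the next line by
the mail client).

```python
import re

# One match per "# N" row. \s+ also spans a line break, so a wrapped
# LBA_of_first_error column is still picked up.
ENTRY = re.compile(
    r"#\s*(\d+)\s+(.+?)\s{2,}(.+?)\s+(\d+)%\s+(\d+)\s+(\S+)"
)


def parse_selftest_log(text):
    """Return one dict per self-test entry found in the pasted text."""
    entries = []
    for num, desc, status, remaining, hours, lba in ENTRY.findall(text):
        entries.append({
            "num": int(num),
            "description": desc,
            "status": status,
            "remaining_percent": int(remaining),
            "lifetime_hours": int(hours),
            "lba_of_first_error": None if lba == "-" else lba,
        })
    return entries


def all_passed(entries):
    """True only if there is at least one entry and none reports an error."""
    return bool(entries) and all(
        e["status"] == "Completed without error" for e in entries
    )
```

Passing the text of the log to parse_selftest_log() and the result to
all_passed() gives a single yes/no answer for the whole log.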

I have a very similar drive (just smaller) in a somewhat older
MacBookPro running the same installation, and it works perfectly. I have
tried both the most recent standard kernels (2.6.24-*-generic) and a
self-compiled 2.6.26 without finding any more clues.

So I guess I will have to replace it anyway - unless you have any ideas
for me.

			Thanks in advance / Jonas
