From: Jonas P. <jon...@xm...> - 2008-08-30 09:23:38
[ I wasn't a member when I first submitted this, and moderation seems somewhat slow, so I am resending it now ]

Hi experts,

I've searched every archive I can find, and my preliminary conclusion is that my drive must have an odd fault, but I'd love to hear your confirmation.

On my MacBookPro 4,1 (Penryn), which is less than a month old, I started to have strange random hangs after a couple of weeks. It often happens after about 5 minutes of idle time (either after resume or reboot), but also long after that when a burst of disk activity starts (say an apt-get install). My point is that I never see this problem just after boot, nor at the point of resuming from suspend-to-RAM. Suspending to RAM seems unrelated, as the hang is just as likely to happen over the weekend after a fresh boot on Friday night with no suspending at all.

When I'm lucky, things revert to normal after a while; when I'm unlucky, the disk is marked read-only and all I can do is reboot.

Here is a snippet from this morning. This time it came back to life:

[46874.898690] ata3.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
[46874.898703] ata3.00: cmd c8/00:08:90:3c:59/00:00:00:00:00/ef tag 0 dma 4096 in
[46874.898705] res 40/00:01:01:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout)
[46874.898709] ata3.00: status: { DRDY }
[46879.643962] ata3: port is slow to respond, please be patient (Status 0xd0)
[46884.473195] ata3: device not ready (errno=-16), forcing hardreset
[46884.473202] ata3: soft resetting link
[46912.740010] ata3.00: qc timeout (cmd 0xec)
[46912.740020] ata3.00: failed to IDENTIFY (I/O error, err_mask=0x4)
[46912.740023] ata3.00: revalidation failed (errno=-5)
[46912.740028] ata3: failed to recover some devices, retrying in 5 secs
[46917.458070] ata3: soft resetting link
[46917.636464] ata3.00: configured for UDMA/100
[46917.636482] ata3: EH complete
[46917.699224] sd 2:0:0:0: [sda] 488397168 512-byte hardware sectors (250059 MB)
[46917.699257] sd 2:0:0:0: [sda] Write Protect is off
[46917.699263] sd 2:0:0:0: [sda] Mode Sense: 00 3a 00 00
[46917.699300] sd 2:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA

I've seen some references suggesting that this COULD be a problem with smartmontools before 5.38 (though with no explanation of how). However, I am using 5.38-1ubuntu1~hardy1, and initially I didn't have smartd running at all, so I doubt that could be the problem.

My drive is a:

Device Model:  FUJITSU MHY2250BH
Serial Number: K43CT862GMK9

I have run self-tests, and they seem OK:

SMART Self-test log structure revision number 1
Num  Test_Description    Status                   Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Conveyance offline  Completed without error  00%        837              -
# 2  Extended offline    Completed without error  00%        827              -
# 3  Short offline       Completed without error  00%        825              -

I have a very similar drive (just smaller) in a somewhat older MacBookPro running the same installation, and it works perfectly.

I have tried both the most recent standard kernels (2.6.24-*-generic) and a self-compiled 2.6.26 without finding any more clues.

So I guess I will have to replace the drive anyway, unless you have any ideas for me.

Thanks in advance
/ Jonas
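
For anyone comparing similar logs, here is a minimal sketch for measuring how long a stall like the one in the snippet above lasts, from the first "exception Emask" line to the matching "EH complete" line. It assumes the kernel log has been saved as plain text in the timestamped format shown. The function name `stall_durations` and the regular expression are illustrative only and do not come from any existing tool.

```python
import re

# Matches the "[seconds.micros] message" prefix used in the kernel log excerpt.
LINE_RE = re.compile(r"^\[\s*(\d+\.\d+)\]\s+(.*)$")


def stall_durations(lines):
    """Return a list of (start, end, seconds) tuples, one per span that
    begins with an 'exception Emask' line and ends with 'EH complete'."""
    spans = []
    start = None
    for line in lines:
        m = LINE_RE.match(line.strip())
        if not m:
            continue  # skip lines without a timestamp prefix
        ts, msg = float(m.group(1)), m.group(2)
        if start is None and "exception Emask" in msg:
            start = ts
        elif start is not None and "EH complete" in msg:
            spans.append((start, ts, ts - start))
            start = None
    return spans


# A few lines copied from the excerpt in this message.
SAMPLE = [
    "[46874.898690] ata3.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen",
    "[46884.473195] ata3: device not ready (errno=-16), forcing hardreset",
    "[46912.740010] ata3.00: qc timeout (cmd 0xec)",
    "[46917.636464] ata3.00: configured for UDMA/100",
    "[46917.636482] ata3: EH complete",
]

if __name__ == "__main__":
    for start, end, seconds in stall_durations(SAMPLE):
        print(f"stall from {start:.6f} to {end:.6f}: {seconds:.1f} s")
```

Feeding it a full saved log instead of `SAMPLE` would list every such stall, which may help show whether the hangs cluster around idle periods or bursts of disk activity.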