From: <bug...@bu...> - 2010-07-13 09:25:02
|
https://bugzilla.kernel.org/show_bug.cgi?id=16376 Summary: random - possibly Radeon DRM KMS related - freezes Product: Drivers Version: 2.5 Platform: All OS/Version: Linux Tree: Mainline Status: NEW Severity: normal Priority: P1 Component: Video(DRI - non Intel) AssignedTo: dri...@ke... ReportedBy: Martin@Lichtvoll.de Regression: Yes Affected kernel versions: - 2.6.34-tp42-toi-3.1-04981-gb9a071a - 2.6.34.1-tp42-toi-3.1.1.1-04990-g3a7d1f4 Last kernel that worked: - 2.6.33.2-tp42-toi-3.1-lowmem-free-991-992-04964-gf00c7ec-dirty (including some patches to explicitely allow freeing lowmem pages on hibernation - I tested them for Nigel) Currently running kernel: - 2.6.33.6-tp42-toi-3.1.1.1-04982-g768d8a0 All from Nigel Cunnigham's TuxOnIce trees, but hangs happened before any hibernation cycle took place. So now I just report this although I do not have much information about the circumstances of these hangs. With 2.6.34 I had two and with 2.6.34.1 I had one sudden freeze of at least the desktop on my ThinkPad T42 with Radeon graphics. Mouse pointer just froze, Ctrl-Alt-F1 did nothing and AFAIR also there was no disk I/O anymore. I am not completely sure about the last one. Since the freezes happened in quite unpleasant circumstances I did not bother to start up a second machine in order to try to SSH into my T42. With 2.6.33.2 I did not experience those freezes. I used the tuxonice-2.6.34 tree from Nigel Cunningham, cause I prefer TuxOnIce over other hibernation methods. Since all of the freezes just happened after a fresh boot of the system without any snapshot cycle in between them, I believe this to be a mainline kernel bug. All freezes have been while running a KDE 4.4.4 desktop with OpenGL compositing enabled. The first two times just shortly after login in to the desktop. The third time while playing an AVI file from my photo SD card with Dragon Player. On the other hand I had hours of uptime with some TuxOnIce snapshot cycles without anything happening. I never had a freeze after the machine had done snapshot cycle. The freezes rather happened shortly after a fresh boot of the machine. I am not sure whether this is a Radeon DRM KMS related bug, but this is my best guess at the moment. Other activities involved in all three situations were: - USB. On the first two a M-Audio Sonica Theater was connected. On the third the kernel was reading the AVI file from a SD card connected via USB card reader. - eSATA harddisk. In all times an external 500 GB harddisk was connected via eSATA But also here I had the kernel running for hours with USB or eSATA without anything happening. In the further cause I will attach some hardware and software details that might be helpful. Currently I just downgraded to 2.6.33 again. I compiled me a 2.6.33.6. Since actually I really want some stability at least during the week where I hold a Linux training, I want to stick with it for now. Currently my plans are to wait for 2.6.34.2 or .3 and try again. The laptop is used for production work and I want it to meet some basic stability requirements. But if need be and I manage to take time for it, I may do some guided testing. -- Configure bugmail: https://bugzilla.kernel.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are watching the assignee of the bug. |
From: <bug...@bu...> - 2010-07-13 10:11:29
|
https://bugzilla.kernel.org/show_bug.cgi?id=16376 --- Comment #1 from Martin Steigerwald <Martin@Lichtvoll.de> 2010-07-13 10:11:15 --- Created an attachment (id=27083) --> (https://bugzilla.kernel.org/attachment.cgi?id=27083) hardware of my ThinkPad T42, lspci -nvv -- Configure bugmail: https://bugzilla.kernel.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are watching the assignee of the bug. |
From: <bug...@bu...> - 2010-07-13 10:15:13
|
https://bugzilla.kernel.org/show_bug.cgi?id=16376 --- Comment #2 from Martin Steigerwald <Martin@Lichtvoll.de> 2010-07-13 10:15:02 --- Userspace I use: apt-show-versions | egrep "(xserver-xorg/|xserver-xorg-core/|xserver-xorg-video-radeon/|libgl1-mesa-dri/|libdrm2/|libdrm-radeon1/|kde-window-manager/)" kde-window-manager/squeeze uptodate 4:4.4.4-1 libdrm-radeon1/experimental uptodate 2.4.21-1 libdrm2/experimental uptodate 2.4.21-1 libgl1-mesa-dri/experimental uptodate 7.8.2-1 xserver-xorg/squeeze uptodate 1:7.5+6 xserver-xorg-core/squeeze uptodate 2:1.7.7-2 xserver-xorg-video-radeon/sid uptodate 1:6.13.1-1 -- Configure bugmail: https://bugzilla.kernel.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are watching the assignee of the bug. |
From: <bug...@bu...> - 2010-07-13 10:24:15
|
https://bugzilla.kernel.org/show_bug.cgi?id=16376 --- Comment #3 from Martin Steigerwald <Martin@Lichtvoll.de> 2010-07-13 10:24:05 --- I did some research on the internet and found this possibly related case - hot, since it also happens on a ThinkPad T42 with similar if not same graphics hardware: ------------------------------------------------------ Random freezes with kernel 2.6.34 and xorg 1.8 The other day I upgraded the kernel to version 2.6.34 and at the same time xorg-server to 1.8 (along with input drivers and video drivers). From that moment, I suffer from random freezes, the system is completely locked up; the screen doesn't blank, though. The system appears to be perfectly fine, but after some minutes it will freeze. As far as I can remember, the freezes only occur when using a webbrowser. Chromium is my default browser, but also Firefox and Konqueror caused the system to freeze completely when I want to open a webpage. Other network programs such as irssi can run for hours and they didn't seem to cause any havoc. At first I thought this might be caused by some instability in the latest xf86-video-ati driver, so I downgraded back to xorg-server 1.7. Still the same symptoms, so I am pretty sure now there's something in the kernel. So I went back to xorg-server 1.8 and downgraded the kernel to 2.6.33. So far, the system hasn't let me down, yet. Also, I uninstalled the madwifi packages on my system so I'm using the ath5 drivers shipped with the kernel. That change doesn't seem to make a difference. I am not exactly sure what exactly could cause this, there's no trace in any log file to be found. My suspicion is that it's network related. The hardware is a Thinkpad T42 with an ATI Radeon Mobility 9600 (r300) chip and an Atheros wireless card. Anyone else with similar experiences with this hardware? I can't think of a way how to properly debug this. I know I can bisect, but that's time consuming and perhaps it's quicker to ask around first before I walk down that road. Any suggestions are welcome on how to track this down ------------------------------------------------------ http://bbs.archlinux.org/viewtopic.php?pid=789948 I do not use any wireless however, the ipw2200 radio is disabled here. Unlikely cause it seems to be easily reproducable: ------------------------------------------------------ seems to freeze on drm installation [2.6.34, 2× RV280] See attached screenshot. While I can boot anything up to 2.6.33, 2.6.34 seems to reproducibly hang itself up when it starts DRM. This machine has two Radeon RV280 cards. ------------------------------------------------------ http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=586137 -- Configure bugmail: https://bugzilla.kernel.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are watching the assignee of the bug. |
From: <bug...@bu...> - 2010-07-13 12:43:42
|
https://bugzilla.kernel.org/show_bug.cgi?id=16376 --- Comment #4 from Martin Steigerwald <Martin@Lichtvoll.de> 2010-07-13 12:43:23 --- While searching for traces of those freezes in syslog I came across backtraces that may or may not be releated. See bug #16377. There were prior to the third freeze. I did not find any backtraces shortly before that freeze. Will look for the earlier two freezes now. -- Configure bugmail: https://bugzilla.kernel.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are watching the assignee of the bug. |
From: <bug...@bu...> - 2010-07-13 13:55:12
|
https://bugzilla.kernel.org/show_bug.cgi?id=16376 Alex Deucher <ale...@gm...> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |ale...@gm... --- Comment #5 from Alex Deucher <ale...@gm...> 2010-07-13 13:32:15 --- Does s/r work ok without tuxonice? -- Configure bugmail: https://bugzilla.kernel.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are watching the assignee of the bug. |
From: <bug...@bu...> - 2010-07-13 15:12:20
|
https://bugzilla.kernel.org/show_bug.cgi?id=16376 --- Comment #6 from Martin Steigerwald <Martin@Lichtvoll.de> 2010-07-13 15:12:04 --- The bug does not appear to be suspend/resume related at all. The freezes did happen after a fresh boot, without any snapshot cycle in between. They never happened after the first TuxOnIce cycle which I guess is just a coincidence. So the machine had no snapshot cycles as the freeze occured. The only thing TuxOnIce does on a fresh boot is finding no TuxOnIce image and exiting, so I highly doubt the freezes are TuxOnIce related in any way. -- Configure bugmail: https://bugzilla.kernel.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are watching the assignee of the bug. |
From: <bug...@bu...> - 2010-07-13 15:14:56
|
https://bugzilla.kernel.org/show_bug.cgi?id=16376 --- Comment #7 from Alex Deucher <ale...@gm...> 2010-07-13 15:14:47 --- Does this happen with 2.6.35? -- Configure bugmail: https://bugzilla.kernel.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are watching the assignee of the bug. |
From: <bug...@bu...> - 2010-07-15 21:12:56
|
https://bugzilla.kernel.org/show_bug.cgi?id=16376 Rafael J. Wysocki <rj...@si...> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |mac...@gm..., | |rj...@si... Kernel Version| |2.6.34 Blocks| |15310 -- Configure bugmail: https://bugzilla.kernel.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are watching the assignee of the bug. |
From: <bug...@bu...> - 2010-07-16 18:15:50
|
https://bugzilla.kernel.org/show_bug.cgi?id=16376 --- Comment #8 from Martin Steigerwald <Martin@Lichtvoll.de> 2010-07-16 18:15:33 --- Yes, after giving up on an issue of Debians make-kpkg with Kernel 2.6.35 "+" sign in version number[1], I used make deb-pkg to compile 2.6.35-rc5-04995-g7441ae8 from TuxOnIce head. About 5 minutes after booting it, shortly after starting KDE 4 OpenGL composited desktop session, the machine froze hard. This time I tried accessing it from another machine, my second ThinkPad, a T23 with SuperSavage chipset. The frozen T42 did not answer to any ping, "destination host unreachable". Thus I conclude the kernel was completely locked up. Again it was a fresh boot, no hibernation cycle in between. The T23 also has a 2.6.34.1 which didn't yet lock up and has an uptime of 7 days, but only 3 TuxOnIce snapshot cycles. It appears to be stable. Thus I think it really could have to do with radeon KMS DRM code. [1] http://bugs.debian.org/588178 -- Configure bugmail: https://bugzilla.kernel.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are watching the assignee of the bug. |
From: <bug...@bu...> - 2010-07-16 19:18:24
|
https://bugzilla.kernel.org/show_bug.cgi?id=16376 --- Comment #9 from Alex Deucher <ale...@gm...> 2010-07-16 19:18:13 --- Any chance you could bisect this? -- Configure bugmail: https://bugzilla.kernel.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are watching the assignee of the bug. |
From: <bug...@bu...> - 2010-07-16 21:09:12
|
https://bugzilla.kernel.org/show_bug.cgi?id=16376 --- Comment #10 from Martin Steigerwald <Martin@Lichtvoll.de> 2010-07-16 21:08:55 --- Difficult, since it doesn't happen all the time, I have no clear pattern on reproducabilty. When it happened it often happened within 5 minutes after starting the desktop, but when playing the AVI videos from SD card it took longer. And sometimes it just didn't freeze at all. This also is the laptop for all my stuff, thus I have some stability requirements for it. I never did bisecting so far, but as far as I understand it usually includes testing about a dozen kernels and thus having a dozen freezes - quite some risk to loose unsaved data as I can't predict if and when it freezes. -- Configure bugmail: https://bugzilla.kernel.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are watching the assignee of the bug. |
From: <bug...@bu...> - 2010-07-16 21:15:22
|
https://bugzilla.kernel.org/show_bug.cgi?id=16376 --- Comment #11 from Martin Steigerwald <Martin@Lichtvoll.de> 2010-07-16 21:14:56 --- But there is no other way to get this one tracked? I thought about net console, but when the kernel doesn't even respond to a ping anymore, I guess it won't send out anything over the net. Maybe in the momemts before the freeze it sends anything? -- Configure bugmail: https://bugzilla.kernel.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are watching the assignee of the bug. |
From: <bug...@bu...> - 2010-07-23 14:55:44
|
https://bugzilla.kernel.org/show_bug.cgi?id=16376 --- Comment #12 from Alex Deucher <ale...@gm...> 2010-07-23 14:30:10 --- Unfortunately GPU freezes are really hard to track down without a reliable test case. -- Configure bugmail: https://bugzilla.kernel.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are watching the assignee of the bug. |
From: <bug...@bu...> - 2010-07-23 15:55:43
|
https://bugzilla.kernel.org/show_bug.cgi?id=16376 --- Comment #14 from Martin Steigerwald <Martin@Lichtvoll.de> 2010-07-23 15:19:29 --- Another hint: I ran 2.6.34.1 on a Radeon based Dell workstation at work, but without KMS and there the desktop never froze. It had a problem with shutting down one SoftRAID on hibernation so I downgraded there too (also reported). The 2.6.34.1 on my SuperSavage based T23 is behaving well currently. -- Configure bugmail: https://bugzilla.kernel.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are watching the assignee of the bug. |
From: <bug...@bu...> - 2010-07-23 15:57:31
|
https://bugzilla.kernel.org/show_bug.cgi?id=16376 --- Comment #13 from Martin Steigerwald <Martin@Lichtvoll.de> 2010-07-23 15:17:48 --- I don't have a reliable test case. It does not happen always, but when it happened, it usually happened quite short after the first boot. Unfortunately with bisecting that would mean booting a kernel three or five times and letting it run for about half an hour to be quite sure. An that for possibly a dozen kernels of which some might have other problems like eating my filesystems. Right now thats too much for me. I can test some selected kernels or patches as I manage to take time however. -- Configure bugmail: https://bugzilla.kernel.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are watching the assignee of the bug. |
From: <bug...@bu...> - 2010-07-25 18:51:51
|
https://bugzilla.kernel.org/show_bug.cgi?id=16376 Dragos Delcea <dra...@gm...> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |dra...@gm... --- Comment #15 from Dragos Delcea <dra...@gm...> 2010-07-25 18:51:35 --- I think I might have stumbled over this as well. Hw is a Lenovo T60 Thinkpad (radeon mobility X1400 video card). I'm on 32 bit gentoo, everything 2.6.33 and below works; 2.6.34 - in either gentoo or vanilla flavours - has random freezes. I'using 2.6.33.6 currently and I am happy, 2.6.34.1 and 2.6.34.2 (and other gentoo 2.6.34 specific kernels as well) freeze on me. Notably, I'm not using KMS yet, but I am following the radeon master git; not sure whether the freezes are graphic related, though. My use case that seems to trigger this is KVM; having 2 VMs running at once is usually enough. -- Configure bugmail: https://bugzilla.kernel.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are watching the assignee of the bug. |
From: <bug...@bu...> - 2010-07-26 07:08:43
|
https://bugzilla.kernel.org/show_bug.cgi?id=16376 --- Comment #16 from Dragos Delcea <dra...@gm...> 2010-07-26 07:08:27 --- It should have red: 2.6.33.x works while 2.6.34 and 2.6.34.1 freeze on me. There's no 2.6.34.2 (yet) available. -- Configure bugmail: https://bugzilla.kernel.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are watching the assignee of the bug. |
From: <bug...@bu...> - 2010-07-26 14:59:17
|
https://bugzilla.kernel.org/show_bug.cgi?id=16376 --- Comment #17 from Alex Deucher <ale...@gm...> 2010-07-26 14:59:05 --- (In reply to comment #16) > It should have red: 2.6.33.x works while 2.6.34 and 2.6.34.1 freeze on me. > There's no 2.6.34.2 (yet) available. Can you bisect to see what commit is causing the problem? -- Configure bugmail: https://bugzilla.kernel.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are watching the assignee of the bug. |
From: <bug...@bu...> - 2010-07-28 12:06:02
|
https://bugzilla.kernel.org/show_bug.cgi?id=16376 --- Comment #18 from Dragos Delcea <dra...@gm...> 2010-07-28 12:05:47 --- I can't reproduce it anymore with 2.6.35-rc6. I'm going to stay with the same kernel and switch to KMS and see if I can reproduce the reporter's problem. -- Configure bugmail: https://bugzilla.kernel.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are watching the assignee of the bug. |
From: <bug...@bu...> - 2010-07-28 12:51:38
|
https://bugzilla.kernel.org/show_bug.cgi?id=16376 --- Comment #19 from Martin Steigerwald <Martin@Lichtvoll.de> 2010-07-28 12:51:23 --- I started a compile of 2.6.35-rc6 and will test as well. Thanks. -- Configure bugmail: https://bugzilla.kernel.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are watching the assignee of the bug. |
From: <bug...@bu...> - 2010-07-28 14:33:01
|
https://bugzilla.kernel.org/show_bug.cgi?id=16376 --- Comment #20 from Alex Deucher <ale...@gm...> 2010-07-28 14:32:47 --- If rc6 is stable, can you bisect between rc5 and rc6 to see what fixed the issue? It should be a much smaller change set. -- Configure bugmail: https://bugzilla.kernel.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are watching the assignee of the bug. |
From: <bug...@bu...> - 2010-08-01 14:17:12
|
https://bugzilla.kernel.org/show_bug.cgi?id=16376 --- Comment #21 from Martin Steigerwald <Martin@Lichtvoll.de> 2010-07-28 17:33:33 --- 2.6.35-rc6 froze a few minutes after the second boot directly after opening an image in Gwenview from KDE 4.4.4. After the first boot it was stable for 25 minutes including playing some AVI movie from my digicam with Dragon Player - but well it seems to be randomly, I have no pattern to trigger it - only observation I made is if it doesn't happen in the first half hour after boot it likely doesn't happen anymore until I do another fresh boot: martin@shambhala:~> uprecords -m200 | egrep "(rc6|#)" # Uptime | System Boot up 127 0 days, 00:25:14 | Linux 2.6.35-rc6-tp42-to Wed Jul 28 18:46:25 2010 186 0 days, 00:00:53 | Linux 2.6.35-rc6-tp42-to Wed Jul 28 19:12:13 2010 --- Comment #22 from Rafael J. Wysocki <rj...@si...> 2010-08-01 14:16:56 --- Handled-By : Alex Deucher <ale...@gm...> -- Configure bugmail: https://bugzilla.kernel.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are watching the assignee of the bug. |
From: <bug...@bu...> - 2010-08-14 16:47:58
|
https://bugzilla.kernel.org/show_bug.cgi?id=16376 --- Comment #23 from Martin Steigerwald <Martin@Lichtvoll.de> 2010-08-14 16:47:40 --- This bug still happens with 2.6.35.2. This time it hung while compiling virtualbox-ose kernel modules and playing music with Amarok. The music stopped playing. I think I will have a try with 2.6.36-rc3 or 4 again. And if I manage to do it possibly at least try going back to some 2.6.34-rc's to see if the bug was introduced with some rc. I do not feel comfortable with bisecting at random versions, cause I don't know whether they might have been short lived ext4 filesystem corruption bugs or whatnot. -- Configure bugmail: https://bugzilla.kernel.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are watching the assignee of the bug. |
From: <bug...@bu...> - 2010-08-14 16:54:55
|
https://bugzilla.kernel.org/show_bug.cgi?id=16376 Martin Steigerwald <Martin@Lichtvoll.de> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |ai...@li... --- Comment #24 from Martin Steigerwald <Martin@Lichtvoll.de> 2010-08-14 16:54:43 --- Dave, I hope you do not mind adding you to the CC list, but since this happens on a ThinkPad T42 your feedback might help. I thought: From: Michel Dänzer <da...@vm...> commit e376573f7267390f4e1bdc552564b6fb913bce76 upstream. This fixes a problem where on low VRAM cards we'd run out of space for validation. [airlied: Tested on my M7, Thinkpad T42, compiz works with no problems.] Signed-off-by: Michel Dänzer <da...@vm...> Signed-off-by: Dave Airlie <ai...@re...> Signed-off-by: Greg Kroah-Hartman <gr...@su...> might have fixed this bug -- Configure bugmail: https://bugzilla.kernel.org/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are watching the assignee of the bug. |