https://bugzilla.kernel.org/show_bug.cgi?id=113341
Bug ID: 113341 Summary: GPU Lockup on AMD Kaveri Product: Drivers Version: 2.5 Kernel Version: 4.4.2 Hardware: x86-64 OS: Linux Tree: Mainline Status: NEW Severity: normal Priority: P1 Component: Video(DRI - non Intel) Assignee: drivers_video-dri@kernel-bugs.osdl.org Reporter: linux@bernd-steinhauser.de Regression: No
Created attachment 206321 --> https://bugzilla.kernel.org/attachment.cgi?id=206321&action=edit journal log from the time of the gpu lockup.
GPU is an AMD Kaveri: 1002:130f Happened when I started an application (way before the application actually showen) on KDE Plasma 5.5. The kernel version was 4.4.2 with an additional vblank fix applied (see [1]).
Since I wasn't prepared for this and ssh wasn't activated and I couldn't save dmesg output. Instead I'll attach what I found in the journal.
[1] https://bugs.freedesktop.org/show_bug.cgi?id=93746
https://bugzilla.kernel.org/show_bug.cgi?id=113341
--- Comment #1 from Michel Dänzer michel@daenzer.net --- Please attach the output of glxinfo and the Xorg log.
https://bugzilla.kernel.org/show_bug.cgi?id=113341
--- Comment #2 from Bernd Steinhauser linux@bernd-steinhauser.de --- Created attachment 206381 --> https://bugzilla.kernel.org/attachment.cgi?id=206381&action=edit glxinfo
https://bugzilla.kernel.org/show_bug.cgi?id=113341
--- Comment #3 from Bernd Steinhauser linux@bernd-steinhauser.de --- I forgot about Xorg log, sorry. Since only one old one is kept, it's gone already. Maybe logging to syslog/journal is a good idea ...
https://bugzilla.kernel.org/show_bug.cgi?id=113341
--- Comment #4 from Bernd Steinhauser linux@bernd-steinhauser.de --- Today I ran into this again and got the Xorg log, but there doesn't seem to be anything interesting in it. Kernel is now 4.4.4.
When the freeze occurs, I can still ssh into the system. Applications continue to run (i.e. I had a music player running), I could even reboot the system, although that seemed to take longer because X didn't like to shutdown (was markt as DSsl+ in ps aux).
https://bugzilla.kernel.org/show_bug.cgi?id=113341
--- Comment #5 from Bernd Steinhauser linux@bernd-steinhauser.de --- Created attachment 207721 --> https://bugzilla.kernel.org/attachment.cgi?id=207721&action=edit Xorg.0.log
https://bugzilla.kernel.org/show_bug.cgi?id=113341
--- Comment #6 from Michel Dänzer michel@daenzer.net --- Any chance you could try if this also happens with LLVM 3.8 or even current SVN/Git?
Does it always happen when starting a particular application?
https://bugzilla.kernel.org/show_bug.cgi?id=113341
--- Comment #7 from Bernd Steinhauser linux@bernd-steinhauser.de --- I tried to build LLVM/clang scm, but it failed (have to check why and if I can get around that).
I've had several freezes over the last few weeks (most of them when I bisected that other bug mentioned above (so could be unrelated), but only a few after staying on 4.4.x), but this one was the only one I could relate to a specific event, starting an application (thunderbird here). The other ones seemed to happen out of nowhere, so it could be that it was just coincidence that it happened when I started that application.
https://bugzilla.kernel.org/show_bug.cgi?id=113341
--- Comment #8 from Michel Dänzer michel@daenzer.net --- FWIW, right now it's better to try LLVM 3.8 than SVN/Git, because the latter will expose you to https://bugs.freedesktop.org/show_bug.cgi?id=94242 .
https://bugzilla.kernel.org/show_bug.cgi?id=113341
--- Comment #9 from Bernd Steinhauser linux@bernd-steinhauser.de --- I will try to build llvm, but it could require me a few days since it's not yet provided by my distribution and I have to check the changes in the build system.
BTW, I can usually ssh into the system. Is there any way I could gather more debug info when this happens?
https://bugzilla.kernel.org/show_bug.cgi?id=113341
--- Comment #10 from Bernd Steinhauser linux@bernd-steinhauser.de --- During the last 2 weeks I switched between amdgpu and radeon a couple of times. What I noticed is that with radeon I do get lockups here and there, I think almost always (not 100% sure though) when a video is running. Both with xv and vdpau as video output. llvm is now 3.8.
On amdgpu I haven't seen a lockup yet, except for a view when bisecting 4.4-rc2 or -rc3, but I guess that was a different problem which got fixed until the release.
https://bugzilla.kernel.org/show_bug.cgi?id=113341
Bernd Steinhauser linux@bernd-steinhauser.de changed:
What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |RESOLVED Resolution|--- |UNREPRODUCIBLE
--- Comment #11 from Bernd Steinhauser linux@bernd-steinhauser.de --- Since I'm now exclusively using amdgpu and since that works very well for me, I didn't do much more tests with radeon and thus cannot tell if this is still present or not. Therefore closing the bug report.
dri-devel@lists.freedesktop.org