https://bugs.freedesktop.org/show_bug.cgi?id=98162
Bug ID: 98162 Summary: gpu hangs with unigine heaven on drm-next-4.9-wip Product: DRI Version: unspecified Hardware: Other OS: All Status: NEW Severity: normal Priority: medium Component: DRM/AMDgpu Assignee: dri-devel@lists.freedesktop.org Reporter: haagch@frickel.club
Created attachment 127138 --> https://bugs.freedesktop.org/attachment.cgi?id=127138&action=edit dmesg with gpu hang
XFX Radeon RX 480 XXX OC, latest mesa git and llvm svn.
Running unigine heaven for a while hangs the gpu like this:
Okt 08 10:31:14 c-l kernel: [drm:amdgpu_job_timedout] *ERROR* ring gfx timeout, last signaled seq=42581, last emitted seq=42583 Okt 08 10:31:14 c-l kernel: [drm] IP block:1 is hang! Okt 08 10:31:14 c-l kernel: [drm] IP block:5 is hang!
I tried bisecting and landed on 4be051aeb3964146d3922238fff0ed1e4a9656d1 "drm/amd/powerplay: use smu7 hwmgr to manager polaris10/11" but I'm not 100% confident I caught every bad commit because sometimes unigine heaven needs to run for several minutes before the hang happens and this commit is not trivial to revert.
https://bugs.freedesktop.org/show_bug.cgi?id=98162
--- Comment #1 from Christoph Haag haagch@frickel.club --- Does NOT happen with 4.9-rc1 even though this commit is in it.
Either my bisect result is just wrong or it only happens in combination with another commit not yet in 4.9-rc1.
https://bugs.freedesktop.org/show_bug.cgi?id=98162
--- Comment #2 from Christoph Haag haagch@frickel.club --- Created attachment 127334 --> https://bugs.freedesktop.org/attachment.cgi?id=127334&action=edit gpu hang dmesg with counter strike: global offensive
To document this: csgo ran fine, although very slow with stock 4.9-rc1. I did not try for very long, so perhaps it would still have happened after a while.
The performance issue was identified to be caused by 87744ab3832b83ba71b931f86f9cfdb000d07da5
After reverting this commit on 4.9-rc1, I still do not get GPU hangs with unigine heaven, but I start seeing GPU hangs with csgo.
Maybe not the same issue, but it doesn't happen on 4.8.
https://bugs.freedesktop.org/show_bug.cgi?id=98162
--- Comment #3 from Christoph Haag haagch@frickel.club --- Created attachment 127394 --> https://bugs.freedesktop.org/attachment.cgi?id=127394&action=edit csgo gpu fault and hang on amd-staging-4.7
I just don't know anymore. There seems to be a lot of brokenness on Polaris (or maybe just on my XFX Radeon RX 480 XXX OC model) right now.
This is a GPU fault + gpu hang that is caused by the native version of csgo on amd-staging-4.7.
https://bugs.freedesktop.org/show_bug.cgi?id=98162
--- Comment #4 from Christoph Haag haagch@frickel.club --- Created attachment 127395 --> https://bugs.freedesktop.org/attachment.cgi?id=127395&action=edit wine+nine csgo gpu fault and hang on stock 4.8
This is a GPU fault + hang I got on stock 4.8 when I tried the windows version of csgo with nine. The native version of csgo runs normally on stock 4.8.
https://bugs.freedesktop.org/show_bug.cgi?id=98162
--- Comment #5 from Christoph Haag haagch@frickel.club --- unigine heaven and csgo work fine with amdgpu-staging-4.7 f9c58ccc03147e652284f06053b089eca957e1e1
with drm-next-4.10 (with 87744ab3832b83ba71b931f86f9cfdb000d07da5) reverted for performance, unigine heaven works (I think), but csgo causes gpu hangs.
https://bugs.freedesktop.org/show_bug.cgi?id=98162
--- Comment #6 from Christoph Haag haagch@frickel.club --- Created attachment 127496 --> https://bugs.freedesktop.org/attachment.cgi?id=127496&action=edit wine+nine csgo gpu fault and hang amd-staging-4.7
Wait, this amd-staging-4.7 revision doesn't work well either. Tried with csgo nine and it fails too.
https://bugs.freedesktop.org/show_bug.cgi?id=98162
--- Comment #7 from Christoph Haag haagch@frickel.club --- On stock 4.8 again and native csgo and csgo with nine run both fine - on auto (high) graphics settings. It appears there is something with the lowest settings that breaks the driver on both the native version and the nine version.
I think I the original bug here is solved, because unigine heaven has been running stable on both amd-staging-4.7 and drm-next-4.9/4.10-wip, and that the csgo and soma bug is something else).
https://bugs.freedesktop.org/show_bug.cgi?id=98162
Christoph Haag haagch@frickel.club changed:
What |Removed |Added ---------------------------------------------------------------------------- Resolution|--- |DUPLICATE Status|NEW |RESOLVED
--- Comment #8 from Christoph Haag haagch@frickel.club --- I'm relatively sure that my bad GPU caused all of this (maybe except the first issue which was soon solved).
*** This bug has been marked as a duplicate of bug 98905 ***
dri-devel@lists.freedesktop.org