https://bugs.freedesktop.org/show_bug.cgi?id=99710
Bug ID: 99710
Summary: [amdgpu R9 390] GPU hang when playing Hearthstone in Wine
Product: Mesa
Version: git
Hardware: x86-64 (AMD64)
OS: Linux (All)
Status: NEW
Severity: normal
Priority: medium
Component: Drivers/Gallium/radeonsi
Assignee: dri-devel@lists.freedesktop.org
Reporter: garththeisen@hotmail.com
QA Contact: dri-devel@lists.freedesktop.org
Created attachment 129407 --> https://bugs.freedesktop.org/attachment.cgi?id=129407&action=edit DDEBUG_DUMP_#1
I can repeatedly hang the system by running Hearthstone via Wine. The hang is unpredictable, but it often occurs just as the game enters a multiplayer match or shortly after. The screen goes blank and the system becomes unresponsive; in most cases the Magic SysRq key does not help.
System Profile ...
GPU: R9 390X
Distro: Gentoo
Kernel: Linux 4.9.8
KMD: amdgpu
UMD: Mesa (git)
--- Comment #1 from garththeisen@hotmail.com --- Created attachment 129408 --> https://bugs.freedesktop.org/attachment.cgi?id=129408&action=edit Xorg.log
--- Comment #2 from garththeisen@hotmail.com --- GPU: XFX R9 390
--- Comment #3 from garththeisen@hotmail.com --- Created attachment 129409 --> https://bugs.freedesktop.org/attachment.cgi?id=129409&action=edit lspci output
Mike Lothian mike@fireburn.co.uk changed:
What |Removed |Added
----------------------------------------------------------------------------
CC   |        |mike@fireburn.co.uk
--- Comment #4 from Mike Lothian mike@fireburn.co.uk --- Which graphics mode are you using? Default, CSMT or Gallium Nine?
--- Comment #5 from garththeisen@hotmail.com --- I am able to recreate this on Wine-git using modes default and nine. Additionally this is also a problem in Crossover with Performance Enhanced Graphics enabled.
--- Comment #6 from garththeisen@hotmail.com --- Interesting discovery. I loaded an Android VirtualBox guest using Genymotion (Google Nexus 7 image with ARM translation installed) and tested the Google Play version of Hearthstone.
I am able to reproduce the same behaviour, hard-locking my host (the machine profiled above). The screen goes black after an indeterminate amount of time in a Standard match. My host machine is unresponsive to SSH after most trials. Any suggestions for capturing diagnostics would be appreciated.
--- Comment #7 from Sandeep sandy.8925@gmail.com --- I have the same GPU and have also been experiencing system hangs for the past 1-2 months. I believe they may be related to this issue, since they only occur when using 3D graphics in some form, either while playing Left 4 Dead 2 or when using the Chromium browser with GPU acceleration enabled. In the case of Left 4 Dead 2, the system always hangs at some unpredictable point.
I am using the AMDGPU driver, with AMDGPU CIK support enabled.
I tried running 4.13 stable today, and the crash still occurred. I will try older kernels to see if I am still able to reproduce.
--- Comment #8 from Sandeep sandy.8925@gmail.com --- Well, I tried running 4.11 RC3, and that also had the same problem. Will see if I can go further back (4.10, 4.9 etc.) and see if I can get it to work without problems. Otherwise, the problem lies somewhere else, but is definitely related to the GPU drivers, since it doesn't get triggered by anything else.
--- Comment #9 from Sandeep sandy.8925@gmail.com --- I've tried 4.11.9 and 4.10.13 kernels, and the hang occurs on both of them.
--- Comment #10 from Sandeep sandy.8925@gmail.com --- I tried the 4.11.0 kernel since I suspected that the buggy change might also be present in the point releases, but that also caused a hang when playing Left 4 Dead 2. Will try older kernels and see if they work correctly (which they should, since this hang definitely wasn't present 2 months ago).
--- Comment #11 from garththeisen@hotmail.com --- (In reply to Sandeep from comment #10)
> I tried the 4.11.0 kernel since I suspected that the buggy change might also be present in the point releases, but that also caused a hang when playing Left 4 Dead 2. Will try older kernels and see if they work correctly (which they should, since this hang definitely wasn't present 2 months ago).
Yep, Sandeep, I think this behaviour is tied to the DPM issues highlighted in Bug 91880: 'Radeonsi on Grenada cards (r9 390) exceptionally unstable and poorly performing'. I suggest following, reading, and commenting on that issue.
--- Comment #12 from Sandeep sandy.8925@gmail.com --- Well, to clarify, I have been using the AMDGPU driver for the past year, not the Radeon driver. I've only faced this issue for the past 2 months and never had the problem earlier, so I don't think the other bug applies. Also, the devs say old firmware is the culprit there, but I run Arch Linux, and its linux-firmware package contains whatever was present on 7th September of this year, so I doubt the firmware is out of date.
I did test 4.10.6, but Left 4 Dead 2 crashed less than a second after loading a level - this is weird, since I did run 4.10.x kernels without any problems. Makes me think the problem lies elsewhere.
Will see if I can figure out what's causing this.
--- Comment #13 from Sandeep sandy.8925@gmail.com --- Ok fine - I tested with amdgpu.dpm=0 and saw no hangs after 2 whole levels of Left 4 Dead 2.
Looks like DPM is the problem - seems to be a regression, since it was working fine with DPM 2-3 months ago.
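When repeating a test like this, it helps to confirm that the running kernel was actually booted with the parameter. A minimal sketch, assuming a POSIX shell; the function name amdgpu_dpm_from_cmdline is made up for illustration, and it only parses a command line string passed to it:

```shell
# Print the value of amdgpu.dpm found in a kernel command line string,
# or "unset" if the parameter is absent. If the parameter is repeated,
# the last occurrence is printed.
# The body runs in a subshell so that set -f and the variables stay local.
amdgpu_dpm_from_cmdline() (
    set -f                      # no globbing while splitting the string
    value="unset"
    for arg in $1; do
        case "$arg" in
            amdgpu.dpm=*) value="${arg#amdgpu.dpm=}" ;;
        esac
    done
    printf '%s\n' "$value"
)
```

On a running system it could be called as: amdgpu_dpm_from_cmdline "$(cat /proc/cmdline)"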
--- Comment #14 from Sandeep sandy.8925@gmail.com --- And I just found out that suspend/resume is totally broken if I disable DPM.
This is a critical bug for us Linux users. Given that this is a $300 card, it doesn't feel like I got my money's worth. I've never had as many problems with Intel and NVIDIA GPUs as I've had with AMD GPU drivers.
--- Comment #15 from Sandeep sandy.8925@gmail.com --- Trying to get an apitrace trace so that I can reproduce consistently.
Found out that I can reproduce reliably with OpenArena. Will try to create a trace now.
--- Comment #16 from Sandeep sandy.8925@gmail.com --- Ok, I was able to successfully reproduce using a trace from OpenArena.
I'll try uploading the file and paste a link here.
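For anyone wanting to capture a similar trace, the record-and-replay steps could be sketched roughly as below. This is an illustration, not the exact commands used above: the helper names run and trace_and_replay are made up, the program name openarena is an assumption about how the game binary is called on a given system, and DRY_RUN=1 only prints the commands.

```shell
# Run a command, or just print it when DRY_RUN=1.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        printf '%s\n' "$*"
    else
        "$@"
    fi
}

# Record a trace of the given program, then replay it.
# $1: program to trace; $2: optional trace file name
# (defaults to "<program name>.trace").
trace_and_replay() (
    prog="$1"
    trace_file="${2:-$(basename "$prog").trace}"
    run apitrace trace --output="$trace_file" "$prog" || exit 1
    run apitrace replay "$trace_file"
)
```

With DRY_RUN=1 set, trace_and_replay openarena prints the two apitrace invocations without touching the GPU, which is useful on a machine where the replay is expected to hang.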
--- Comment #17 from Sandeep sandy.8925@gmail.com --- Definitely seems to be DPM related. I used the following command to force the DPM performance level, and Left 4 Dead 2 and OpenArena then worked fine with no hangs.
"echo high > /sys/class/drm/card0/device/power_dpm_force_performance_level"
Works fine if I set "low" as well.
The system hangs if I set "auto".
Well, at least I have a trace that reproduces it.
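The workaround above can be wrapped in a small helper for repeated testing. A minimal sketch, assuming a POSIX shell; the names DPM_FILE, dpm_get and dpm_set are made up for illustration, and card0 is an assumption that may not match systems with more than one GPU:

```shell
# Path of the sysfs control file quoted above; override DPM_FILE to
# point at a different card (card0 is an assumption).
DPM_FILE="${DPM_FILE:-/sys/class/drm/card0/device/power_dpm_force_performance_level}"

# Print the current forced performance level.
dpm_get() {
    cat "$DPM_FILE"
}

# Write a new level. Only the three values tried in this report are
# accepted; anything else is rejected without touching the file.
dpm_set() {
    case "$1" in
        auto|low|high) printf '%s\n' "$1" > "$DPM_FILE" ;;
        *) echo "usage: dpm_set auto|low|high" >&2; return 2 ;;
    esac
}
```

Writing to the sysfs file requires root, and the setting does not survive a reboot, so it has to be reapplied after each boot.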
GitLab Migration User gitlab-migration@fdo.invalid changed:
What       |Removed |Added
----------------------------------------------------------------------------
Status     |NEW     |RESOLVED
Resolution |---     |MOVED
--- Comment #18 from GitLab Migration User gitlab-migration@fdo.invalid --- -- GitLab Migration Automatic Message --
This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity.
You can subscribe and participate further through the new bug through this link to our GitLab instance: https://gitlab.freedesktop.org/mesa/mesa/issues/1251.