https://bugzilla.kernel.org/show_bug.cgi?id=214197
Bug ID: 214197 Summary: [Asus G713QY] RX6800M not usable after exiting Vulkan application Product: Drivers Version: 2.5 Kernel Version: 5.13.13 Hardware: x86-64 OS: Linux Tree: Mainline Status: NEW Severity: normal Priority: P1 Component: Video(DRI - non Intel) Assignee: drivers_video-dri@kernel-bugs.osdl.org Reporter: velemas@gmail.com Regression: No
Asus ROG Strix G17 Advantage Edition (G713QY) has hybrid-graphics with dGPU RX6800M. After exiting any Vulkan application, it becomes unusable. Vulkaninfo sees dGPU before Vulkan app and does not see RX6800M after.
After Vulkan app close, dmesg reports:
[ 154.385749] amdgpu 0000:03:00.0: amdgpu: RAS: optional ras ta ucode is not available [ 154.401405] amdgpu 0000:03:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available [ 154.401409] amdgpu 0000:03:00.0: amdgpu: SMU is resuming... [ 159.038150] amdgpu 0000:03:00.0: amdgpu: message: RunDcBtc (54) param: 0x00000000 is timeout (no response) [ 159.038154] amdgpu 0000:03:00.0: amdgpu: Failed to setup smc hw! [ 159.038156] [drm:amdgpu_device_ip_resume_phase2 [amdgpu]] *ERROR* resume of IP block <smu> failed -62 [ 159.038220] amdgpu 0000:03:00.0: amdgpu: amdgpu_device_ip_resume failed (-62).
Using amdgpu.runpm=0 parameter fixes the issue.
https://bugzilla.kernel.org/show_bug.cgi?id=214197
Alex Deucher (alexdeucher@gmail.com) changed:
What |Removed |Added ---------------------------------------------------------------------------- CC| |alexdeucher@gmail.com
--- Comment #1 from Alex Deucher (alexdeucher@gmail.com) --- Please attach your full dmesg output from boot through the problematic case.
https://bugzilla.kernel.org/show_bug.cgi?id=214197
--- Comment #2 from Alex Deucher (alexdeucher@gmail.com) --- Does this patch fix the issue? https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?i...
https://bugzilla.kernel.org/show_bug.cgi?id=214197
--- Comment #3 from velemas@gmail.com --- Created attachment 298505 --> https://bugzilla.kernel.org/attachment.cgi?id=298505&action=edit full dmesg output
https://bugzilla.kernel.org/show_bug.cgi?id=214197
--- Comment #4 from velemas@gmail.com ---
(In reply to Alex Deucher from comment #2)
Does this patch fix the issue? https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/ ?id=202ead5a3c589b0594a75cb99f080174f6851fed
Kernel 5.13.13 has this patch already. So apparently it does not fix the problem. It occurs with radv, amdvlk, and amdvlk-pro. External monitor is attached via HDMI (although it happens without ext. monitor too).
Sometimes dmesg does not contain above mentioned lines but dGPU is still unusable. Sometimes DXVK delivers VK_ERROR_DEVICE_LOST status even during runtime.
https://bugzilla.kernel.org/show_bug.cgi?id=214197
Pablo Cholaky (waltercool@slash.cl) changed:
What |Removed |Added ---------------------------------------------------------------------------- CC| |waltercool@slash.cl
--- Comment #5 from Pablo Cholaky (waltercool@slash.cl) --- Can confirm this issue as well under MSI Delta with RX6700M, in order to discard any "laptop specific issue". Both are Zen3 Navi cards.
Now, while it doesn't break GPU usage, but its a waste of power resources.
This issue it's kinda common, even with kernel 5.15.0-rc5. I don't have any steps to reproduce sadly.
https://bugzilla.kernel.org/show_bug.cgi?id=214197
--- Comment #6 from Alex Deucher (alexdeucher@gmail.com) --- Does this patch help? https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?i...
https://bugzilla.kernel.org/show_bug.cgi?id=214197
--- Comment #7 from velemas@gmail.com --- Kernel 5.14.14 already has it but it's not fixed. I got mostly the same dmesg message but somewhat different:
[ 367.167527] amdgpu 0000:03:00.0: amdgpu: RAS: optional ras ta ucode is not available [ 367.183399] amdgpu 0000:03:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available [ 367.183406] amdgpu 0000:03:00.0: amdgpu: SMU is resuming... [ 371.863082] amdgpu 0000:03:00.0: amdgpu: Failed to setup smc hw! [ 371.863085] [drm:amdgpu_device_ip_resume_phase2 [amdgpu]] *ERROR* resume of IP block <smu> failed -62 [ 371.863147] amdgpu 0000:03:00.0: amdgpu: amdgpu_device_ip_resume failed (-62).
https://bugzilla.kernel.org/show_bug.cgi?id=214197
--- Comment #8 from velemas@gmail.com --- Recent kernels in 5.15.* and 5.16.* fix the issue for me.
https://bugzilla.kernel.org/show_bug.cgi?id=214197
velemas@gmail.com changed:
What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |RESOLVED Resolution|--- |OBSOLETE
dri-devel@lists.freedesktop.org