https://bugzilla.kernel.org/show_bug.cgi?id=204987
Bug ID: 204987
Summary: general protection fault in amdgpu_dm_atomic_commit_tail (Vega64)
Product: Drivers
Version: 2.5
Kernel Version: 5.3.1
Hardware: x86-64
OS: Linux
Tree: Mainline
Status: NEW
Severity: high
Priority: P1
Component: Video(DRI - non Intel)
Assignee: drivers_video-dri@kernel-bugs.osdl.org
Reporter: steinex@nognu.de
Regression: No
drm.debug=0x54 log uploaded here due to attachment size limit: https://nognu.de/p/1569355650
This is on 5.3.1 with this patch series applied: https://patchwork.freedesktop.org/series/64505/
It happens between ~5 and ~45 minutes after the system is booted into X. There is no obvious pattern to what triggers it.
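For readers of the log, here is a small sketch of how the drm.debug=0x54 bitmask mentioned above breaks down into debug categories. The bit names are an assumption taken from the DRM_UT_* values in the kernel's drm_print.h as of roughly this kernel series; check them against the tree actually in use.

```python
# Assumed mapping of drm.debug bits to category names (DRM_UT_* in drm_print.h).
DRM_DEBUG_BITS = {
    0x01: "CORE",
    0x02: "DRIVER",
    0x04: "KMS",
    0x08: "PRIME",
    0x10: "ATOMIC",
    0x20: "VBL",
    0x40: "STATE",
    0x80: "LEASE",
    0x100: "DP",
}


def decode_drm_debug(mask):
    """Return the names of the debug categories enabled in mask, lowest bit first."""
    return [name for bit, name in sorted(DRM_DEBUG_BITS.items()) if mask & bit]


print(decode_drm_debug(0x54))  # ['KMS', 'ATOMIC', 'STATE']
```

Under that mapping, 0x54 enables the KMS, atomic and state messages, which are the categories that cover the atomic commit path named in the summary.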
Frank Steinborn (steinex@nognu.de) changed:
What       |Removed |Added
----------------------------------------------------------------------------
Regression |No      |Yes
--- Comment #1 from Frank Steinborn (steinex@nognu.de) ---
I can reproduce this reliably by running compton in combination with redshift. As soon as compton is running and redshift starts to shift the screen, the failed commits start to show up.
Running Unigine Heaven while redshift is running but not shifting triggers it too, because Heaven resets the redshift gamma and redshift then tries to shift it back.
The bug is not triggered when compton uses xrender as backend instead of GLX, which is somewhat expected I guess.
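The reproduction described in this comment could be scripted roughly as follows. This is only a sketch, not the reporter's actual setup: it assumes compton and redshift are installed and an X session is running, and it assumes that repeated one-shot `redshift -O` calls exercise the same gamma update path as redshift's gradual shift. The helper names, temperatures, cycle count and delay are hypothetical.

```python
import shutil
import subprocess
import time


def compton_cmd(backend="glx"):
    """Command line for compton with the given backend (glx triggers the bug, xrender does not)."""
    return ["compton", "--backend", backend]


def redshift_cmds(temps=(6500, 3500)):
    """One-shot redshift calls: -P resets existing gamma ramps, -O sets a colour temperature."""
    return [["redshift", "-P", "-O", str(t)] for t in temps]


def run_repro(cycles=100, delay=1.0):
    """Start compton, then flip the gamma back and forth (hypothetical values)."""
    missing = [tool for tool in ("compton", "redshift") if shutil.which(tool) is None]
    if missing:
        raise RuntimeError("missing tools: " + ", ".join(missing))
    compositor = subprocess.Popen(compton_cmd())
    try:
        for _ in range(cycles):
            for cmd in redshift_cmds():
                subprocess.run(cmd, check=False)
                time.sleep(delay)
    finally:
        compositor.terminate()
        compositor.wait()
        # -x puts the gamma ramps back to their defaults.
        subprocess.run(["redshift", "-x"], check=False)
```

Calling `run_repro()` from an X session while watching dmesg should show whether the failed commits appear. Swapping in `compton_cmd("xrender")` gives the control case that, per the comment above, does not trigger the bug.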
Frank Steinborn (steinex@nognu.de) changed:
What    |Removed |Added
----------------------------------------------------------------------------
Summary |general protection fault in amdgpu_dm_atomic_commit_tail (Vega64) |fault in amdgpu_dm_atomic_commit_tail on Vega64 with compton and redshift
--- Comment #2 from Frank Steinborn (steinex@nognu.de) ---
Still happens on 5.4.2.
Bhasker C V (bhasker@unixindia.com) changed:
What |Removed |Added
----------------------------------------------------------------------------
CC   |        |bhasker@unixindia.com
--- Comment #3 from Bhasker C V (bhasker@unixindia.com) ---
I get this error after hibernation and resume. It does not happen on an immediate resume, but if the machine is left overnight and resumed in the morning, I see the amdgpu_dm_atomic_commit_tail crash. I cannot load a kexec kernel on AMD Ryzen, so I only have a snapshot of the error message. The system freezes and nothing can be done other than a cold reboot.
Attaching a photo of the crash
--- Comment #4 from Bhasker C V (bhasker@unixindia.com) ---
Created attachment 292395
  --> https://bugzilla.kernel.org/attachment.cgi?id=292395&action=edit
Kernel crash AMD GPU at amdgpu_dm_atomic_commit_tail
Frank Steinborn (steinex@nognu.de) changed:
What       |Removed |Added
----------------------------------------------------------------------------
Status     |NEW     |RESOLVED
Resolution |---     |OBSOLETE
dri-devel@lists.freedesktop.org