https://bugs.freedesktop.org/show_bug.cgi?id=105883
Bug ID: 105883 Summary: booting with kernel using amd-staging-drm-next on 2400G hangs Product: DRI Version: DRI git Hardware: x86-64 (AMD64) OS: Linux (All) Status: NEW Severity: blocker Priority: medium Component: DRM/AMDgpu Assignee: dri-devel@lists.freedesktop.org Reporter: joshua613@gmail.com
I am running an AMD Ryzen 2400G, using its integrated graphics in the Linux host and a gtx 1070 bound to VFIO for virtualized passthrough. When I boot with a kernel fetched from git using amd-dri-next on 4.16, it halts the booting process, usually somewhere around when it checks UTMP. It does not lock up my keyboard lights, but nothing is displayed on the screen, it does respond to control-alt-delete. My system is set up to boot to the command line, I use X11 from there with "startx" usually; when the system completes boot of course.
https://bugs.freedesktop.org/show_bug.cgi?id=105883
--- Comment #1 from Alex Deucher alexdeucher@gmail.com --- Can you attach your kernel log or dmesg output from the boot? Do other kernels work?
https://bugs.freedesktop.org/show_bug.cgi?id=105883
--- Comment #2 from Joshua Lee joshua613@gmail.com --- Where is my kernel log located? I successfully use a 4.16 mainline kernel... it's dri-next-staging that's causing problems. I am using mesa from git also... though I don't know if it ever gets around to using 3d graphics before the driver fails anyway.
https://bugs.freedesktop.org/show_bug.cgi?id=105883
--- Comment #3 from Joshua Lee joshua613@gmail.com --- How do I find an prior boot's dmesg?
https://bugs.freedesktop.org/show_bug.cgi?id=105883
--- Comment #4 from Edward Kigwana edwardwwgk@gmail.com --- Try
options amdgpu dpm=0 dc=1 and seee if it still locks up.
https://bugs.freedesktop.org/show_bug.cgi?id=105883
--- Comment #5 from Joshua Lee joshua613@gmail.com --- (In reply to Edward Kigwana from comment #4)
Try
options amdgpu dpm=0 dc=1 and seee if it still locks up.
That's in /etc/modconf.d or the like, right?
https://bugs.freedesktop.org/show_bug.cgi?id=105883
--- Comment #6 from Harry Wentland harry.wentland@amd.com --- On Ubuntu the kernel log keeps appending to /var/log/kern.log, but that might look different on different distros.
If you have a luxury of a second system you might be able to ssh into the Ryzen system and run dmesg that way.
As for the options Edward mentioned, you can pass them to the kernel command line. If you use grub for your bootloader you'd press 'e' on the selected kernel and append " amdgpu.dpm=0 amdgpu.dc=1" at the end of the line that starts with "linux". Alternatively you can append those to GRUB_CMDLINE_LINUX in /etc/default/grub and run "sudo update-grub"
Keep in mind that this is how I'd do it on Ubuntu. There' might be a way to pass these through /etc/modconf.d as well.
https://bugs.freedesktop.org/show_bug.cgi?id=105883
--- Comment #7 from Joshua Lee joshua613@gmail.com --- I'm not sure where the kernel log is on arch. When I add that option to my kernel command line that you recommended, both my drm-next-staging kernel and 4.16 mainline kernels fail. I have to remove it, then my 4.16 kernel works, but the drm-next-staging kernel still fails to operate the screen. (The kernel doesn't crash, as my keyboard still works, I can even press control-alt-delete to reboot, so I suspect it just isn't using my screen in the amdgpu driver.)
https://bugs.freedesktop.org/show_bug.cgi?id=105883
--- Comment #8 from Michel Dänzer michel@daenzer.net --- Is CONFIG_DRM_AMD_DC_DCN1_0 enabled in the kernel build configuration in both cases?
https://bugs.freedesktop.org/show_bug.cgi?id=105883
--- Comment #9 from taijian@posteo.de --- This is possibly the same bug described in bug #105760.
https://bugs.freedesktop.org/show_bug.cgi?id=105883
taijian@posteo.de changed:
What |Removed |Added ---------------------------------------------------------------------------- See Also| |https://bugs.freedesktop.or | |g/show_bug.cgi?id=105760
https://bugs.freedesktop.org/show_bug.cgi?id=105883
--- Comment #10 from taijian@posteo.de --- And yes, Arch has had CONFIG_DRM_AMD_DC_DCN1_0=y since 4.15.
https://bugs.freedesktop.org/show_bug.cgi?id=105883
--- Comment #11 from taijian@posteo.de --- (In reply to Joshua Lee from comment #3)
How do I find an prior boot's dmesg?
Try "journalctl -b -1" (for the boot attempt directly prior to this one, -2 for the one before that, etc...).
https://bugs.freedesktop.org/show_bug.cgi?id=105883
--- Comment #12 from Joshua Lee joshua613@gmail.com --- Someone on the /r/VFIO discord with a Ryzen APU (he usually boots his VM from the console, rather than having a graphical host) confirmed the crashiness by running Furmark, which crashed his GPU driver in ten minutes; his dmesg showed that as well.
13877 0.1 0.0 0 0 pts/1 ZNl+ 10:09 0:00 [GpuTest] <defunct> [90972.383503] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, last signaled seq=36081, last emitted seq=36083 [90972.383512] [drm] IP block:psp is hung! [90972.383514] [drm] GPU recovery disabled.
https://bugs.freedesktop.org/show_bug.cgi?id=105883
--- Comment #13 from Joshua Lee joshua613@gmail.com --- To be clear, the Furmark was being run in his host system, not within a VM.
https://bugs.freedesktop.org/show_bug.cgi?id=105883
Martin Peres martin.peres@free.fr changed:
What |Removed |Added ---------------------------------------------------------------------------- Resolution|--- |MOVED Status|NEW |RESOLVED
--- Comment #14 from Martin Peres martin.peres@free.fr --- -- GitLab Migration Automatic Message --
This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity.
You can subscribe and participate further through the new bug through this link to our GitLab instance: https://gitlab.freedesktop.org/drm/amd/issues/342.
dri-devel@lists.freedesktop.org