https://bugs.freedesktop.org/show_bug.cgi?id=96243
Bug ID: 96243 Summary: GPU initialization fails Product: DRI Version: unspecified Hardware: x86-64 (AMD64) OS: Linux (All) Status: NEW Severity: normal Priority: medium Component: DRM/AMDgpu Assignee: dri-devel@lists.freedesktop.org Reporter: hiwatari.seiji@gmail.com
Created attachment 124126 --> https://bugs.freedesktop.org/attachment.cgi?id=124126&action=edit dmesg output after modprobing amdgpu
Setup: QEMU + VFIO running VM with Ubuntu 16.04 and the latest amdgpu driver (amdgpu-pro 16.20.3)
If the system is started normally, the kernel crashes during boot with various errors (varies from boot to boot!).
If the module (amdgpu) is blacklisted during boot (grub.cfg) and modprobed afterwards the error always the same [see attached file]:
sw_init 5 failed -12 amdgpu_init failed memory type 2 has not been initialized amdgpu probe failed with error -12
Experiments:
- Using the amdgpu version shipped with Ubuntu 16.04, booting works ONCE. Rebooting/Shutting-Down the VM leads to consecutive errors of not beeing able to init the GPU again, probably due to Bonaire PCI-Reset issues. Restarting the Host itself allows booting the VM once more. Hereafter: Same error.
- A Windows 10 VM does not suffer from those issues. Rebooting/Shutting-Down this VM works without issues / host-reboot
https://bugs.freedesktop.org/show_bug.cgi?id=96243
John Bridgman john.bridgman@amd.com changed:
What |Removed |Added ---------------------------------------------------------------------------- Summary|GPU initialization fails |GPU initialization fails | |when running in VM
https://bugs.freedesktop.org/show_bug.cgi?id=96243
--- Comment #1 from Alex Deucher alexdeucher@gmail.com --- Can you try a newer kernel?
https://bugs.freedesktop.org/show_bug.cgi?id=96243
--- Comment #2 from hiwatari.seiji@gmail.com --- I did another round of testing and am quite pleased with the results.
New setup:
# Host qemu: 2.10.0 kernel: 4.13.5-gentoo
# Guest: kernel: vmlinuz-4.13.6-1-default (OpenSUSE) cmdline: amdgpu.cik_support=1 modprobe.blacklist=radeon
I did a couple of restarts and am quite pleased, that no host-hangups occured. The chance of a successfull initialization of the graphics hardware within the guest is at around 60%.
When it fails, the following line is always the last visible:
fb: switching to amdgpudrmfb from EFI VGA
Additionally, even when everything works, there are a couple of ring test errors:
[drm:gfx_v7_0_ring_test_ring [amdgpu]] *ERROR* amdgpu: ring 2 test failed (scratch(0xC040)=0xCAFEDEAD)
I'll attach a new log-file showing the boot log when everything works.
https://bugs.freedesktop.org/show_bug.cgi?id=96243
--- Comment #3 from hiwatari.seiji@gmail.com --- Created attachment 135114 --> https://bugs.freedesktop.org/attachment.cgi?id=135114&action=edit boot_log_successfull_initialization
https://bugs.freedesktop.org/show_bug.cgi?id=96243
Martin Peres martin.peres@free.fr changed:
What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |RESOLVED Resolution|--- |MOVED
--- Comment #4 from Martin Peres martin.peres@free.fr --- -- GitLab Migration Automatic Message --
This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity.
You can subscribe and participate further through the new bug through this link to our GitLab instance: https://gitlab.freedesktop.org/drm/amd/issues/75.
dri-devel@lists.freedesktop.org