https://bugs.freedesktop.org/show_bug.cgi?id=109200
Bug ID: 109200 Summary: VMC page fault and Coherent Slave Error: Address violation after upgrading to 4.20 Product: DRI Version: DRI git Hardware: x86-64 (AMD64) OS: Linux (All) Status: NEW Severity: major Priority: medium Component: DRM/AMDgpu Assignee: dri-devel@lists.freedesktop.org Reporter: vicluo96@gmail.com
Created attachment 142927 --> https://bugs.freedesktop.org/attachment.cgi?id=142927&action=edit kernel log on 4.20.arch1-1
I'm currently using AMD 2500U on Thinkpad E585 with Archlinux kernel 4.20.arch1-1. Everything works fine with 4.19.12. However after upgrading to 4.20.arch1-1, system crashes after gnome-shell starts. The error log reports:
Dec 31 11:57:09 lzThinkpad kernel: amdgpu 0000:05:00.0: [mmhub] VMC page fault (src_id:0 ring:158 vmid:1 pasid:32768, for process gnome-shell pid 1008 thread gnome-shel:cs0 pid 1022) Dec 31 11:57:09 lzThinkpad kernel: amdgpu 0000:05:00.0: in page starting at address 0x0000800100020000 from 18 Dec 31 11:57:09 lzThinkpad kernel: amdgpu 0000:05:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x0010013C Dec 31 11:57:09 lzThinkpad kernel: mce: [Hardware Error]: Machine check events logged Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Deferred error, no action required. Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: CPU:0 (17:11:0) MC20_STATUS[-|-|MiscV|-|AddrV Dec 31 11:57:09 lzThinkpad kernel: amdgpu 0000:05:00.0: [mmhub] VMC page fault (src_id:0 ring:158 vmid:1 pasid:32768, for process gnome-shell pid 1008 thread gnome-shel:cs0 pid 1022) Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Error Addr: 0x00007ffcffffff00 Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: IPID: 0x0000002e00000000, Syndrome: 0x000000005b240205 Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Coherent Slave Error: Address violation. Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Deferred error, no action required. Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Error Addr: 0x00007ffcffffff00 Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: cache level: L3/GEN, mem/io: IO, mem-tx: IRD, part-proc: SRC (no timeout) Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Coherent Slave Extended Error Code: 1 Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Error Addr: 0x00007ffcffffff00 Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Error Addr: 0x00007ffcffffff00 Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Error Addr: 0x00007ffcffffff00 Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Error Addr: 0x00007ffcffffff00 (more Error Addr lines omitted)
https://bugs.freedesktop.org/show_bug.cgi?id=109200
--- Comment #1 from Zheng Luo vicluo96@gmail.com --- w/ mesa 18.3.1-1, gnome-shell & mutter & gnome-desktop 3.30.2-1, wayland 1.16.0-1, libva 2.3.0-1, linux-firmware 20181218.0f22c85-1
https://bugs.freedesktop.org/show_bug.cgi?id=109200
Zheng Luo vicluo96@gmail.com changed:
What |Removed |Added ---------------------------------------------------------------------------- See Also| |https://bugs.freedesktop.or | |g/show_bug.cgi?id=108992
https://bugs.freedesktop.org/show_bug.cgi?id=109200
Zheng Luo vicluo96@gmail.com changed:
What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |RESOLVED Resolution|--- |DUPLICATE
--- Comment #2 from Zheng Luo vicluo96@gmail.com --- As mentioned in https://bugs.freedesktop.org/show_bug.cgi?id=108992, iommu=soft stills works. This looks like a duplicate of that issue.
*** This bug has been marked as a duplicate of bug 108992 ***
dri-devel@lists.freedesktop.org