https://bugs.freedesktop.org/show_bug.cgi?id=109200
Bug ID: 109200
Summary: VMC page fault and Coherent Slave Error: Address violation after upgrading to 4.20
Product: DRI
Version: DRI git
Hardware: x86-64 (AMD64)
OS: Linux (All)
Status: NEW
Severity: major
Priority: medium
Component: DRM/AMDgpu
Assignee: dri-devel@lists.freedesktop.org
Reporter: vicluo96@gmail.com
Created attachment 142927
  --> https://bugs.freedesktop.org/attachment.cgi?id=142927&action=edit
kernel log on 4.20.arch1-1
I'm using an AMD 2500U in a ThinkPad E585 running the Arch Linux kernel 4.20.arch1-1. Everything works fine with 4.19.12. However, after upgrading to 4.20.arch1-1, the system crashes after gnome-shell starts. The kernel log reports:
Dec 31 11:57:09 lzThinkpad kernel: amdgpu 0000:05:00.0: [mmhub] VMC page fault (src_id:0 ring:158 vmid:1 pasid:32768, for process gnome-shell pid 1008 thread gnome-shel:cs0 pid 1022)
Dec 31 11:57:09 lzThinkpad kernel: amdgpu 0000:05:00.0: in page starting at address 0x0000800100020000 from 18
Dec 31 11:57:09 lzThinkpad kernel: amdgpu 0000:05:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x0010013C
Dec 31 11:57:09 lzThinkpad kernel: mce: [Hardware Error]: Machine check events logged
Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Deferred error, no action required.
Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: CPU:0 (17:11:0) MC20_STATUS[-|-|MiscV|-|AddrV
Dec 31 11:57:09 lzThinkpad kernel: amdgpu 0000:05:00.0: [mmhub] VMC page fault (src_id:0 ring:158 vmid:1 pasid:32768, for process gnome-shell pid 1008 thread gnome-shel:cs0 pid 1022)
Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Error Addr: 0x00007ffcffffff00
Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: IPID: 0x0000002e00000000, Syndrome: 0x000000005b240205
Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Coherent Slave Error: Address violation.
Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Deferred error, no action required.
Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Error Addr: 0x00007ffcffffff00
Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: cache level: L3/GEN, mem/io: IO, mem-tx: IRD, part-proc: SRC (no timeout)
Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Coherent Slave Extended Error Code: 1
Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Error Addr: 0x00007ffcffffff00
Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Error Addr: 0x00007ffcffffff00
Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Error Addr: 0x00007ffcffffff00
Dec 31 11:57:09 lzThinkpad kernel: [Hardware Error]: Error Addr: 0x00007ffcffffff00
(more Error Addr lines omitted)