https://bugs.freedesktop.org/show_bug.cgi?id=110886
--- Comment #19 from Andrey Grodzovsky andrey.grodzovsky@amd.com ---
(In reply to Kai-Heng Feng from comment #9)
(In reply to Andrey Grodzovsky from comment #8)
(In reply to Kai-Heng Feng from comment #6)
Created attachment 145044 [details] failed log when iommu is disabled.
What was the failure with IOMMU disabled?
Blanked screen. Graphics no longer works.
Is it the same as with IOMMU enabled?
Yes.
In the log I only see PSP errors on resume. Can you confirm that this is the only failure/error you observed in the log in that use case?
Yes. I haven't seen "[drm:drm_atomic_helper_wait_for_flip_done [drm_kms_helper]] *ERROR* [CRTC:57:crtc-0] flip_done timed out" for a while.
Now it always shows PSP fail.
Can you please provide your FW versions by cat /sys/kernel/debug/dri/0/amdgpu_firmware_info
VCE feature version: 0, firmware version: 0x00000000
UVD feature version: 0, firmware version: 0x00000000
MC feature version: 0, firmware version: 0x00000000
ME feature version: 40, firmware version: 0x00000099
PFP feature version: 40, firmware version: 0x000000ae
CE feature version: 40, firmware version: 0x0000004d
RLC feature version: 1, firmware version: 0x00000213
RLC SRLC feature version: 1, firmware version: 0x00000001
RLC SRLG feature version: 1, firmware version: 0x00000001
RLC SRLS feature version: 1, firmware version: 0x00000001
MEC feature version: 40, firmware version: 0x0000018b
MEC2 feature version: 40, firmware version: 0x0000018b
SOS feature version: 0, firmware version: 0x00000000
ASD feature version: 0, firmware version: 0x001ad4d4
TA XGMI feature version: 0, firmware version: 0x00000000
TA RAS feature version: 0, firmware version: 0x00000000
SMC feature version: 0, firmware version: 0x00001e4f
SDMA0 feature version: 41, firmware version: 0x000000a9
VCN feature version: 0, firmware version: 0x0110901c
DMCU feature version: 0, firmware version: 0x00000000
VBIOS version: SWBRT32481.001
Can you please confirm that the issue happens regardless of whether graphics is enabled? Boot the system in console mode and verify that you still observe the problem.
(In reply to Kai-Heng Feng from comment #12)
Now it always shows PSP fail.
I've dug up more info about this issue. It always times out in psp_cmd_submit_buf(). Particularly, this code section:
	while (*((unsigned int *)psp->fence_buf) != index) {
		if (--timeout == 0)
			break;
		msleep(1);
	}
The value in psp->fence_buf is stuck at 406 while index is 407, so the loop eventually times out. This _always_ happens on the 27th S3 cycle, and the whole system freezes on the 28th S3 attempt.
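For anyone following along, the wait above can be modelled in userspace roughly as follows. This is only a sketch and not amdgpu code: the names wait_for_fence and struct fence_wait_result are hypothetical, and the 1 ms sleep is left out so it runs instantly. It keeps the same loop shape but also records the last fence value seen, which is the detail that showed 406 versus 407 here.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical result type for this sketch; not part of the driver. */
struct fence_wait_result {
	int signaled;        /* 1 if the fence reached the expected index */
	uint32_t last_seen;  /* fence value read on the final poll */
	int polls;           /* polls that found a stale fence before giving up */
};

/*
 * Poll *fence_buf until it equals index or the retry budget runs out.
 * Same shape as the quoted driver loop, minus the msleep(1) per retry.
 */
static struct fence_wait_result
wait_for_fence(const volatile uint32_t *fence_buf, uint32_t index, int timeout)
{
	struct fence_wait_result r = { 0, 0, 0 };

	/* A non-positive budget would never hit the "== 0" exit below. */
	assert(timeout > 0);

	while ((r.last_seen = *fence_buf) != index) {
		if (--timeout == 0)
			break;
		r.polls++;
		/* the driver sleeps here: msleep(1) */
	}
	r.signaled = (r.last_seen == index);
	return r;
}
```

Called with a fence stuck at 406 and an expected index of 407, this returns signaled == 0 and last_seen == 406, matching the values observed above. Logging both numbers on timeout makes it easier to tell a fence that never advanced from one that was written with an unexpected value.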
Does it also happen when there is no acceleration in the system? I mean, if you do S3 cycles from console mode?