https://bugzilla.kernel.org/show_bug.cgi?id=214859
--- Comment #6 from James Zhu (jamesz@amd.com) --- Created attachment 299437 --> https://bugzilla.kernel.org/attachment.cgi?id=299437&action=edit analysis for this issue
Linux 5.14.15 + afd1818 can fix the issue.
Linux 5.15rc7 re-apply "init iommu after amdkfd device init" and "move iommu_resume before ip init/resume" which overwrote afd1818 caused the issue again.
714d9e4 drm/amdgpu: init iommu after amdkfd device init
f02abeb drm/amdgpu: move iommu_resume before ip init/resume
afd1818 drm/amdkfd: fix boot failure when iommu is disabled in Picasso.
286826d drm/amdgpu: init iommu after amdkfd device init
9cec53c drm/amdgpu: move iommu_resume before ip init/resume