https://bugzilla.kernel.org/show_bug.cgi?id=204241
--- Comment #16 from me@cschwarz.com --- Can confirm the patch 'drm/amdgpu: Move IB pool init after ucode bo creation' fixed the issue for me (96h and counting, failure normally within 24h, with ~2 suspend/resume cycles per day).