https://bugs.freedesktop.org/show_bug.cgi?id=109808
Bug ID: 109808
Summary: ROCm OpenCL segfaults on drm-next-5.1-wip
Product: DRI
Version: XOrg git
Hardware: Other
OS: All
Status: NEW
Severity: normal
Priority: medium
Component: DRM/AMDgpu
Assignee: dri-devel@lists.freedesktop.org
Reporter: bmilreu@gmail.com
ROCm clinfo segfaults, with this in dmesg:
mar 01 15:27:36 mjb kernel: kfd2kgd: init_user_pages: Failed to register MMU notifier: -19
Reverting:
  drm/amdgpu: use HMM callback to replace mmu notifier
  drm/amdgpu: replace get_user_pages with HMM mirror helpers
  drm/amdkfd: avoid HMM change cause circular lock
  drm/amdgpu: use HMM callback to replace mmu notifier
makes it work again, so something might be wrong with those commits.
--- Comment #1 from Philip Yang philip.yang@amd.com --- Error code -19 means ENODEV. Please check whether the running kernel enables the kernel config option CONFIG_ZONE_DEVICE (the kernel config can be read from /proc/config.gz).
init_user_pages returns -ENODEV if userptr support is not enabled. Userptr support depends on the kernel option CONFIG_HMM_MIRROR/CONFIG_HMM, which in turn depends on the kernel option CONFIG_ZONE_DEVICE.
CONFIG_ZONE_DEVICE and CONFIG_HMM are ON by default, but CONFIG_ZONE_DEVICE may end up unset if the kernel config file was carried over from an old kernel.
If userptr support is not enabled, then clinfo and KFD userptr support will return error -19.
Please correct the kernel config file by adding CONFIG_ZONE_DEVICE=y
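The check described above can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical, the option list is taken from this comment and applies to kernels of this era, and /proc/config.gz exists only when the kernel was built with CONFIG_IKCONFIG_PROC.

```python
import errno
import gzip
import os

# Options named in the comment above (assumed relevant only for kernels
# of this era; later kernels may name or group them differently).
REQUIRED = ("CONFIG_ZONE_DEVICE", "CONFIG_HMM", "CONFIG_HMM_MIRROR")


def describe_errno(code):
    """Map a negative kernel return value such as -19 to its symbolic name."""
    return errno.errorcode.get(abs(code), "unknown")


def enabled_options(config_text):
    """Return the set of options set to 'y' in kernel config text."""
    enabled = set()
    for line in config_text.splitlines():
        line = line.strip()
        # Disabled options appear as comments: "# CONFIG_FOO is not set"
        if not line or line.startswith("#"):
            continue
        name, sep, value = line.partition("=")
        if sep and value == "y":
            enabled.add(name)
    return enabled


def missing_options(config_text, required=REQUIRED):
    """List the required options that are not enabled, in the given order."""
    enabled = enabled_options(config_text)
    return [opt for opt in required if opt not in enabled]


if __name__ == "__main__":
    path = "/proc/config.gz"  # needs CONFIG_IKCONFIG_PROC
    print("-19 is", describe_errno(-19))
    if os.path.exists(path):
        with gzip.open(path, "rt") as fh:
            print("missing:", missing_options(fh.read()))
    else:
        print(path, "not found; inspect the config file used for the build")
```

An empty "missing" list suggests the config satisfies the dependencies listed here; any listed option needs to be enabled before rebuilding.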
bmilreu@gmail.com changed:
What       |Removed |Added
----------------------------------------------------------------------------
Resolution |---     |NOTABUG
Status     |NEW     |RESOLVED
--- Comment #2 from bmilreu@gmail.com --- Thanks for the answer, it was indeed a previous outdated config. Tested with CONFIG_ZONE_DEVICE=y and issue is gone, closing this.
--- Comment #3 from Michael Eagle xanto@egaming.ro --- Hi Philip, I was wondering: would it be possible either to make the message more descriptive, so that the user is informed about this, or to modify the kernel config so that the dependencies are satisfied automatically?
On Sun, Mar 3, 2019 at 10:36 PM bugzilla-daemon@freedesktop.org wrote:
> Comment #2 on bug 109808 from bmilreu@gmail.com
> Thanks for the answer, it was indeed a previous outdated config. Tested with CONFIG_ZONE_DEVICE=y and issue is gone, closing this.
--- Comment #4 from Philip Yang philip.yang@amd.com --- I will change the error message for this specific case to mention the missing kernel config option.
I cannot add "select ZONE_DEVICE" to the driver Kconfig file because that would create a circular dependency. An old or weird kernel config file may still leave HMM or ZONE_DEVICE disabled.
Thanks, Philip