Hi
Obviously if we have a dead gpu, we need to break out of this loop. But detecting a dead gpu (and returning an appropriate error like EIO) is the kernel's job.
In my case the GPU isn't really dead. It works again after a few ioctls are skipped. I understand that this is a driver bug in any case, but I think that working around such bugs with proper logging (absent in this patch) is much better than a lockup.
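
To make it concrete, here is a rough sketch of what I mean, not the patch itself. It assumes the loop in question retries the ioctl while it fails with EINTR or EAGAIN. The names bounded_ioctl, stuck_ioctl and MAX_IOCTL_RETRIES are made up for illustration, and the retry limit is arbitrary.

```c
#include <errno.h>
#include <stdio.h>
#include <sys/ioctl.h>

/* Hypothetical retry cap; the value is arbitrary, not taken from any patch. */
#define MAX_IOCTL_RETRIES 16

typedef int (*ioctl_fn)(int fd, unsigned long request, void *arg);

/* Thin wrapper so the real ioctl(2) fits the ioctl_fn signature. */
int real_ioctl(int fd, unsigned long request, void *arg)
{
    return ioctl(fd, request, arg);
}

/* Stand-in for a buggy driver that never stops returning EAGAIN.
 * Only here so the sketch can be exercised without real hardware. */
int stuck_calls;
int stuck_ioctl(int fd, unsigned long request, void *arg)
{
    (void)fd; (void)request; (void)arg;
    stuck_calls++;
    errno = EAGAIN;
    return -1;
}

/* Retry on EINTR/EAGAIN (assumed to be what the loop does), but give up
 * after MAX_IOCTL_RETRIES attempts, log it, and report EIO to the caller. */
int bounded_ioctl(ioctl_fn do_ioctl, int fd, unsigned long request, void *arg)
{
    int ret;
    int tries = 0;

    do {
        ret = do_ioctl(fd, request, arg);
    } while (ret == -1 && (errno == EINTR || errno == EAGAIN) &&
             ++tries < MAX_IOCTL_RETRIES);

    if (ret == -1 && (errno == EINTR || errno == EAGAIN)) {
        fprintf(stderr, "ioctl 0x%lx: giving up after %d attempts (errno %d)\n",
                request, tries, errno);
        errno = EIO; /* set after logging, since fprintf may change errno */
    }
    return ret;
}
```

With real_ioctl passed in, other errors (EBADF and so on) go straight back to the caller as before. Only the endless EINTR/EAGAIN case is cut short, and it leaves a message behind so the driver bug stays visible.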
Regards, Anton