Fence wait in mmu_interval_notifier_ops::invalidate - dri-devel - freedesktop.org experimental mailing list

Fence wait in mmu_interval_notifier_ops::invalidate

Thomas Hellström (Intel)

9 Dec 2020

4:36 p.m.

Jason, Christian

In most implementations of the callback mentioned in the subject there's a fence wait. What exactly is it needed for?

Thanks,

Thomas


Jason Gunthorpe

9 Dec

4:37 p.m.

On Wed, Dec 09, 2020 at 05:36:16PM +0100, Thomas Hellström (Intel) wrote:

Invalidate must stop DMA before returning, so presumably drivers using a dma fence are relying on a dma fence mechanism to stop DMA.

Jason
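The requirement above can be illustrated with a small userspace model. This is only a sketch, not kernel code: `model_fence`, `model_range`, `model_dma_worker` and `fence_model_invalidate` are hypothetical names, a pthread condition variable stands in for a dma fence, and a thread stands in for the device. The point it shows is the ordering: the invalidate step blocks on the fence and only then tears down the mapping, so no DMA can run against an invalidated range.

```c
#include <assert.h>
#include <pthread.h>
#include <stdbool.h>
#include <unistd.h>

/* Hypothetical stand-in for a dma fence: signaled once the "DMA" is done. */
struct model_fence {
    pthread_mutex_t lock;
    pthread_cond_t cond;
    bool signaled;
};

/* Hypothetical model of one userptr range known to the "device". */
struct model_range {
    struct model_fence fence;
    bool mapping_valid;        /* device may still use the mapping */
    bool dma_after_invalidate; /* set if DMA ran on an invalid mapping */
    int words_written;
};

static void model_fence_signal(struct model_fence *f)
{
    pthread_mutex_lock(&f->lock);
    f->signaled = true;
    pthread_cond_broadcast(&f->cond);
    pthread_mutex_unlock(&f->lock);
}

static void model_fence_wait(struct model_fence *f)
{
    pthread_mutex_lock(&f->lock);
    while (!f->signaled)
        pthread_cond_wait(&f->cond, &f->lock);
    pthread_mutex_unlock(&f->lock);
}

/* The "device": writes four words, then signals the fence. */
static void *model_dma_worker(void *arg)
{
    struct model_range *r = arg;

    for (int i = 0; i < 4; i++) {
        usleep(1000); /* pretend the transfer takes time */
        if (!r->mapping_valid)
            r->dma_after_invalidate = true;
        r->words_written++;
    }
    model_fence_signal(&r->fence);
    return NULL;
}

/* Model of the invalidate callback: stop DMA first, then drop the mapping. */
void fence_model_invalidate(struct model_range *r)
{
    model_fence_wait(&r->fence); /* the fence wait discussed in this thread */
    r->mapping_valid = false;
}

/* Returns 0 if all DMA finished before the mapping was invalidated. */
int run_fence_model(void)
{
    struct model_range r = {
        .fence = { PTHREAD_MUTEX_INITIALIZER, PTHREAD_COND_INITIALIZER, false },
        .mapping_valid = true,
    };
    pthread_t dma;

    if (pthread_create(&dma, NULL, model_dma_worker, &r) != 0)
        return -1;
    fence_model_invalidate(&r);
    pthread_join(dma, NULL);
    assert(!r.mapping_valid);
    return (r.words_written == 4 && !r.dma_after_invalidate) ? 0 : 1;
}
```

Calling `run_fence_model()` from a `main()` and checking for a zero return exercises the model. If the fence wait is removed from `fence_model_invalidate()`, the worker can observe `mapping_valid == false` while it is still writing, which is the situation the callback has to prevent.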


Thomas Hellström (Intel)

4:46 p.m.

On 12/9/20 5:37 PM, Jason Gunthorpe wrote:

Yes, so far I follow, but what's the reason drivers need to stop DMA?

Is it for invalidation before breaking COW after fork or something related?

Thanks,

Thomas


Christian König

10 Dec

10:53 a.m.

On 09.12.20 at 17:46, Thomas Hellström (Intel) wrote:

> Yes, so far I follow, but what's the reason drivers need to stop DMA?

Well, in general an invalidation means that the specified part of the page tables is updated, either with new addresses or new access flags.

In both cases you need to stop the DMA because you could otherwise work with stale data, e.g. read/write with the wrong addresses or write to a read-only region.

> Is it for invalidation before breaking COW after fork or something related?

This is just one of many use cases that can invalidate a range. There are many more, both from the kernel and from userspace.

Just imagine that userspace first mmap()s some anonymous memory r/w, starts a DMA to it and, while the DMA is ongoing, does a read-only mmap() of libc to the same location.

Since most hardware doesn't have recoverable page faults, guess what would happen if we don't wait for the DMA to finish? That would be a security hole you can push an elephant through :)

Cheers, Christian.
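The CPU side of this scenario can be reproduced in userspace. The sketch below rests on some assumptions: a temporary file stands in for libc, the function name `remap_over_live_buffer` is made up for illustration, and no device is involved. It maps an anonymous page read/write, then maps the file read-only at the same address with `MAP_FIXED`. After the second `mmap()` the same virtual address is backed by different pages with different protection, which is the change a device with a stale mapping would miss.

```c
#define _DEFAULT_SOURCE
#include <assert.h>
#include <stdlib.h>
#include <sys/mman.h>
#include <unistd.h>

/*
 * Maps an anonymous r/w page, then replaces it with a read-only file
 * mapping at the same address. Reports the first byte seen at that
 * address before and after the replacement. Returns 0 on success.
 */
int remap_over_live_buffer(char *byte_before, char *byte_after)
{
    long page = sysconf(_SC_PAGESIZE);
    char path[] = "/tmp/remap_demo_XXXXXX"; /* stand-in for libc */
    int fd;
    char *buf, *again;

    assert(page > 0);

    fd = mkstemp(path);
    if (fd < 0)
        return -1;
    unlink(path);
    if (ftruncate(fd, page) != 0 || pwrite(fd, "F", 1, 0) != 1) {
        close(fd);
        return -1;
    }

    /* Step 1: anonymous r/w memory, the buffer a DMA would target. */
    buf = mmap(NULL, page, PROT_READ | PROT_WRITE,
               MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);
    if (buf == MAP_FAILED) {
        close(fd);
        return -1;
    }
    buf[0] = 'A';
    *byte_before = buf[0];

    /*
     * Step 2: read-only file mapping at the same address. MAP_FIXED
     * discards the old mapping for this range, so the old pages are
     * no longer reachable through this address.
     */
    again = mmap(buf, page, PROT_READ, MAP_PRIVATE | MAP_FIXED, fd, 0);
    close(fd);
    if (again != buf) {
        munmap(buf, page);
        return -1;
    }

    *byte_after = buf[0]; /* now comes from the file, not the old page */
    munmap(buf, page);
    return 0;
}
```

The byte read before the remap is the `'A'` written by the program and the byte read afterwards is the `'F'` from the file. A device that kept using the old translation would still write to the old page, or, if it picked up only the new address, would write to a mapping that is supposed to be read-only.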


Thomas Hellström (Intel)

11 Dec

7:50 a.m.

Hi, Christian

Thanks for the reply.

On 12/10/20 11:53 AM, Christian König wrote:

Yes. That's clear. I'm just trying to understand the complete implications of doing that.

My understanding of this particular case is that hardware would continue to DMA to orphaned pages, which stay pinned until the driver is done with DMA, unless the hardware somehow picked up, in flight, the new PTE addresses pointing to libc but not the new protection?

Thanks,

Thomas


Christian König

8:57 a.m.

On 11.12.20 at 08:50, Thomas Hellström (Intel) wrote:

Exactly that is not guaranteed under all circumstances. Especially since HMM tries to avoid grabbing a reference to the underlying pages. And it depends when the destination addresses of the DMA are read and when the access flags are evaluated.

But even when it causes no security problem, the requirement we have to fulfill here is that the DMA is coherent. In other words, we either have to delay updates to the page tables until the DMA operation is completed, or apply both address and access flag changes in a way that the DMA operation immediately sees them as well.

Regards, Christian.


Thomas Hellström (Intel)

9:37 a.m.

On 12/11/20 9:57 AM, Christian König wrote:


Got it.

Thanks! Thomas


Jason Gunthorpe

12:46 p.m.

On Fri, Dec 11, 2020 at 08:50:53AM +0100, Thomas Hellström (Intel) wrote:

mmu notifier replaces pinning as the locking mechanism. Drivers using mmu notifier should not be taking pins.

Keep in mind this was all built for HW with real shadow page tables that can do fine grained manipulation.

The GPU version of this, which instead manipulates a command queue, is a big aberration from what was intended.

Jason
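How a notifier can take the place of a pin can be sketched with a sequence counter. The kernel's interval notifier API provides `mmu_interval_read_begin()`, `mmu_interval_read_retry()` and `mmu_interval_set_seq()` for this purpose. The code below does not use that API. It is a simplified userspace model with hypothetical names (`model_notifier`, `seq_model_invalidate`, `seq_model_map_page`), and it assumes a single mutex playing the role of the driver's notifier lock. It shows the collect-then-retry idea: a mapping is committed to the device only if no invalidation happened since the lookup started.

```c
#include <assert.h>
#include <pthread.h>
#include <stdbool.h>

/* Hypothetical model of one notifier-protected range. */
struct model_notifier {
    pthread_mutex_t lock;     /* plays the role of the driver's notifier lock */
    unsigned long seq;        /* bumped by every invalidation */
    unsigned long device_pte; /* modelled device page table entry, 0 = unmapped */
};

/* Sample the sequence number before looking up pages. */
unsigned long seq_model_read_begin(struct model_notifier *n)
{
    unsigned long seq;

    pthread_mutex_lock(&n->lock);
    seq = n->seq;
    pthread_mutex_unlock(&n->lock);
    return seq;
}

/* Caller holds n->lock. True if an invalidation ran since read_begin. */
bool seq_model_read_retry(struct model_notifier *n, unsigned long seq)
{
    return n->seq != seq;
}

/* Model of the invalidate callback: bump the sequence, clear the mapping. */
void seq_model_invalidate(struct model_notifier *n)
{
    pthread_mutex_lock(&n->lock);
    n->seq++;
    n->device_pte = 0;
    pthread_mutex_unlock(&n->lock);
}

/*
 * Install a device mapping for cpu_addr without pinning anything.
 * If race_once is set, an invalidation is injected between lookup and
 * commit on the first attempt. Returns the number of attempts needed.
 */
int seq_model_map_page(struct model_notifier *n, unsigned long cpu_addr,
                       bool race_once)
{
    int attempts = 0;

    for (;;) {
        unsigned long seq, looked_up;

        attempts++;
        seq = seq_model_read_begin(n);

        /* Pretend page lookup; the low bit marks the entry valid. */
        looked_up = cpu_addr | 1UL;
        assert(looked_up != 0);

        if (race_once && attempts == 1)
            seq_model_invalidate(n); /* the lookup result is now stale */

        pthread_mutex_lock(&n->lock);
        if (seq_model_read_retry(n, seq)) {
            pthread_mutex_unlock(&n->lock);
            continue; /* discard the stale result and look up again */
        }
        n->device_pte = looked_up; /* commit under the lock */
        pthread_mutex_unlock(&n->lock);
        return attempts;
    }
}
```

In this model nothing holds a reference on the page, so correctness depends on the invalidation side clearing the device mapping before it returns. For hardware that cannot drop a mapping mid-operation, that is where the fence wait comes in.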


Thomas Hellström (Intel)

13 Dec

3:09 p.m.

On 12/11/20 1:46 PM, Jason Gunthorpe wrote:

OK yes, that makes sense and in that context the fence wait is easier to understand. Looks like for example the radeon driver is using the notifier + get_user_pages() but there it looks like it's used to avoid having get_user_pages() clash with invalidation.

/Thomas


Daniel Vetter

14 Dec

9:52 a.m.

On Sun, Dec 13, 2020 at 04:09:25PM +0100, Thomas Hellström (Intel) wrote:

I think the radeon userptr implementation is bad enough that Christian wants to outright remove it. At least he keeps talking about doing that.

So maybe not a good example to look at :-) -Daniel

-- Daniel Vetter Software Engineer, Intel Corporation http://blog.ffwll.ch


Christian König

10:21 a.m.

On 14.12.20 at 10:52, Daniel Vetter wrote:

Oh, yes :) Key point is having time for that.

Christian.
