Stefano Stabellini
2012-May-14 11:23 UTC
[PATCH] xen: mark local pages as FOREIGN in the m2p_override
When the frontend and the backend reside on the same domain, even if we
add pages to the m2p_override, these pages will never be returned by
mfn_to_pfn because the check "get_phys_to_machine(pfn) != mfn" will
always fail, so the pfn of the frontend will be returned instead
(resulting in a deadlock because the frontend pages are already locked).
However m2p_add_override can easily find out whether another pfn
corresponding to the mfn exists in the m2p, and can set the FOREIGN bit
in the p2m, making sure that mfn_to_pfn returns the pfn of the backend.
This allows the backend to perform direct_IO on these pages, but as a
side effect prevents the frontend from using get_user_pages_fast on
them while they are being shared with the backend.
Signed-off-by: Stefano Stabellini <stefano.stabellini@eu.citrix.com>
---
arch/x86/xen/p2m.c | 18 ++++++++++++++++++
1 files changed, 18 insertions(+), 0 deletions(-)
diff --git a/arch/x86/xen/p2m.c b/arch/x86/xen/p2m.c
index 7ece122..c62ae5c 100644
--- a/arch/x86/xen/p2m.c
+++ b/arch/x86/xen/p2m.c
@@ -687,6 +687,7 @@ int m2p_add_override(unsigned long mfn, struct page *page,
unsigned long uninitialized_var(address);
unsigned level;
pte_t *ptep = NULL;
+ int ret = 0;
pfn = page_to_pfn(page);
if (!PageHighMem(page)) {
@@ -722,6 +723,16 @@ int m2p_add_override(unsigned long mfn, struct page *page,
list_add(&page->lru, &m2p_overrides[mfn_hash(mfn)]);
spin_unlock_irqrestore(&m2p_override_lock, flags);
+ /* p2m(m2p(mfn)) == mfn: the mfn is already present somewhere in
+ * this domain. Set the FOREIGN_FRAME_BIT in the p2m for the other
+ * pfn so that the following mfn_to_pfn(mfn) calls will return the
+ * pfn from the m2p_override (the backend pfn) instead.
+ * As a side effect GUPF might not be safe on the frontend pages
+ * while they are being shared with the backend. */
+ ret = __get_user(pfn, &machine_to_phys_mapping[mfn]);
+ if (ret >= 0 && get_phys_to_machine(pfn) == mfn)
+ set_phys_to_machine(pfn, FOREIGN_FRAME(mfn));
+
return 0;
}
EXPORT_SYMBOL_GPL(m2p_add_override);
@@ -733,6 +744,7 @@ int m2p_remove_override(struct page *page, bool clear_pte)
unsigned long uninitialized_var(address);
unsigned level;
pte_t *ptep = NULL;
+ int ret = 0;
pfn = page_to_pfn(page);
mfn = get_phys_to_machine(pfn);
@@ -802,6 +814,12 @@ int m2p_remove_override(struct page *page, bool clear_pte)
} else
set_phys_to_machine(pfn, page->index);
+ mfn &= ~FOREIGN_FRAME_BIT;
+ ret = __get_user(pfn, &machine_to_phys_mapping[mfn]);
+ if (ret >= 0 && get_phys_to_machine(pfn) == FOREIGN_FRAME(mfn)
&&
+ m2p_find_override(mfn) == NULL)
+ set_phys_to_machine(pfn, mfn);
+
return 0;
}
EXPORT_SYMBOL_GPL(m2p_remove_override);
--
1.7.2.5
Konrad Rzeszutek Wilk
2012-May-23 14:20 UTC
Re: [PATCH] xen: mark local pages as FOREIGN in the m2p_override
On Mon, May 14, 2012 at 12:23:07PM +0100, Stefano Stabellini wrote:> When the frontend and the backend reside on the same domain, even if we > add pages to the m2p_override, these pages will never be returned by > mfn_to_pfn because the check "get_phys_to_machine(pfn) != mfn" will > always fail, so the pfn of the frontend will be returned instead > (resulting in a deadlock because the frontend pages are already locked). > > However m2p_add_override can easily find out whether another pfn > corresponding to the mfn exists in the m2p, and can set the FOREIGN bit > in the p2m, making sure that mfn_to_pfn returns the pfn of the backend. > > This allows the backend to perform direct_IO on these pages, but as a > side effect prevents the frontend from using get_user_pages_fast on > them while they are being shared with the backend. > > Signed-off-by: Stefano Stabellini <stefano.stabellini@eu.citrix.com> > --- > arch/x86/xen/p2m.c | 18 ++++++++++++++++++ > 1 files changed, 18 insertions(+), 0 deletions(-) > > diff --git a/arch/x86/xen/p2m.c b/arch/x86/xen/p2m.c > index 7ece122..c62ae5c 100644 > --- a/arch/x86/xen/p2m.c > +++ b/arch/x86/xen/p2m.c > @@ -687,6 +687,7 @@ int m2p_add_override(unsigned long mfn, struct page *page, > unsigned long uninitialized_var(address); > unsigned level; > pte_t *ptep = NULL; > + int ret = 0; > > pfn = page_to_pfn(page); > if (!PageHighMem(page)) { > @@ -722,6 +723,16 @@ int m2p_add_override(unsigned long mfn, struct page *page, > list_add(&page->lru, &m2p_overrides[mfn_hash(mfn)]); > spin_unlock_irqrestore(&m2p_override_lock, flags); > > + /* p2m(m2p(mfn)) == mfn: the mfn is already present somewhere in > + * this domain. Set the FOREIGN_FRAME_BIT in the p2m for the other<nods> In other words, the MFN is local. And you want it to be forced to be !local - foreign right.> + * pfn so that the following mfn_to_pfn(mfn) calls will return the.. the other pfn being. So we want to override p2m(m2p(mfn)) == mfn[frontend] so that it becomes p2m(m2p(mfn)) == mfn[backend]? (nods) What happens if multiple m2p_add_override are called on the same page? This would be possible if the xen-blkfront is setup shared for the same disk, right? Won''t we loose the old frontend PFN -> backend MFN information then as we would overwrite the old P2M relationship?> + * pfn from the m2p_override (the backend pfn) instead.Can you explain in the comment why we want to do that. I think I know, but I am not going to remember it in a month.> + * As a side effect GUPF might not be safe on the frontend pages > + * while they are being shared with the backend. */How is it not safe?> + ret = __get_user(pfn, &machine_to_phys_mapping[mfn]); > + if (ret >= 0 && get_phys_to_machine(pfn) == mfn)if (ret == 0) [get_user only provides -EFAULT or 0]> + set_phys_to_machine(pfn, FOREIGN_FRAME(mfn)); > + > return 0; > } > EXPORT_SYMBOL_GPL(m2p_add_override); > @@ -733,6 +744,7 @@ int m2p_remove_override(struct page *page, bool clear_pte) > unsigned long uninitialized_var(address); > unsigned level; > pte_t *ptep = NULL; > + int ret = 0; > > pfn = page_to_pfn(page); > mfn = get_phys_to_machine(pfn); > @@ -802,6 +814,12 @@ int m2p_remove_override(struct page *page, bool clear_pte) > } else > set_phys_to_machine(pfn, page->index); >You also need a comment here.> + mfn &= ~FOREIGN_FRAME_BIT; > + ret = __get_user(pfn, &machine_to_phys_mapping[mfn]); > + if (ret >= 0 && get_phys_to_machine(pfn) == FOREIGN_FRAME(mfn) &&ret == 0> + m2p_find_override(mfn) == NULL) > + set_phys_to_machine(pfn, mfn); > + > return 0; > } > EXPORT_SYMBOL_GPL(m2p_remove_override); > -- > 1.7.2.5
Stefano Stabellini
2012-May-23 17:37 UTC
Re: [PATCH] xen: mark local pages as FOREIGN in the m2p_override
On Wed, 23 May 2012, Konrad Rzeszutek Wilk wrote:> On Mon, May 14, 2012 at 12:23:07PM +0100, Stefano Stabellini wrote: > > When the frontend and the backend reside on the same domain, even if we > > add pages to the m2p_override, these pages will never be returned by > > mfn_to_pfn because the check "get_phys_to_machine(pfn) != mfn" will > > always fail, so the pfn of the frontend will be returned instead > > (resulting in a deadlock because the frontend pages are already locked). > > > > However m2p_add_override can easily find out whether another pfn > > corresponding to the mfn exists in the m2p, and can set the FOREIGN bit > > in the p2m, making sure that mfn_to_pfn returns the pfn of the backend. > > > > This allows the backend to perform direct_IO on these pages, but as a > > side effect prevents the frontend from using get_user_pages_fast on > > them while they are being shared with the backend. > > > > Signed-off-by: Stefano Stabellini <stefano.stabellini@eu.citrix.com> > > --- > > arch/x86/xen/p2m.c | 18 ++++++++++++++++++ > > 1 files changed, 18 insertions(+), 0 deletions(-) > > > > diff --git a/arch/x86/xen/p2m.c b/arch/x86/xen/p2m.c > > index 7ece122..c62ae5c 100644 > > --- a/arch/x86/xen/p2m.c > > +++ b/arch/x86/xen/p2m.c > > @@ -687,6 +687,7 @@ int m2p_add_override(unsigned long mfn, struct page *page, > > unsigned long uninitialized_var(address); > > unsigned level; > > pte_t *ptep = NULL; > > + int ret = 0; > > > > pfn = page_to_pfn(page); > > if (!PageHighMem(page)) { > > @@ -722,6 +723,16 @@ int m2p_add_override(unsigned long mfn, struct page *page, > > list_add(&page->lru, &m2p_overrides[mfn_hash(mfn)]); > > spin_unlock_irqrestore(&m2p_override_lock, flags); > > > > + /* p2m(m2p(mfn)) == mfn: the mfn is already present somewhere in > > + * this domain. Set the FOREIGN_FRAME_BIT in the p2m for the other > > <nods> In other words, the MFN is local. And you want it to be forced > to be !local - foreign right.Yes> > + * pfn so that the following mfn_to_pfn(mfn) calls will return the > > .. the other pfn being. So we want to override p2m(m2p(mfn)) == mfn[frontend] > so that it becomes p2m(m2p(mfn)) == mfn[backend]? (nods)Exactly> What happens if multiple m2p_add_override are called on the same page? > This would be possible if the xen-blkfront is setup shared for the same > disk, right?It is possible even with a single disk, I can see the frontend sharing three times the same page, before the first request has completed, when mounting an ext4 partition. But it is completely safe: m2p_add_override is going to set the page as FOREIGN the first time and m2p_remove_override is going to remove the bit only when the last page is unshared (it does so by checking m2p_find_override(mfn) == NULL).> Won''t we loose the old frontend PFN -> backend MFN information then as > we would overwrite the old P2M relationship?No, we are just changing it to be: old frontend PFN -> (backend MFN | FOREIGN_FRAME_BIT)> > + * pfn from the m2p_override (the backend pfn) instead. > > Can you explain in the comment why we want to do that. I think > I know, but I am not going to remember it in a month.Good point. The reason is that the pages that have been shared by xen-blkfront are locked (lock_page, called by do_read_cache_page). When the userspace backend tries to use them with direct_IO, mfn_to_pfn returns the pfn of the frontend, so do_blockdev_direct_IO is going to try to lock the same pages again. I''ll add this comment to the commit message.> > + * As a side effect GUPF might not be safe on the frontend pages > > + * while they are being shared with the backend. */ > > How is it not safe?Because get_user_pages_fast ends up calling mfn_to_pfn that would now return the backend pfn rather than the frontend pfn.> > + ret = __get_user(pfn, &machine_to_phys_mapping[mfn]); > > + if (ret >= 0 && get_phys_to_machine(pfn) == mfn) > > if (ret == 0) > > [get_user only provides -EFAULT or 0]OK, I''ll change the check.> > + set_phys_to_machine(pfn, FOREIGN_FRAME(mfn)); > > + > > return 0; > > } > > EXPORT_SYMBOL_GPL(m2p_add_override); > > @@ -733,6 +744,7 @@ int m2p_remove_override(struct page *page, bool clear_pte) > > unsigned long uninitialized_var(address); > > unsigned level; > > pte_t *ptep = NULL; > > + int ret = 0; > > > > pfn = page_to_pfn(page); > > mfn = get_phys_to_machine(pfn); > > @@ -802,6 +814,12 @@ int m2p_remove_override(struct page *page, bool clear_pte) > > } else > > set_phys_to_machine(pfn, page->index); > > > > You also need a comment here.OK, I''ll add one.> > + mfn &= ~FOREIGN_FRAME_BIT; > > + ret = __get_user(pfn, &machine_to_phys_mapping[mfn]); > > + if (ret >= 0 && get_phys_to_machine(pfn) == FOREIGN_FRAME(mfn) && > > ret == 0OK> > + m2p_find_override(mfn) == NULL) > > + set_phys_to_machine(pfn, mfn); > > + > > return 0; > > } > > EXPORT_SYMBOL_GPL(m2p_remove_override); > > -- > > 1.7.2.5 >