Hi Keir,

Here is the second version of Hybrid extension support in Xen. The patch
mainly does the following:

1. Enable SMP support through VCPU_OP in arch_set_info_guest().
2. A new hypercall in hvm_ops to enable hybrid.
3. Map IRQs to VIRQs when delivering them to the guest.
4. Inject a guest-defined vector to deliver event notifications.
5. Use CPUID leaf 0x40000002 to expose the hybrid feature.
6. Reserve some space at the end of the MMIO hole for grant table use.

Please review. Thanks!

--
regards
Yang, Sheng

 tools/firmware/hvmloader/config.h    |    6 ++-
 tools/firmware/hvmloader/hvmloader.c |    6 +++
 xen/arch/x86/domain.c                |    9 ++++
 xen/arch/x86/hvm/hvm.c               |   62 +++++++++++++++++++++++++++++---
 xen/arch/x86/hvm/irq.c               |   66 ++++++++++++++++++++++++++-------
 xen/arch/x86/hvm/vmx/intr.c          |    3 +
 xen/arch/x86/traps.c                 |    9 ++++
 xen/include/asm-x86/hvm/hvm.h        |    4 +-
 xen/include/asm-x86/hvm/irq.h        |    4 +-
 xen/include/public/arch-x86/cpuid.h  |    7 +++
 xen/include/public/hvm/hvm_op.h      |    6 +++
 xen/include/public/xen.h             |    7 +++
 xen/include/xen/sched.h              |    9 ++++
 13 files changed, 175 insertions(+), 23 deletions(-)
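For item 5, a guest would typically locate the Xen CPUID leaves by scanning for the hypervisor signature; the sketch below shows that probe only. It is illustrative and not part of the patch: the signature scan is the standard Xen convention, while the exact output of leaf 0x40000002 for the hybrid feature is defined by this series and is not assumed here.

    /* Illustrative guest-side probe for the Xen CPUID leaves (not from the
     * patch).  Once the base is known, leaf base + 2 (0x40000002 for the
     * first range) would be consulted for the hybrid feature bits that this
     * series proposes. */
    #include <stdint.h>
    #include <string.h>

    static void cpuid(uint32_t leaf, uint32_t *eax, uint32_t *ebx,
                      uint32_t *ecx, uint32_t *edx)
    {
        asm volatile ( "cpuid"
                       : "=a" (*eax), "=b" (*ebx), "=c" (*ecx), "=d" (*edx)
                       : "0" (leaf) );
    }

    static uint32_t xen_cpuid_base(void)
    {
        uint32_t base, eax, signature[3];

        for ( base = 0x40000000; base < 0x40010000; base += 0x100 )
        {
            cpuid(base, &eax, &signature[0], &signature[1], &signature[2]);
            /* Signature "XenVMMXenVMM" is returned in EBX:ECX:EDX. */
            if ( !memcmp("XenVMMXenVMM", signature, 12) && eax >= base + 2 )
                return base;
        }
        return 0;   /* not running on Xen */
    }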
Ian Campbell
2010-Feb-02 11:22 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
On Tue, 2010-02-02 at 08:16 +0000, Sheng Yang wrote:
>
> +/* Reserve 128KB for grant table */
> +#define GNTTAB_MEMBASE            0xfbfe0000
> +#define GNTTAB_MEMSIZE            0x20000

Why is this necessary? Isn't the grant table contained within one of the
BARs on the virtual PCI device? What needs grant tables prior to the
kernel finding the PCI device, which necessitates hardcoding these
addresses in both guest and hypervisor?

Ian.
Ian Campbell
2010-Feb-02 11:26 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
On Tue, 2010-02-02 at 08:16 +0000, Sheng Yang wrote:
>
> +static hvm_hypercall_t *hvm_hypercall_hybrid64_table[NR_hypercalls] = {
> +    [ __HYPERVISOR_memory_op ] = (hvm_hypercall_t *)hvm_memory_op,
> +    [ __HYPERVISOR_grant_table_op ] = (hvm_hypercall_t *)hvm_grant_table_op,
> +    HYPERCALL(xen_version),
> +    HYPERCALL(console_io),
> +    HYPERCALL(vcpu_op),
> +    HYPERCALL(sched_op),
> +    HYPERCALL(event_channel_op),
> +    HYPERCALL(hvm_op),
> +};

Why not just expand the existing hvm hypercall table to incorporate these
new hypercalls?

In fact, why is hybrid mode considered a separate mode by the hypervisor
at all? Shouldn't it just be an extension to regular HVM mode which
guests may choose to use? This seems like it would eliminate a bunch of
the random conditionals.

Have you verified that all of the operations you are making available
are safe to be called from (hybrid) HVM mode?

Ian.
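To make the suggestion concrete, a rough sketch of folding the quoted entries into the existing 64-bit HVM hypercall table rather than keeping a parallel one. Entry names are taken from the quoted hunk; the real table in xen/arch/x86/hvm/hvm.c already carries some of these entries, so this is illustrative rather than an exact diff.

    /* Sketch only: extend the existing HVM hypercall table instead of
     * introducing a separate "hybrid" table.  A guest that never issues
     * these hypercalls is unaffected by their presence here. */
    static hvm_hypercall_t *hvm_hypercall64_table[NR_hypercalls] = {
        [ __HYPERVISOR_memory_op ]      = (hvm_hypercall_t *)hvm_memory_op,
        [ __HYPERVISOR_grant_table_op ] = (hvm_hypercall_t *)hvm_grant_table_op,
        HYPERCALL(xen_version),
        HYPERCALL(console_io),       /* additional entry for PV-capable guests */
        HYPERCALL(vcpu_op),          /* VCPUOP_initialise / VCPUOP_up etc. */
        HYPERCALL(sched_op),
        HYPERCALL(event_channel_op),
        HYPERCALL(hvm_op),
    };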
Paul Durrant
2010-Feb-02 11:32 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
Sheng Yang wrote:
>
> 1. Enable SMP support through VCPU_OP in arch_set_info_guest().
> 2. A new hypercall in hvm_ops to enable hybrid.
> 3. Map IRQs to VIRQs when delivering them to the guest.
> 4. Inject a guest-defined vector to deliver event notifications.
> 5. Use CPUID leaf 0x40000002 to expose the hybrid feature.
> 6. Reserve some space at the end of the MMIO hole for grant table use.
>
> Please review. Thanks!
>

Why a complete new hypercall table; could you not just use the existing
HVM table and add the extra hypercalls in? Doing it this way would negate
the need for the concept of a 'hybrid' HVM guest, would it not? HVM guests
that were aware of the hybrid extensions would use them, those that were
unaware would not.

Also, why the extra E820 space for the grant table? You don't need the
table that early, so could you not just use fake PCI BAR space or somesuch
and do the XENMEM_add_to_physmap hypercall to add XENMAPSPACE_grant_table
later on?

  Paul

--
Paul Durrant, Software Engineer
Citrix Systems (R&D) Ltd.
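For context, the XENMEM_add_to_physmap route mentioned here is what existing PV-on-HVM drivers already use to place the grant table; a minimal guest-side sketch follows. gnttab_base_pfn is a placeholder for whatever guest-physical space the kernel chooses (e.g. part of a PCI BAR); only the hypercall interface itself is established.

    /* Illustrative guest-side mapping of grant-table frames via
     * XENMEM_add_to_physmap, instead of an address hardcoded in both the
     * guest and the hypervisor. */
    static int map_grant_table_frames(unsigned long gnttab_base_pfn,
                                      unsigned int nr_frames)
    {
        struct xen_add_to_physmap xatp;
        unsigned int i;
        int rc = 0;

        for ( i = 0; i < nr_frames; i++ )
        {
            xatp.domid = DOMID_SELF;
            xatp.space = XENMAPSPACE_grant_table;
            xatp.idx   = i;                    /* grant-table frame number */
            xatp.gpfn  = gnttab_base_pfn + i;  /* where the guest wants it */
            rc = HYPERVISOR_memory_op(XENMEM_add_to_physmap, &xatp);
            if ( rc )
                break;
        }
        return rc;
    }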
Sheng Yang
2010-Feb-02 12:54 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
On Tuesday 02 February 2010 19:22:21 Ian Campbell wrote:
> On Tue, 2010-02-02 at 08:16 +0000, Sheng Yang wrote:
> > +/* Reserve 128KB for grant table */
> > +#define GNTTAB_MEMBASE            0xfbfe0000
> > +#define GNTTAB_MEMSIZE            0x20000
>
> Why is this necessary? Isn't the grant table contained within one of the
> BARs on the virtual PCI device? What needs grant tables prior to the
> kernel finding the PCI device, which necessitates hardcoding these
> addresses in both guest and hypervisor?
>
Thanks for such quick and detailed comments. :)

This one is deliberate, because we don't want to depend on QEmu. As you
see, we now have PV drivers; QEmu is just one way to provide a device
model now. We think that still requiring QEmu is somewhat strange. So this
reserved region is there to drop the dependence on QEmu for providing PV
drivers (PV drivers depending on QEmu is still strange, right?).

--
regards
Yang, Sheng
Sheng Yang
2010-Feb-02 13:06 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
On Tuesday 02 February 2010 19:26:55 Ian Campbell wrote:
> On Tue, 2010-02-02 at 08:16 +0000, Sheng Yang wrote:
> > +static hvm_hypercall_t *hvm_hypercall_hybrid64_table[NR_hypercalls] = {
> > +    [ __HYPERVISOR_memory_op ] = (hvm_hypercall_t *)hvm_memory_op,
> > +    [ __HYPERVISOR_grant_table_op ] = (hvm_hypercall_t *)hvm_grant_table_op,
> > +    HYPERCALL(xen_version),
> > +    HYPERCALL(console_io),
> > +    HYPERCALL(vcpu_op),
> > +    HYPERCALL(sched_op),
> > +    HYPERCALL(event_channel_op),
> > +    HYPERCALL(hvm_op),
> > +};
>
> Why not just expand the existing hvm hypercall table to incorporate these
> new hypercalls?

I was just afraid that a normal HVM guest calling these hypercalls would
result in some chaos, so I added a limitation (hybrid only) here. (I admit
it doesn't much improve the security against a malicious guest...)

> In fact, why is hybrid mode considered a separate mode by the hypervisor
> at all? Shouldn't it just be an extension to regular HVM mode which
> guests may choose to use? This seems like it would eliminate a bunch of
> the random conditionals.

There is still something different from a normal HVM guest. For example,
to use the PV timer, we have to clear the TSC offset in the HVM domain;
and for event delivery, we use predefined VIRQs rather than the emulated
IOAPIC/APIC. These code paths are exclusive, so we need them wrapped with
a flag (which we called "hybrid"). The word "mode" here may be inaccurate;
an "extension" would be more proper. I will change the phrasing next time.

> Have you verified that all of the operations you are making available
> are safe to be called from (hybrid) HVM mode?

You mean a malicious guest attack? Sorry, I haven't tried that yet...

--
regards
Yang, Sheng
Ian Campbell
2010-Feb-02 13:19 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
On Tue, 2010-02-02 at 12:54 +0000, Sheng Yang wrote:
> On Tuesday 02 February 2010 19:22:21 Ian Campbell wrote:
> > On Tue, 2010-02-02 at 08:16 +0000, Sheng Yang wrote:
> > > +/* Reserve 128KB for grant table */
> > > +#define GNTTAB_MEMBASE            0xfbfe0000
> > > +#define GNTTAB_MEMSIZE            0x20000
> >
> > Why is this necessary? Isn't the grant table contained within one of the
> > BARs on the virtual PCI device? What needs grant tables prior to the
> > kernel finding the PCI device, which necessitates hardcoding these
> > addresses in both guest and hypervisor?
>
> Thanks for such quick and detailed comments. :)
>
> This one is deliberate, because we don't want to depend on QEmu. As you
> see, we now have PV drivers; QEmu is just one way to provide a device
> model now. We think that still requiring QEmu is somewhat strange. So this
> reserved region is there to drop the dependence on QEmu for providing PV
> drivers (PV drivers depending on QEmu is still strange, right?).

So with your patchset you can run an HVM guest with no qemu process at
all? What about the other emulated devices which have no PV equivalent?
How does the VM boot, does the BIOS have a PV INT 13 handler?

Ian.
Keir Fraser
2010-Feb-02 13:23 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
On 02/02/2010 08:16, "Sheng Yang" <sheng@linux.intel.com> wrote:

> Here is the second version of Hybrid extension support in Xen. The patch
> mainly does the following:
>
> 1. Enable SMP support through VCPU_OP in arch_set_info_guest().

What does that mean and why would we want it?

 -- Keir
Sheng Yang
2010-Feb-02 13:28 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
On Tuesday 02 February 2010 21:19:32 Ian Campbell wrote:
> On Tue, 2010-02-02 at 12:54 +0000, Sheng Yang wrote:
> > On Tuesday 02 February 2010 19:22:21 Ian Campbell wrote:
> > > On Tue, 2010-02-02 at 08:16 +0000, Sheng Yang wrote:
> > > > +/* Reserve 128KB for grant table */
> > > > +#define GNTTAB_MEMBASE            0xfbfe0000
> > > > +#define GNTTAB_MEMSIZE            0x20000
> > >
> > > Why is this necessary? Isn't the grant table contained within one of
> > > the BARs on the virtual PCI device? What needs grant tables prior to
> > > the kernel finding the PCI device, which necessitates hardcoding these
> > > addresses in both guest and hypervisor?
> >
> > Thanks for such quick and detailed comments. :)
> >
> > This one is deliberate, because we don't want to depend on QEmu. As you
> > see, we now have PV drivers; QEmu is just one way to provide a device
> > model now. We think that still requiring QEmu is somewhat strange. So
> > this reserved region is there to drop the dependence on QEmu for
> > providing PV drivers (PV drivers depending on QEmu is still strange,
> > right?).
>
> So with your patchset you can run an HVM guest with no qemu process at
> all? What about the other emulated devices which have no PV equivalent?
> How does the VM boot, does the BIOS have a PV INT 13 handler?

No, not currently... Sorry for the confusion.

We just think that QEmu providing the availability of the PV drivers seems
inelegant, so we want to decouple them. What QEmu provides is a PCI IRQ and
an MMIO region for the grant table. The event channel is available without
that PCI IRQ, so we think decoupling the MMIO from QEmu would be more
elegant...

--
regards
Yang, Sheng
Keir Fraser
2010-Feb-02 13:35 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
On 02/02/2010 11:22, "Ian Campbell" <Ian.Campbell@eu.citrix.com> wrote:

> On Tue, 2010-02-02 at 08:16 +0000, Sheng Yang wrote:
>>
>> +/* Reserve 128KB for grant table */
>> +#define GNTTAB_MEMBASE            0xfbfe0000
>> +#define GNTTAB_MEMSIZE            0x20000
>
> Why is this necessary? Isn't the grant table contained within one of the
> BARs on the virtual PCI device? What needs grant tables prior to the
> kernel finding the PCI device, which necessitates hardcoding these
> addresses in both guest and hypervisor?

I didn't even find where these get used, except to reserve an area in e820,
and it wasn't clear why that reservation is necessary.

 -- Keir
Sheng Yang
2010-Feb-02 13:37 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
On Tuesday 02 February 2010 21:23:11 Keir Fraser wrote:
> On 02/02/2010 08:16, "Sheng Yang" <sheng@linux.intel.com> wrote:
> > Here is the second version of Hybrid extension support in Xen. The
> > patch mainly does the following:
> >
> > 1. Enable SMP support through VCPU_OP in arch_set_info_guest().
>
> What does that mean and why would we want it?

It's not a good description, sorry... We need an entry address and code to
boot the APs. For that purpose, I reuse the VCPUOP_initialise and VCPUOP_up
hypercalls. With the former the guest sets the entry address of the AP, and
the latter boots it up. For this, I added some code in arch_set_info_guest()
to initialize the AP (mostly what we do after the INIT-SIPI-SIPI sequence);
that's what I meant here...

--
regards
Yang, Sheng
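A rough guest-side sketch of the bring-up path described above, illustrative only: boot_secondary_vcpu() and cpu_initialize_context() are placeholder names, and the hypervisor-side arch_set_info_guest() changes in the patch are what make this work for an HVM vcpu.

    /* Illustrative guest-side AP bring-up over the existing VCPUOP
     * interface, in place of INIT-SIPI-SIPI, much as PV guests already
     * do for their secondary VCPUs. */
    static int boot_secondary_vcpu(unsigned int cpu)
    {
        struct vcpu_guest_context *ctxt;
        int rc;

        ctxt = kzalloc(sizeof(*ctxt), GFP_KERNEL);
        if ( ctxt == NULL )
            return -ENOMEM;

        /* Placeholder: fill in the AP's entry point, stack, cr3, etc. */
        cpu_initialize_context(cpu, ctxt);

        /* Hand the initial register state to Xen... */
        rc = HYPERVISOR_vcpu_op(VCPUOP_initialise, cpu, ctxt);
        kfree(ctxt);
        if ( rc )
            return rc;

        /* ...and let the new VCPU start running from that state. */
        return HYPERVISOR_vcpu_op(VCPUOP_up, cpu, NULL);
    }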
Ian Campbell
2010-Feb-02 13:50 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
On Tue, 2010-02-02 at 13:28 +0000, Sheng Yang wrote:
> On Tuesday 02 February 2010 21:19:32 Ian Campbell wrote:
> > On Tue, 2010-02-02 at 12:54 +0000, Sheng Yang wrote:
> > > On Tuesday 02 February 2010 19:22:21 Ian Campbell wrote:
> > > > On Tue, 2010-02-02 at 08:16 +0000, Sheng Yang wrote:
> > > > > +/* Reserve 128KB for grant table */
> > > > > +#define GNTTAB_MEMBASE            0xfbfe0000
> > > > > +#define GNTTAB_MEMSIZE            0x20000
> > > >
> > > > Why is this necessary? Isn't the grant table contained within one
> > > > of the BARs on the virtual PCI device? What needs grant tables
> > > > prior to the kernel finding the PCI device, which necessitates
> > > > hardcoding these addresses in both guest and hypervisor?
> > >
> > > Thanks for such quick and detailed comments. :)
> > >
> > > This one is deliberate, because we don't want to depend on QEmu. As
> > > you see, we now have PV drivers; QEmu is just one way to provide a
> > > device model now. We think that still requiring QEmu is somewhat
> > > strange. So this reserved region is there to drop the dependence on
> > > QEmu for providing PV drivers (PV drivers depending on QEmu is still
> > > strange, right?).
> >
> > So with your patchset you can run an HVM guest with no qemu process at
> > all? What about the other emulated devices which have no PV equivalent?
> > How does the VM boot, does the BIOS have a PV INT 13 handler?
>
> No, not currently... Sorry for the confusion.
>
> We just think that QEmu providing the availability of the PV drivers
> seems inelegant, so we want to decouple them. What QEmu provides is a PCI
> IRQ and an MMIO region for the grant table. The event channel is
> available without that PCI IRQ, so we think decoupling the MMIO from QEmu
> would be more elegant...

OK, but in that case I think we should have a mechanism for the guest to
query the location of the grant table pages (hypercall, MSR etc) rather
than hardcoding a magic address. It may well end up being hardcoded on
the tools/hypervisor side for now but there is no reason to expose that
to the guest.

I wonder if we could turn things around and have the guest pick some
pages and tell the hypervisor to put the grant table there, removing the
need to reserve any of the physical address space up front. How do
full-PV guests find their grant table? Is that a mechanism which could
be re-used here instead of reserving magic regions?

Ian.
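On the last question: full-PV guests ask Xen for the grant-table frames with GNTTABOP_setup_table and then map the returned machine frames themselves. A minimal sketch, illustrative and with error handling abridged:

    /* Sketch of how a PV guest discovers its grant-table frames: ask Xen
     * for the frame list, then map those machine frames into its own
     * address space with ordinary PTE updates. */
    static int setup_grant_table(xen_pfn_t *frames, unsigned int nr_gframes)
    {
        struct gnttab_setup_table setup = {
            .dom       = DOMID_SELF,
            .nr_frames = nr_gframes,
        };
        int rc;

        set_xen_guest_handle(setup.frame_list, frames);

        rc = HYPERVISOR_grant_table_op(GNTTABOP_setup_table, &setup, 1);
        if ( rc || setup.status )
            return rc ? rc : setup.status;

        /* 'frames' now holds the machine frame numbers backing the table. */
        return 0;
    }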
Sheng Yang
2010-Feb-02 13:52 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
On Tuesday 02 February 2010 21:35:23 Keir Fraser wrote:
> On 02/02/2010 11:22, "Ian Campbell" <Ian.Campbell@eu.citrix.com> wrote:
> > On Tue, 2010-02-02 at 08:16 +0000, Sheng Yang wrote:
> >> +/* Reserve 128KB for grant table */
> >> +#define GNTTAB_MEMBASE            0xfbfe0000
> >> +#define GNTTAB_MEMSIZE            0x20000
> >
> > Why is this necessary? Isn't the grant table contained within one of the
> > BARs on the virtual PCI device? What needs grant tables prior to the
> > kernel finding the PCI device, which necessitates hardcoding these
> > addresses in both guest and hypervisor?
>
> I didn't even find where these get used, except to reserve an area in
> e820, and it wasn't clear why that reservation is necessary.

It is used in the last [6/6] patch on the Linux kernel side, which uses
these pages to map the grant table. It works the same as the MMIO region in
the PV-on-HVM device. It is reserved in the BIOS because we think that is
more elegant than depending on QEmu to provide the reserved memory space.

--
regards
Yang, Sheng
Ian Campbell
2010-Feb-02 13:52 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
On Tue, 2010-02-02 at 13:06 +0000, Sheng Yang wrote:
> On Tuesday 02 February 2010 19:26:55 Ian Campbell wrote:
> > On Tue, 2010-02-02 at 08:16 +0000, Sheng Yang wrote:
> > > +static hvm_hypercall_t *hvm_hypercall_hybrid64_table[NR_hypercalls] = {
> > > +    [ __HYPERVISOR_memory_op ] = (hvm_hypercall_t *)hvm_memory_op,
> > > +    [ __HYPERVISOR_grant_table_op ] = (hvm_hypercall_t *)hvm_grant_table_op,
> > > +    HYPERCALL(xen_version),
> > > +    HYPERCALL(console_io),
> > > +    HYPERCALL(vcpu_op),
> > > +    HYPERCALL(sched_op),
> > > +    HYPERCALL(event_channel_op),
> > > +    HYPERCALL(hvm_op),
> > > +};
> >
> > Why not just expand the existing hvm hypercall table to incorporate
> > these new hypercalls?
>
> I was just afraid that a normal HVM guest calling these hypercalls would
> result in some chaos, so I added a limitation (hybrid only) here. (I
> admit it doesn't much improve the security against a malicious guest...)

I don't think this limitation adds any meaningful security or reduction
in chaos. A non-hybrid-aware HVM guest simply won't make those
hypercalls.

> > In fact, why is hybrid mode considered a separate mode by the
> > hypervisor at all? Shouldn't it just be an extension to regular HVM
> > mode which guests may choose to use? This seems like it would eliminate
> > a bunch of the random conditionals.
>
> There is still something different from a normal HVM guest. For example,
> to use the PV timer, we have to clear the TSC offset in the HVM domain;
> and for event delivery, we use predefined VIRQs rather than the emulated
> IOAPIC/APIC. These code paths are exclusive, so we need them wrapped with
> a flag (which we called "hybrid"). The word "mode" here may be
> inaccurate; an "extension" would be more proper. I will change the
> phrasing next time.

But the old mechanisms (emulated IOAPIC etc) are still present until the
enable_hybrid HVMOP is called, aren't they? Why can't you perform the
switch at the point at which the new feature is requested by the guest,
e.g. when the VIRQ is configured?

It looks like you are using evtchns for all interrupt injection,
including any emulated or passthrough devices which may be present.
Using evtchns for PV devices obviously makes sense but I think this
needs to coexist with emulated interrupt injection for non-PV devices, so
the IOAPIC/APIC should not be mutually exclusive with using PV evtchns.

Ian.
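For concreteness, the point at which a guest binds one of the new per-pin VIRQs would look roughly like the sketch below. EVTCHNOP_bind_virq is the standard event-channel interface; VIRQ_EMUL_PIN() is introduced by this patch series, so treating it as an ordinary global VIRQ here is an assumption about the proposed interface rather than established ABI.

    /* Illustrative guest-side binding of an emulated-pin VIRQ to an event
     * channel; this could double as the trigger for enabling the PV event
     * delivery path. */
    static int bind_emulated_pin_virq(unsigned int pin, unsigned int cpu,
                                      evtchn_port_t *port)
    {
        struct evtchn_bind_virq bind = {
            .virq = VIRQ_EMUL_PIN(pin),   /* patch-defined VIRQ */
            .vcpu = cpu,
        };
        int rc = HYPERVISOR_event_channel_op(EVTCHNOP_bind_virq, &bind);

        if ( rc == 0 )
            *port = bind.port;            /* port to poll/mask like any other */
        return rc;
    }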
Ian Campbell
2010-Feb-02 13:52 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
On Tue, 2010-02-02 at 08:16 +0000, Sheng Yang wrote:
> diff --git a/xen/arch/x86/hvm/irq.c b/xen/arch/x86/hvm/irq.c
> --- a/xen/arch/x86/hvm/irq.c
> +++ b/xen/arch/x86/hvm/irq.c
> @@ -46,8 +46,18 @@ static void __hvm_pci_intx_assert(
>      if ( (hvm_irq->pci_link_assert_count[link]++ == 0) && isa_irq &&
>           (hvm_irq->gsi_assert_count[isa_irq]++ == 0) )
>      {
> -        vioapic_irq_positive_edge(d, isa_irq);
> -        vpic_irq_positive_edge(d, isa_irq);
> +        if ( !is_hybrid_evtchn_enabled_domain(d) )
> +        {
> +            vioapic_irq_positive_edge(d, isa_irq);
> +            vpic_irq_positive_edge(d, isa_irq);
> +        }
> +        else
> +        {
> +            /* TODO fix the critical region here */
> +            spin_unlock(&d->arch.hvm_domain.irq_lock);
> +            send_guest_global_virq(d, VIRQ_EMUL_PIN(isa_irq));
> +            spin_lock(&d->arch.hvm_domain.irq_lock);
> +        }
>      }
>  }

This can't be right, at least not without a big comment explaining why
it is safe to drop the lock here...

Ian.
Sheng Yang
2010-Feb-02 13:53 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
On Tuesday 02 February 2010 21:52:55 Ian Campbell wrote:
> On Tue, 2010-02-02 at 08:16 +0000, Sheng Yang wrote:
> > diff --git a/xen/arch/x86/hvm/irq.c b/xen/arch/x86/hvm/irq.c
> > --- a/xen/arch/x86/hvm/irq.c
> > +++ b/xen/arch/x86/hvm/irq.c
> > @@ -46,8 +46,18 @@ static void __hvm_pci_intx_assert(
> >      if ( (hvm_irq->pci_link_assert_count[link]++ == 0) && isa_irq &&
> >           (hvm_irq->gsi_assert_count[isa_irq]++ == 0) )
> >      {
> > -        vioapic_irq_positive_edge(d, isa_irq);
> > -        vpic_irq_positive_edge(d, isa_irq);
> > +        if ( !is_hybrid_evtchn_enabled_domain(d) )
> > +        {
> > +            vioapic_irq_positive_edge(d, isa_irq);
> > +            vpic_irq_positive_edge(d, isa_irq);
> > +        }
> > +        else
> > +        {
> > +            /* TODO fix the critical region here */
> > +            spin_unlock(&d->arch.hvm_domain.irq_lock);
> > +            send_guest_global_virq(d, VIRQ_EMUL_PIN(isa_irq));
> > +            spin_lock(&d->arch.hvm_domain.irq_lock);
> > +        }
> >      }
> >  }
>
> This can't be right, at least not without a big comment explaining why
> it is safe to drop the lock here...

... Would address this in the next version.

--
regards
Yang, Sheng
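One common way to avoid dropping the lock is to record the decision under the lock and deliver the event after releasing it, as the patch's hvm_isa_irq_assert() hunk already starts to do with its send_virq flag. The sketch below reconstructs that function around the quoted fragments; everything outside the quoted lines is assumed, and is_hybrid_evtchn_enabled_domain() / VIRQ_EMUL_PIN() are the patch's own additions.

    /* Sketch: deliver the VIRQ after irq_lock is released instead of
     * unlocking inside the critical region.  Illustrative, not the patch. */
    void hvm_isa_irq_assert(struct domain *d, unsigned int isa_irq)
    {
        struct hvm_irq *hvm_irq = &d->arch.hvm_domain.irq;
        unsigned int gsi = hvm_isa_irq_to_gsi(isa_irq);
        int send_virq = 0;

        ASSERT(isa_irq <= 15);

        spin_lock(&d->arch.hvm_domain.irq_lock);

        if ( !__test_and_set_bit(isa_irq, &hvm_irq->isa_irq.i) &&
             (hvm_irq->gsi_assert_count[gsi]++ == 0) )
        {
            if ( !is_hybrid_evtchn_enabled_domain(d) )
            {
                vioapic_irq_positive_edge(d, gsi);
                vpic_irq_positive_edge(d, isa_irq);
            }
            else
                send_virq = 1;   /* remember the event, deliver it below */
        }

        spin_unlock(&d->arch.hvm_domain.irq_lock);

        /* Notify the guest only once the lock is no longer held. */
        if ( send_virq )
            send_guest_global_virq(d, VIRQ_EMUL_PIN(isa_irq));
    }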
Tim Deegan
2010-Feb-02 14:00 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
At 13:50 +0000 on 02 Feb (1265118605), Ian Campbell wrote:
> I wonder if we could turn things around and have the guest pick some
> pages and tell the hypervisor to put the grant table there, removing the
> need to reserve any of the physical address space up front.

Yes; this is how all existing PV-on-HVM drivers work, using the
grant-table hypercall. I don't see the need to add another mechanism.

Tim.

--
Tim Deegan <Tim.Deegan@citrix.com>
Principal Software Engineer, Citrix Systems (R&D) Ltd.
Keir Fraser
2010-Feb-02 14:01 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
On 02/02/2010 13:52, "Sheng Yang" <sheng@linux.intel.com> wrote:

>> I didn't even find where these get used, except to reserve an area in
>> e820, and it wasn't clear why that reservation is necessary.
>
> It is used in the last [6/6] patch on the Linux kernel side, which uses
> these pages to map the grant table. It works the same as the MMIO region
> in the PV-on-HVM device. It is reserved in the BIOS because we think that
> is more elegant than depending on QEmu to provide the reserved memory
> space.

Hmm. Can't this be done in phases? It seems unnecessary to be making
changes solely to remove qemu dependencies in the initial patchset, when
the patchset does not actually achieve that aim.

I think an HVM guest with no PCI space is a little way off, and perhaps we
can find a better way than hardcoding an address in two places.

 -- Keir
Keir Fraser
2010-Feb-02 14:03 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
On 02/02/2010 13:37, "Sheng Yang" <sheng@linux.intel.com> wrote:

>>> 1. Enable SMP support through VCPU_OP in arch_set_info_guest().
>>
>> What does that mean and why would we want it?
>
> It's not a good description, sorry... We need an entry address and code
> to boot the APs. For that purpose, I reuse the VCPUOP_initialise and
> VCPUOP_up hypercalls. With the former the guest sets the entry address of
> the AP, and the latter boots it up. For this, I added some code in
> arch_set_info_guest() to initialize the AP (mostly what we do after the
> INIT-SIPI-SIPI sequence); that's what I meant here...

Okay, so that leads to the obvious next question: why do you want to avoid
using INIT-SIPI-SIPI?

 -- Keir
Stefano Stabellini
2010-Feb-02 14:04 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
On Tue, 2 Feb 2010, Ian Campbell wrote:
> It looks like you are using evtchns for all interrupt injection,
> including any emulated or passthrough devices which may be present.
> Using evtchns for PV devices obviously makes sense but I think this
> needs to coexist with emulated interrupt injection for non-PV devices, so
> the IOAPIC/APIC should not be mutually exclusive with using PV evtchns.

Indeed.
Evtchns for PV devices are more than welcome, but this:

> diff --git a/xen/arch/x86/hvm/irq.c b/xen/arch/x86/hvm/irq.c
> --- a/xen/arch/x86/hvm/irq.c
> +++ b/xen/arch/x86/hvm/irq.c
> @@ -46,8 +46,18 @@ static void __hvm_pci_intx_assert(
>      if ( (hvm_irq->pci_link_assert_count[link]++ == 0) && isa_irq &&
>           (hvm_irq->gsi_assert_count[isa_irq]++ == 0) )
>      {
> -        vioapic_irq_positive_edge(d, isa_irq);
> -        vpic_irq_positive_edge(d, isa_irq);
> +        if ( !is_hybrid_evtchn_enabled_domain(d) )
> +        {
> +            vioapic_irq_positive_edge(d, isa_irq);
> +            vpic_irq_positive_edge(d, isa_irq);
> +        }
> +        else
> +        {
> +            /* TODO fix the critical region here */
> +            spin_unlock(&d->arch.hvm_domain.irq_lock);
> +            send_guest_global_virq(d, VIRQ_EMUL_PIN(isa_irq));
> +            spin_lock(&d->arch.hvm_domain.irq_lock);
> +        }
>      }
>  }

and this:

> @@ -101,11 +114,21 @@ void hvm_isa_irq_assert(
>      if ( !__test_and_set_bit(isa_irq, &hvm_irq->isa_irq.i) &&
>           (hvm_irq->gsi_assert_count[gsi]++ == 0) )
>      {
> -        vioapic_irq_positive_edge(d, gsi);
> -        vpic_irq_positive_edge(d, isa_irq);
> +        if ( !is_hybrid_evtchn_enabled_domain(d) )
> +        {
> +            vioapic_irq_positive_edge(d, gsi);
> +            vpic_irq_positive_edge(d, isa_irq);
> +        }
> +        else
> +        {
> +            send_virq = 1;
> +        }
>      }

and this:

> @@ -120,7 +143,10 @@ void hvm_isa_irq_deassert(
>
>      if ( __test_and_clear_bit(isa_irq, &hvm_irq->isa_irq.i) &&
>           (--hvm_irq->gsi_assert_count[gsi] == 0) )
> -        vpic_irq_negative_edge(d, isa_irq);
> +    {
> +        if ( !is_hybrid_evtchn_enabled_domain(d) )
> +            vpic_irq_negative_edge(d, isa_irq);
> +    }
>
>      spin_unlock(&d->arch.hvm_domain.irq_lock);
>  }

I think they are a very bad idea and a maintenance pain, especially when
you think about PCI passthrough.
Sheng Yang
2010-Feb-02 14:07 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
On Tuesday 02 February 2010 21:52:44 Ian Campbell wrote:
> On Tue, 2010-02-02 at 13:06 +0000, Sheng Yang wrote:
> > On Tuesday 02 February 2010 19:26:55 Ian Campbell wrote:
> > > On Tue, 2010-02-02 at 08:16 +0000, Sheng Yang wrote:
> > > > +static hvm_hypercall_t *hvm_hypercall_hybrid64_table[NR_hypercalls] = {
> > > > +    [ __HYPERVISOR_memory_op ] = (hvm_hypercall_t *)hvm_memory_op,
> > > > +    [ __HYPERVISOR_grant_table_op ] = (hvm_hypercall_t *)hvm_grant_table_op,
> > > > +    HYPERCALL(xen_version),
> > > > +    HYPERCALL(console_io),
> > > > +    HYPERCALL(vcpu_op),
> > > > +    HYPERCALL(sched_op),
> > > > +    HYPERCALL(event_channel_op),
> > > > +    HYPERCALL(hvm_op),
> > > > +};
> > >
> > > Why not just expand the existing hvm hypercall table to incorporate
> > > these new hypercalls?
> >
> > I was just afraid that a normal HVM guest calling these hypercalls
> > would result in some chaos, so I added a limitation (hybrid only) here.
> > (I admit it doesn't much improve the security against a malicious
> > guest...)
>
> I don't think this limitation adds any meaningful security or reduction
> in chaos. A non-hybrid-aware HVM guest simply won't make those
> hypercalls.

Um, yes... I will update this in the next version.

> > > In fact, why is hybrid mode considered a separate mode by the
> > > hypervisor at all? Shouldn't it just be an extension to regular HVM
> > > mode which guests may choose to use? This seems like it would
> > > eliminate a bunch of the random conditionals.
> >
> > There is still something different from a normal HVM guest. For
> > example, to use the PV timer, we have to clear the TSC offset in the
> > HVM domain; and for event delivery, we use predefined VIRQs rather than
> > the emulated IOAPIC/APIC. These code paths are exclusive, so we need
> > them wrapped with a flag (which we called "hybrid"). The word "mode"
> > here may be inaccurate; an "extension" would be more proper. I will
> > change the phrasing next time.
>
> But the old mechanisms (emulated IOAPIC etc) are still present until the
> enable_hybrid HVMOP is called, aren't they? Why can't you perform the
> switch at the point at which the new feature is requested by the guest,
> e.g. when the VIRQ is configured?

Yes, they are there before the enable_hybrid HVMOP is called. But sorry, I
don't quite understand the point about "when the VIRQ is configured".

> It looks like you are using evtchns for all interrupt injection,
> including any emulated or passthrough devices which may be present.
> Using evtchns for PV devices obviously makes sense but I think this
> needs to coexist with emulated interrupt injection for non-PV devices, so
> the IOAPIC/APIC should not be mutually exclusive with using PV evtchns.

Yes, similar to dom0 (maybe more like pv_ops dom0). Currently passthrough
devices have an issue with it, due to the lack of MSI/MSI-X support (which
would be our next step, and can be shared with pv_ops dom0); and the INTx
method cannot be used either, because event channels are edge triggered
while INTx is level triggered. Passthrough devices could be used with some
hacks if the MSI2INTX translation worked.

But letting the IOAPIC/APIC coexist with event channels is not our target.
As we know, the overhead they bring, notably the EOI through the LAPIC,
impacts performance badly. We want event channels because they have much
less overhead compared to the IOAPIC/LAPIC: a completely
virtualization-aware solution which eliminates all the unnecessary
overhead. That's what we want Xen guests to benefit from.

--
regards
Yang, Sheng
Sheng Yang
2010-Feb-02 14:08 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
On Tuesday 02 February 2010 22:03:31 Keir Fraser wrote:
> On 02/02/2010 13:37, "Sheng Yang" <sheng@linux.intel.com> wrote:
> >>> 1. Enable SMP support through VCPU_OP in arch_set_info_guest().
> >>
> >> What does that mean and why would we want it?
> >
> > It's not a good description, sorry... We need an entry address and code
> > to boot the APs. For that purpose, I reuse the VCPUOP_initialise and
> > VCPUOP_up hypercalls. With the former the guest sets the entry address
> > of the AP, and the latter boots it up. For this, I added some code in
> > arch_set_info_guest() to initialize the AP (mostly what we do after the
> > INIT-SIPI-SIPI sequence); that's what I meant here...
>
> Okay, so that leads to the obvious next question: why do you want to
> avoid using INIT-SIPI-SIPI?

Because we don't have an IOAPIC/LAPIC...

--
regards
Yang, Sheng
Sheng Yang
2010-Feb-02 14:13 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
On Tuesday 02 February 2010 22:01:43 Keir Fraser wrote:
> On 02/02/2010 13:52, "Sheng Yang" <sheng@linux.intel.com> wrote:
> >> I didn't even find where these get used, except to reserve an area in
> >> e820, and it wasn't clear why that reservation is necessary.
> >
> > It is used in the last [6/6] patch on the Linux kernel side, which uses
> > these pages to map the grant table. It works the same as the MMIO
> > region in the PV-on-HVM device. It is reserved in the BIOS because we
> > think that is more elegant than depending on QEmu to provide the
> > reserved memory space.
>
> Hmm. Can't this be done in phases? It seems unnecessary to be making
> changes solely to remove qemu dependencies in the initial patchset, when
> the patchset does not actually achieve that aim.
>
> I think an HVM guest with no PCI space is a little way off, and perhaps
> we can find a better way than hardcoding an address in two places.

Sure. I will try to stick to the old QEmu-provided MMIO first (and maybe
try Ian Campbell's advice to get something more flexible later).

--
regards
Yang, Sheng
Sheng Yang
2010-Feb-02 14:22 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
On Tuesday 02 February 2010 22:00:47 Tim Deegan wrote:
> At 13:50 +0000 on 02 Feb (1265118605), Ian Campbell wrote:
> > I wonder if we could turn things around and have the guest pick some
> > pages and tell the hypervisor to put the grant table there, removing
> > the need to reserve any of the physical address space up front.
>
> Yes; this is how all existing PV-on-HVM drivers work, using the
> grant-table hypercall. I don't see the need to add another mechanism.

...I don't quite understand. I think the PV-on-HVM drivers reserve the
memory space in the QEmu-provided device's MMIO space, similar to what I
did here (except it is reserved in the BIOS)?...

--
regards
Yang, Sheng
Sheng Yang
2010-Feb-02 14:28 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
On Tuesday 02 February 2010 21:50:05 Ian Campbell wrote:
> On Tue, 2010-02-02 at 13:28 +0000, Sheng Yang wrote:
> > On Tuesday 02 February 2010 21:19:32 Ian Campbell wrote:
> > > On Tue, 2010-02-02 at 12:54 +0000, Sheng Yang wrote:
> > > > On Tuesday 02 February 2010 19:22:21 Ian Campbell wrote:
> > > > > On Tue, 2010-02-02 at 08:16 +0000, Sheng Yang wrote:
> > > > > > +/* Reserve 128KB for grant table */
> > > > > > +#define GNTTAB_MEMBASE            0xfbfe0000
> > > > > > +#define GNTTAB_MEMSIZE            0x20000
> > > > >
> > > > > Why is this necessary? Isn't the grant table contained within one
> > > > > of the BARs on the virtual PCI device? What needs grant tables
> > > > > prior to the kernel finding the PCI device, which necessitates
> > > > > hardcoding these addresses in both guest and hypervisor?
> > > >
> > > > Thanks for such quick and detailed comments. :)
> > > >
> > > > This one is deliberate, because we don't want to depend on QEmu. As
> > > > you see, we now have PV drivers; QEmu is just one way to provide a
> > > > device model now. We think that still requiring QEmu is somewhat
> > > > strange. So this reserved region is there to drop the dependence on
> > > > QEmu for providing PV drivers (PV drivers depending on QEmu is
> > > > still strange, right?).
> > >
> > > So with your patchset you can run an HVM guest with no qemu process
> > > at all? What about the other emulated devices which have no PV
> > > equivalent? How does the VM boot, does the BIOS have a PV INT 13
> > > handler?
> >
> > No, not currently... Sorry for the confusion.
> >
> > We just think that QEmu providing the availability of the PV drivers
> > seems inelegant, so we want to decouple them. What QEmu provides is a
> > PCI IRQ and an MMIO region for the grant table. The event channel is
> > available without that PCI IRQ, so we think decoupling the MMIO from
> > QEmu would be more elegant...
>
> OK, but in that case I think we should have a mechanism for the guest to
> query the location of the grant table pages (hypercall, MSR etc) rather
> than hardcoding a magic address. It may well end up being hardcoded on
> the tools/hypervisor side for now but there is no reason to expose that
> to the guest.

Yes. I will try to use the QEmu-provided device for this, though I think it
may still be a problem if I use the PCI probe mechanism to find the
address, because the grant table may be initialized earlier. Rather than
probing it as a PCI device, I prefer a query, or hardcoding, in this case.

> I wonder if we could turn things around and have the guest pick some
> pages and tell the hypervisor to put the grant table there, removing the
> need to reserve any of the physical address space up front. How do
> full-PV guests find their grant table? Is that a mechanism which could
> be re-used here instead of reserving magic regions?

I will check whether we can follow the PV solution. (I remember I once
checked that, but I have forgotten the reason for not doing so now...)

--
regards
Yang, Sheng
Keir Fraser
2010-Feb-02 14:32 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
On 02/02/2010 14:08, "Sheng Yang" <sheng@linux.intel.com> wrote:

>> Okay, so that leads to the obvious next question: why do you want to
>> avoid using INIT-SIPI-SIPI?
>
> Because we don't have an IOAPIC/LAPIC...

Is it necessary to remove the LAPICs completely? If you go very far down
the route of ripping emulated stuff out of HVM, it starts to feel like
starting with a pure PV guest and HVMing it up is closer in spirit to what
you might be aiming for.

 -- Keir
Keir Fraser
2010-Feb-02 14:37 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
On 02/02/2010 14:32, "Keir Fraser" <keir.fraser@eu.citrix.com> wrote:

>>> Okay, so that leads to the obvious next question: why do you want to
>>> avoid using INIT-SIPI-SIPI?
>>
>> Because we don't have an IOAPIC/LAPIC...
>
> Is it necessary to remove the LAPICs completely? If you go very far down
> the route of ripping emulated stuff out of HVM, it starts to feel like
> starting with a pure PV guest and HVMing it up is closer in spirit to
> what you might be aiming for.

The other thing is, removing some of this stuff weakens the argument that
the hybrid approach lets us 'ride the wave' of improving hardware virt
support. For example, the LAPIC is pretty architectural these days, and is
ripe for fuller hardware virtualisation. If you put HVM onto event channels
and totally rip out the LAPIC, where does that leave us if full LAPIC virt
comes along, with all its potential for especially improving device
passthru performance?

 -- Keir
Sheng Yang
2010-Feb-02 14:39 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
On Tuesday 02 February 2010 22:32:09 Keir Fraser wrote:
> On 02/02/2010 14:08, "Sheng Yang" <sheng@linux.intel.com> wrote:
> >> Okay, so that leads to the obvious next question: why do you want to
> >> avoid using INIT-SIPI-SIPI?
> >
> > Because we don't have an IOAPIC/LAPIC...
>
> Is it necessary to remove the LAPICs completely? If you go very far down
> the route of ripping emulated stuff out of HVM, it starts to feel like
> starting with a pure PV guest and HVMing it up is closer in spirit to
> what you might be aiming for.

The performance of interrupt delivery is what we care about. The
IOAPIC/LAPIC don't work well now, so we want event channels to eliminate
their overhead, notably the EOI in the LAPIC. And you know, starting from a
PV guest is another story...

--
regards
Yang, Sheng
Sheng Yang
2010-Feb-02 15:51 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
On Tuesday 02 February 2010 22:37:06 Keir Fraser wrote:
> On 02/02/2010 14:32, "Keir Fraser" <keir.fraser@eu.citrix.com> wrote:
> >>> Okay, so that leads to the obvious next question: why do you want to
> >>> avoid using INIT-SIPI-SIPI?
> >>
> >> Because we don't have an IOAPIC/LAPIC...
> >
> > Is it necessary to remove the LAPICs completely? If you go very far
> > down the route of ripping emulated stuff out of HVM, it starts to feel
> > like starting with a pure PV guest and HVMing it up is closer in spirit
> > to what you might be aiming for.
>
> The other thing is, removing some of this stuff weakens the argument that
> the hybrid approach lets us 'ride the wave' of improving hardware virt
> support. For example, the LAPIC is pretty architectural these days, and
> is ripe for fuller hardware virtualisation. If you put HVM onto event
> channels and totally rip out the LAPIC, where does that leave us if full
> LAPIC virt comes along, with all its potential for especially improving
> device passthru performance?

In fact, our target is improving device passthru performance (before full
LAPIC virtualization is ready in every server), and what we mostly target
is MSI/MSI-X interrupt-intensive devices. That is not included in this
patchset (we have experimental patches, of course), because it cannot avoid
touching some generic kernel code, e.g. the MSI-related things in ioapic.c,
which makes it harder to get checked in. The code to support hybrid
MSI/MSI-X devices can be shared with pv_ops dom0, so that is what we plan
as the next step. And these hybrid features are component based.

I admit that, sooner or later, these PV solutions will be replaced by
hardware features. If you have new enough hardware, you can simply disable
them; if you don't, you can still benefit from them. Of course, as you
know, it takes time for everyone to get new enough machines. So I think it
is better to do something now rather than simply waiting for the hardware
features to come. :)

--
regards
Yang, Sheng
Ian Campbell
2010-Feb-02 16:15 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
On Tue, 2010-02-02 at 14:07 +0000, Sheng Yang wrote:
> On Tuesday 02 February 2010 21:52:44 Ian Campbell wrote:
> > On Tue, 2010-02-02 at 13:06 +0000, Sheng Yang wrote:
> > > On Tuesday 02 February 2010 19:26:55 Ian Campbell wrote:
> > > > On Tue, 2010-02-02 at 08:16 +0000, Sheng Yang wrote:
> > > > In fact, why is hybrid mode considered a separate mode by the
> > > > hypervisor at all? Shouldn't it just be an extension to regular HVM
> > > > mode which guests may choose to use? This seems like it would
> > > > eliminate a bunch of the random conditionals.
> > >
> > > There is still something different from a normal HVM guest. For
> > > example, to use the PV timer, we have to clear the TSC offset in the
> > > HVM domain; and for event delivery, we use predefined VIRQs rather
> > > than the emulated IOAPIC/APIC. These code paths are exclusive, so we
> > > need them wrapped with a flag (which we called "hybrid"). The word
> > > "mode" here may be inaccurate; an "extension" would be more proper. I
> > > will change the phrasing next time.
> >
> > But the old mechanisms (emulated IOAPIC etc) are still present until
> > the enable_hybrid HVMOP is called, aren't they? Why can't you perform
> > the switch at the point at which the new feature is requested by the
> > guest, e.g. when the VIRQ is configured?
>
> Yes, they are there before the enable_hybrid HVMOP is called. But sorry,
> I don't quite understand the point about "when the VIRQ is configured".

I just meant that you should enable evtchn mode at the point at which
the guest tries to use event channels and not in some specific "enable
hybrid mode" call, similarly for PV-timer mode etc.

Of course that presupposes that event channels and APIC etc are mutually
exclusive, which I don't think is a given.

I'm of the opinion that the word "hybrid" should not occur anywhere in
the tools or hypervisor -- instead there should simply be PV
functionalities which are made available to HVM guest kernels which
request them.

I'm similarly of the opinion that the hybrid-ness should not leak out of
the core Xen-aware kernel code, for example the word hybrid should not
be used in the front end drivers, rather the differences should be
encapsulated in the evtchn code (etc).

> But letting the IOAPIC/APIC coexist with event channels is not our
> target. As we know, the overhead they bring, notably the EOI through the
> LAPIC, impacts performance badly. We want event channels because they
> have much less overhead compared to the IOAPIC/LAPIC: a completely
> virtualization-aware solution which eliminates all the unnecessary
> overhead. That's what we want Xen guests to benefit from.

It certainly makes sense to deliver interrupts to PV drivers via
evtchns. I don't think it necessarily follows that all interrupts should
be injected via event channels. For example for low-throughput emulated
devices I think it makes sense to continue to inject interrupts via the
emulated *APIC paths, such that the treatment of a given device is either
consistently emulated or consistently paravirtualised but not some
mixture.

Ian.
Sheng Yang
2010-Feb-02 16:31 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
On Wednesday 03 February 2010 00:15:50 Ian Campbell wrote:
> On Tue, 2010-02-02 at 14:07 +0000, Sheng Yang wrote:
> > On Tuesday 02 February 2010 21:52:44 Ian Campbell wrote:
> > > On Tue, 2010-02-02 at 13:06 +0000, Sheng Yang wrote:
> > > > On Tuesday 02 February 2010 19:26:55 Ian Campbell wrote:
> > > > > On Tue, 2010-02-02 at 08:16 +0000, Sheng Yang wrote:
> > > > > In fact, why is hybrid mode considered a separate mode by the
> > > > > hypervisor at all? Shouldn't it just be an extension to regular
> > > > > HVM mode which guests may choose to use? This seems like it would
> > > > > eliminate a bunch of the random conditionals.
> > > >
> > > > There is still something different from a normal HVM guest. For
> > > > example, to use the PV timer, we have to clear the TSC offset in
> > > > the HVM domain; and for event delivery, we use predefined VIRQs
> > > > rather than the emulated IOAPIC/APIC. These code paths are
> > > > exclusive, so we need them wrapped with a flag (which we called
> > > > "hybrid"). The word "mode" here may be inaccurate; an "extension"
> > > > would be more proper. I will change the phrasing next time.
> > >
> > > But the old mechanisms (emulated IOAPIC etc) are still present until
> > > the enable_hybrid HVMOP is called, aren't they? Why can't you perform
> > > the switch at the point at which the new feature is requested by the
> > > guest, e.g. when the VIRQ is configured?
> >
> > Yes, they are there before the enable_hybrid HVMOP is called. But
> > sorry, I don't quite understand the point about "when the VIRQ is
> > configured".
>
> I just meant that you should enable evtchn mode at the point at which
> the guest tries to use event channels and not in some specific "enable
> hybrid mode" call, similarly for PV-timer mode etc.
>
> Of course that presupposes that event channels and APIC etc are mutually
> exclusive, which I don't think is a given.

I will give it a try. It seems I need to tie the TSC offset clearing to the
event channel binding hypercall (if I don't introduce another hypercall),
which sounds a little messy.

> I'm of the opinion that the word "hybrid" should not occur anywhere in
> the tools or hypervisor -- instead there should simply be PV
> functionalities which are made available to HVM guest kernels which
> request them.
>
> I'm similarly of the opinion that the hybrid-ness should not leak out of
> the core Xen-aware kernel code, for example the word hybrid should not
> be used in the front end drivers, rather the differences should be
> encapsulated in the evtchn code (etc).

That's reasonable. I will try to make them more like a natural PV feature.

> > But letting the IOAPIC/APIC coexist with event channels is not our
> > target. As we know, the overhead they bring, notably the EOI through
> > the LAPIC, impacts performance badly. We want event channels because
> > they have much less overhead compared to the IOAPIC/LAPIC: a completely
> > virtualization-aware solution which eliminates all the unnecessary
> > overhead. That's what we want Xen guests to benefit from.
>
> It certainly makes sense to deliver interrupts to PV drivers via
> evtchns. I don't think it necessarily follows that all interrupts should
> be injected via event channels. For example for low-throughput emulated
> devices I think it makes sense to continue to inject interrupts via the
> emulated *APIC paths, such that the treatment of a given device is either
> consistently emulated or consistently paravirtualised but not some
> mixture.

Well, for us, we want evtchns because we want to improve the performance of
interrupt-intensive passthru devices (though that is too big for the first
step; we have experimental patches, but would like to consolidate with the
pv_ops dom0 solution). The situation won't change if we still use the
emulated APIC path...

--
regards
Yang, Sheng
Ian Campbell
2010-Feb-02 18:03 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
On Tue, 2010-02-02 at 16:31 +0000, Sheng Yang wrote:
> > I just meant that you should enable evtchn mode at the point at which
> > the guest tries to use event channels and not in some specific "enable
> > hybrid mode" call, similarly for PV-timer mode etc.
> >
> > Of course that presupposes that event channels and APIC etc are
> > mutually exclusive, which I don't think is a given.
>
> I will give it a try. It seems I need to tie the TSC offset clearing to
> the event channel binding hypercall (if I don't introduce another
> hypercall), which sounds a little messy.

I guess to clarify, what I really meant was in the code which enables the
PV timer for a domain, rather than specifically in the event channel
hypercall which implements the binding to the timer evtchn. I'm not sure
if enabling the timer is a separate hypercall or is just a side effect of
binding to the evtchn.

Ian.
Stefano Stabellini
2010-Feb-02 18:27 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
On Tue, 2 Feb 2010, Sheng Yang wrote:
> Well, for us, we want evtchns because we want to improve the performance
> of interrupt-intensive passthru devices (though that is too big for the
> first step; we have experimental patches, but would like to consolidate
> with the pv_ops dom0 solution). The situation won't change if we still
> use the emulated APIC path...

I think you should keep this as the last step: once you have all the
other PV features working on HVM like Ian suggested, send a patch to
allow a guest kernel to switch from emulated APIC to evtchn for every
device.
If you manage to keep it simple, it could be a win for everyone.
Sheng Yang
2010-Feb-03 05:15 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
On Wednesday 03 February 2010 02:27:42 Stefano Stabellini wrote:
> On Tue, 2 Feb 2010, Sheng Yang wrote:
> > Well, for us, we want evtchns because we want to improve the
> > performance of interrupt-intensive passthru devices (though that is too
> > big for the first step; we have experimental patches, but would like to
> > consolidate with the pv_ops dom0 solution). The situation won't change
> > if we still use the emulated APIC path...
>
> I think you should keep this as the last step: once you have all the
> other PV features working on HVM like Ian suggested, send a patch to
> allow a guest kernel to switch from emulated APIC to evtchn for every
> device.
> If you manage to keep it simple, it could be a win for everyone.

Making event channels coexist with the IOAPIC/LAPIC is quite different from
what we are doing now, and Xen already has PV-on-HVM for that. Of course, I
also want an elegant solution in the end, so I will try hard to keep things
simple and keep the code clear.

--
regards
Yang, Sheng
Tim Deegan
2010-Feb-03 10:39 UTC
Re: [Xen-devel] [PATCH][v2] Hybrid extension support in Xen
At 05:15 +0000 on 03 Feb (1265174106), Sheng Yang wrote:
> Making event channels coexist with the IOAPIC/LAPIC is quite different
> from what we are doing now, and Xen already has PV-on-HVM for that.

Indeed it does. I don't see any advantage in adding more mechanism to
both Xen and linux to give us something we basically already have.

Cheers,

Tim.

--
Tim Deegan <Tim.Deegan@citrix.com>
Principal Software Engineer, Citrix Systems (R&D) Ltd.