On Mon, Sep 13, 2021 at 2:50 PM Michael S. Tsirkin <mst at redhat.com>
wrote:>
> On Mon, Sep 13, 2021 at 02:37:42AM -0400, Michael S. Tsirkin wrote:
> > On Mon, Sep 13, 2021 at 02:34:01PM +0800, Jason Wang wrote:
> > > On Mon, Sep 13, 2021 at 2:28 PM Michael S. Tsirkin <mst at
redhat.com> wrote:
> > > >
> > > > On Mon, Sep 13, 2021 at 02:08:02PM +0800, Jason Wang wrote:
> > > > > On Mon, Sep 13, 2021 at 2:04 PM Michael S. Tsirkin
<mst at redhat.com> wrote:
> > > > > >
> > > > > > On Mon, Sep 13, 2021 at 01:53:50PM +0800, Jason
Wang wrote:
> > > > > > > We used to synchronize pending MSI-X irq
handlers via
> > > > > > > synchronize_irq(), this may not work for the
untrusted device which
> > > > > > > may keep sending interrupts after reset which
may lead unexpected
> > > > > > > results. Similarly, we should not enable
MSI-X interrupt until the
> > > > > > > device is ready. So this patch fixes those
two issues by:
> > > > > > >
> > > > > > > 1) switching to use disable_irq() to prevent
the virtio interrupt
> > > > > > > handlers to be called after the device is
reset.
> > > > > > > 2) using IRQF_NO_AUTOEN and enable the MSI-X
irq during .ready()
> > > > > > >
> > > > > > > This can make sure the virtio interrupt
handler won't be called before
> > > > > > > virtio_device_ready() and after reset.
> > > > > > >
> > > > > > > Signed-off-by: Jason Wang <jasowang at
redhat.com>
> > > > > >
> > > > > > I don't get the threat model here. Isn't
disabling irqs done by the
> > > > > > hypervisor anyway? Is there a reason to trust
disable_irq but not
> > > > > > device reset?
> > > > >
> > > > > My understanding is that e.g in the case of SEV/TDX we
don't trust the
> > > > > hypervisor. So the hypervisor can keep sending
interrupts even if the
> > > > > device is reset. The guest can only trust its own
software interrupt
> > > > > management logic to avoid call virtio callback in this
case.
> > > > >
> > > > > Thanks
> > > >
> > > > Hmm but I don't see how do these patches do this.
> > > > They call disable_irq but can't the hypervisor keep
> > > > sending interrupts after disable_irq, too?
> > >
> > > Yes, but since the irq is disabled, the vring or config callback
won't
> > > be called in this case.
> > >
> > > Thanks
> >
> > But doen't "irq is disabled" basically mean "we
told the hypervisor
> > to disable the irq"? What extractly prevents hypervisor from
> > sending the irq even if guest thinks it disabled it?
>
> More generally, can't we for example blow away the
> indir_desc array that we use to keep the ctx pointers?
> Won't that be enough?
I'm not sure how it is related to the indirect descriptor but an
example is that all the current driver will assume:
1) the interrupt won't be raised before virtio_device_ready()
2) the interrupt won't be raised after reset()
All of these above two are not true if we don't trust the device. If
we want to audit all the drivers it would be not trivial and hard to
be correct. So it looks to me it's better to avoid those in the virtio
core.
>
>
> And looking at all this closely, I suspect it is wise to just completely
> disable hotplug/unplug for encrypted guests. Or maybe it's already
> the case?
I think not, it's more about:
1) what happens if the vq interrupt handler is called between
init_vqs() and virito_device_ready()
2) what happens if vq interrupt handler is called after reset
Instead of trying to answer those hard questions, it's better simply
not to have those conditions.
Thanks
>
>
> > > >
> > > >
> > > >
> > > > > >
> > > > > > Cc a bunch more people ...
> > > > > >
> > > > > >
> > > > > > > ---
> > > > > > > drivers/virtio/virtio_pci_common.c | 27
+++++++++++++++++++++------
> > > > > > > drivers/virtio/virtio_pci_common.h | 6
++++--
> > > > > > > drivers/virtio/virtio_pci_legacy.c | 5
+++--
> > > > > > > drivers/virtio/virtio_pci_modern.c | 6
++++--
> > > > > > > 4 files changed, 32 insertions(+), 12
deletions(-)
> > > > > > >
> > > > > > > diff --git
a/drivers/virtio/virtio_pci_common.c b/drivers/virtio/virtio_pci_common.c
> > > > > > > index b35bb2d57f62..0b9523e6dd39 100644
> > > > > > > --- a/drivers/virtio/virtio_pci_common.c
> > > > > > > +++ b/drivers/virtio/virtio_pci_common.c
> > > > > > > @@ -24,8 +24,8 @@
MODULE_PARM_DESC(force_legacy,
> > > > > > > "Force legacy mode for
transitional virtio 1 devices");
> > > > > > > #endif
> > > > > > >
> > > > > > > -/* wait for pending irq handlers */
> > > > > > > -void vp_synchronize_vectors(struct
virtio_device *vdev)
> > > > > > > +/* disable irq handlers */
> > > > > > > +void vp_disable_vectors(struct virtio_device
*vdev)
> > > > > > > {
> > > > > > > struct virtio_pci_device *vp_dev =
to_vp_device(vdev);
> > > > > > > int i;
> > > > > > > @@ -34,7 +34,20 @@ void
vp_synchronize_vectors(struct virtio_device *vdev)
> > > > > > >
synchronize_irq(vp_dev->pci_dev->irq);
> > > > > > >
> > > > > > > for (i = 0; i <
vp_dev->msix_vectors; ++i)
> > > > > > > -
synchronize_irq(pci_irq_vector(vp_dev->pci_dev, i));
> > > > > > > +
disable_irq(pci_irq_vector(vp_dev->pci_dev, i));
> > > > > > > +}
> > > > > > > +
> > > > > > > +/* enable irq handlers */
> > > > > > > +void vp_enable_vectors(struct virtio_device
*vdev)
> > > > > > > +{
> > > > > > > + struct virtio_pci_device *vp_dev =
to_vp_device(vdev);
> > > > > > > + int i;
> > > > > > > +
> > > > > > > + if (vp_dev->intx_enabled)
> > > > > > > + return;
> > > > > > > +
> > > > > > > + for (i = 0; i <
vp_dev->msix_vectors; ++i)
> > > > > > > +
enable_irq(pci_irq_vector(vp_dev->pci_dev, i));
> > > > > > > }
> > > > > > >
> > > > > > > /* the notify function used when creating a
virt queue */
> > > > > > > @@ -141,7 +154,8 @@ static int
vp_request_msix_vectors(struct virtio_device *vdev, int nvectors,
> > > > > > > snprintf(vp_dev->msix_names[v],
sizeof *vp_dev->msix_names,
> > > > > > > "%s-config", name);
> > > > > > > err =
request_irq(pci_irq_vector(vp_dev->pci_dev, v),
> > > > > > > - vp_config_changed, 0,
vp_dev->msix_names[v],
> > > > > > > + vp_config_changed,
IRQF_NO_AUTOEN,
> > > > > > > +
vp_dev->msix_names[v],
> > > > > > > vp_dev);
> > > > > > > if (err)
> > > > > > > goto error;
> > > > > > > @@ -160,7 +174,8 @@ static int
vp_request_msix_vectors(struct virtio_device *vdev, int nvectors,
> > > > > > >
snprintf(vp_dev->msix_names[v], sizeof *vp_dev->msix_names,
> > > > > > >
"%s-virtqueues", name);
> > > > > > > err =
request_irq(pci_irq_vector(vp_dev->pci_dev, v),
> > > > > > > -
vp_vring_interrupt, 0, vp_dev->msix_names[v],
> > > > > > > +
vp_vring_interrupt, IRQF_NO_AUTOEN,
> > > > > > > +
vp_dev->msix_names[v],
> > > > > > > vp_dev);
> > > > > > > if (err)
> > > > > > > goto error;
> > > > > > > @@ -337,7 +352,7 @@ static int
vp_find_vqs_msix(struct virtio_device *vdev, unsigned nvqs,
> > > > > > > "%s-%s",
> > > > > > >
dev_name(&vp_dev->vdev.dev), names[i]);
> > > > > > > err =
request_irq(pci_irq_vector(vp_dev->pci_dev, msix_vec),
> > > > > > > -
vring_interrupt, 0,
> > > > > > > +
vring_interrupt, IRQF_NO_AUTOEN,
> > > > > > >
vp_dev->msix_names[msix_vec],
> > > > > > > vqs[i]);
> > > > > > > if (err)
> > > > > > > diff --git
a/drivers/virtio/virtio_pci_common.h b/drivers/virtio/virtio_pci_common.h
> > > > > > > index beec047a8f8d..a235ce9ff6a5 100644
> > > > > > > --- a/drivers/virtio/virtio_pci_common.h
> > > > > > > +++ b/drivers/virtio/virtio_pci_common.h
> > > > > > > @@ -102,8 +102,10 @@ static struct
virtio_pci_device *to_vp_device(struct virtio_device *vdev)
> > > > > > > return container_of(vdev, struct
virtio_pci_device, vdev);
> > > > > > > }
> > > > > > >
> > > > > > > -/* wait for pending irq handlers */
> > > > > > > -void vp_synchronize_vectors(struct
virtio_device *vdev);
> > > > > > > +/* disable irq handlers */
> > > > > > > +void vp_disable_vectors(struct virtio_device
*vdev);
> > > > > > > +/* enable irq handlers */
> > > > > > > +void vp_enable_vectors(struct virtio_device
*vdev);
> > > > > > > /* the notify function used when creating a
virt queue */
> > > > > > > bool vp_notify(struct virtqueue *vq);
> > > > > > > /* the config->del_vqs() implementation
*/
> > > > > > > diff --git
a/drivers/virtio/virtio_pci_legacy.c b/drivers/virtio/virtio_pci_legacy.c
> > > > > > > index d62e9835aeec..bdf6bc667ab5 100644
> > > > > > > --- a/drivers/virtio/virtio_pci_legacy.c
> > > > > > > +++ b/drivers/virtio/virtio_pci_legacy.c
> > > > > > > @@ -97,8 +97,8 @@ static void vp_reset(struct
virtio_device *vdev)
> > > > > > > /* Flush out the status write, and
flush in device writes,
> > > > > > > * including MSi-X interrupts, if any.
*/
> > > > > > > ioread8(vp_dev->ioaddr +
VIRTIO_PCI_STATUS);
> > > > > > > - /* Flush pending VQ/configuration
callbacks. */
> > > > > > > - vp_synchronize_vectors(vdev);
> > > > > > > + /* Disable VQ/configuration callbacks.
*/
> > > > > > > + vp_disable_vectors(vdev);
> > > > > > > }
> > > > > > >
> > > > > > > static u16 vp_config_vector(struct
virtio_pci_device *vp_dev, u16 vector)
> > > > > > > @@ -194,6 +194,7 @@ static void del_vq(struct
virtio_pci_vq_info *info)
> > > > > > > }
> > > > > > >
> > > > > > > static const struct virtio_config_ops
virtio_pci_config_ops = {
> > > > > > > + .ready = vp_enable_vectors,
> > > > > > > .get = vp_get,
> > > > > > > .set = vp_set,
> > > > > > > .get_status = vp_get_status,
> > > > > > > diff --git
a/drivers/virtio/virtio_pci_modern.c b/drivers/virtio/virtio_pci_modern.c
> > > > > > > index 30654d3a0b41..acf0f6b6381d 100644
> > > > > > > --- a/drivers/virtio/virtio_pci_modern.c
> > > > > > > +++ b/drivers/virtio/virtio_pci_modern.c
> > > > > > > @@ -172,8 +172,8 @@ static void
vp_reset(struct virtio_device *vdev)
> > > > > > > */
> > > > > > > while (vp_modern_get_status(mdev))
> > > > > > > msleep(1);
> > > > > > > - /* Flush pending VQ/configuration
callbacks. */
> > > > > > > - vp_synchronize_vectors(vdev);
> > > > > > > + /* Disable VQ/configuration callbacks.
*/
> > > > > > > + vp_disable_vectors(vdev);
> > > > > > > }
> > > > > > >
> > > > > > > static u16 vp_config_vector(struct
virtio_pci_device *vp_dev, u16 vector)
> > > > > > > @@ -380,6 +380,7 @@ static bool
vp_get_shm_region(struct virtio_device *vdev,
> > > > > > > }
> > > > > > >
> > > > > > > static const struct virtio_config_ops
virtio_pci_config_nodev_ops = {
> > > > > > > + .ready = vp_enable_vectors,
> > > > > > > .get = NULL,
> > > > > > > .set = NULL,
> > > > > > > .generation = vp_generation,
> > > > > > > @@ -397,6 +398,7 @@ static const struct
virtio_config_ops virtio_pci_config_nodev_ops = {
> > > > > > > };
> > > > > > >
> > > > > > > static const struct virtio_config_ops
virtio_pci_config_ops = {
> > > > > > > + .ready = vp_enable_vectors,
> > > > > > > .get = vp_get,
> > > > > > > .set = vp_set,
> > > > > > > .generation = vp_generation,
> > > > > > > --
> > > > > > > 2.25.1
> > > > > >
> > > >
>