> From: Jean-Philippe Brucker <jean-philippe at linaro.org> > Sent: Wednesday, October 13, 2021 8:11 PM > > Support identity domains, allowing to only enable IOMMU protection for a > subset of endpoints (those assigned to userspace, for example). Users > may enable identity domains at compile time > (CONFIG_IOMMU_DEFAULT_PASSTHROUGH), boot time > (iommu.passthrough=1) or > runtime (/sys/kernel/iommu_groups/*/type = identity).Do we want to use consistent terms between spec (bypass domain) and code (identity domain)?> > Patches 1-2 support identity domains using the optional > VIRTIO_IOMMU_F_BYPASS_CONFIG feature. The feature bit is not yet in the > spec, see [1] for the latest proposal. > > Patches 3-5 add a fallback to identity mappings, when the feature is not > supported. > > Note that this series doesn't touch the global bypass bit added by > VIRTIO_IOMMU_F_BYPASS_CONFIG. All endpoints managed by the IOMMU > should > be attached to a domain, so global bypass isn't in use after endpointsI saw a concept of deferred attach in iommu core. See iommu_is_ attach_deferred(). Currently this is vendor specific and I haven't looked into the exact reason why some vendor sets it now. Just be curious whether the same reason might be applied to virtio-iommu.> are probed. Before that, the global bypass policy is decided by the > hypervisor and firmware. So I don't think Linux needs to touch theThis reminds me one thing. The spec says that the global bypass bit is sticky and not affected by reset. This implies that in the case of rebooting the VM into a different OS, the previous OS actually has the right to override this setting for the next OS. Is it a right design? Even the firmware itself is unable to identify the original setting enforced by the hypervisor after reboot. I feel the hypervisor setting should be recovered after reset since it reflects the security measure enforced by the virtual platform?> global bypass bit, but there are some patches available on my > virtio-iommu/bypass branch [2] to test it. > > QEMU patches are on my virtio-iommu/bypass branch [3] (and the list) > > [1] https://www.mail-archive.com/virtio-dev at lists.oasis- > open.org/msg07898.html > [2] https://jpbrucker.net/git/linux/log/?h=virtio-iommu/bypass > [3] https://jpbrucker.net/git/qemu/log/?h=virtio-iommu/bypass > > Jean-Philippe Brucker (5): > iommu/virtio: Add definitions for VIRTIO_IOMMU_F_BYPASS_CONFIG > iommu/virtio: Support bypass domains > iommu/virtio: Sort reserved regions > iommu/virtio: Pass end address to viommu_add_mapping() > iommu/virtio: Support identity-mapped domains > > include/uapi/linux/virtio_iommu.h | 8 ++- > drivers/iommu/virtio-iommu.c | 113 +++++++++++++++++++++++++----- > 2 files changed, 101 insertions(+), 20 deletions(-) > > -- > 2.33.0
On Thu, Oct 14, 2021 at 03:00:38AM +0000, Tian, Kevin wrote:> I saw a concept of deferred attach in iommu core. See iommu_is_ > attach_deferred(). Currently this is vendor specific and I haven't > looked into the exact reason why some vendor sets it now. Just > be curious whether the same reason might be applied to virtio-iommu.The reason for attach_deferred is kdump support, where the IOMMU driver needs to keep the mappings from the old kernel until the device driver of the new kernel takes over. Regards, Joerg
Jean-Philippe Brucker
2021-Oct-18 15:23 UTC
[PATCH 0/5] iommu/virtio: Add identity domains
On Thu, Oct 14, 2021 at 03:00:38AM +0000, Tian, Kevin wrote:> > From: Jean-Philippe Brucker <jean-philippe at linaro.org> > > Sent: Wednesday, October 13, 2021 8:11 PM > > > > Support identity domains, allowing to only enable IOMMU protection for a > > subset of endpoints (those assigned to userspace, for example). Users > > may enable identity domains at compile time > > (CONFIG_IOMMU_DEFAULT_PASSTHROUGH), boot time > > (iommu.passthrough=1) or > > runtime (/sys/kernel/iommu_groups/*/type = identity). > > Do we want to use consistent terms between spec (bypass domain) > and code (identity domain)?I don't think we have to. Linux uses "identity" domains and "passthrough" IOMMU. The old virtio-iommu feature was "bypass" so we should keep that for the new one, to be consistent. And then I've used "bypass" for domains as well, in the spec, because it would look strange to use a different term for the same concept. I find that it sort of falls into place: Linux' identity domains can be implemented either with bypass or identity-mapped virtio-iommu domains.> > > > Patches 1-2 support identity domains using the optional > > VIRTIO_IOMMU_F_BYPASS_CONFIG feature. The feature bit is not yet in the > > spec, see [1] for the latest proposal. > > > > Patches 3-5 add a fallback to identity mappings, when the feature is not > > supported. > > > > Note that this series doesn't touch the global bypass bit added by > > VIRTIO_IOMMU_F_BYPASS_CONFIG. All endpoints managed by the IOMMU > > should > > be attached to a domain, so global bypass isn't in use after endpoints > > I saw a concept of deferred attach in iommu core. See iommu_is_ > attach_deferred(). Currently this is vendor specific and I haven't > looked into the exact reason why some vendor sets it now. Just > be curious whether the same reason might be applied to virtio-iommu. > > > are probed. Before that, the global bypass policy is decided by the > > hypervisor and firmware. So I don't think Linux needs to touch the > > This reminds me one thing. The spec says that the global bypass > bit is sticky and not affected by reset.The spec talks about *device* reset, triggered by software writing 0 to the status register, but it doesn't mention system reset. Would be good to clarify that. Something like: If the device offers the VIRTIO_IOMMU_F_BYPASS_CONFIG feature, it MAY initialize the \field{bypass} field to 1. Field \field{bypass} SHOULD NOT change on device reset, but SHOULD be restored to its initial value on system reset.> This implies that in the case > of rebooting the VM into a different OS, the previous OS actually > has the right to override this setting for the next OS. Is it a right > design? Even the firmware itself is unable to identify the original > setting enforced by the hypervisor after reboot. I feel the hypervisor > setting should be recovered after reset since it reflects the > security measure enforced by the virtual platform?So I think clarifying system reset should address your questions. I believe we should leave bypass sticky across device reset, so a FW->OS transition, where the OS resets the device, does not open a vulnerability (if bypass was enabled at boot and then left disabled by FW.) It's still a good idea for the OS to restore on shutdown the bypass value it was booted with. So it can kexec into an OS that doesn't support virtio-iommu, for example. Thanks, Jean