thr3ads.net - Linux Virtualization - [RFC PATCH 1/1] virtio: write back features before verify [Oct 2021]

If this information is useful, please help other people find it:
Share via:

Cornelia Huck

2021-Oct-04 15:50 UTC

[RFC PATCH 1/1] virtio: write back features before verify

On Mon, Oct 04 2021, "Michael S. Tsirkin" <mst at redhat.com>
wrote:
> On Mon, Oct 04, 2021 at 04:33:21PM +0200, Cornelia Huck wrote:
>> On Mon, Oct 04 2021, "Michael S. Tsirkin" <mst at
redhat.com> wrote:
>> 
>> > On Mon, Oct 04, 2021 at 02:19:55PM +0200, Cornelia Huck wrote:
>> >> 
>> >> [cc:qemu-devel]
>> >> 
>> >> On Sat, Oct 02 2021, "Michael S. Tsirkin" <mst at
redhat.com> wrote:
>> >> 
>> >> > On Fri, Oct 01, 2021 at 09:21:25AM +0200, Halil Pasic
wrote:
>> >> >> On Thu, 30 Sep 2021 07:12:21 -0400
>> >> >> "Michael S. Tsirkin" <mst at
redhat.com> wrote:
>> >> >> 
>> >> >> > On Thu, Sep 30, 2021 at 03:20:49AM +0200, Halil
Pasic wrote:
>> >> >> > > This patch fixes a regression introduced by
commit 82e89ea077b9
>> >> >> > > ("virtio-blk: Add validation for block
size in config space") and
>> >> >> > > enables similar checks in verify() on big
endian platforms.
>> >> >> > > 
>> >> >> > > The problem with checking multi-byte config
fields in the verify
>> >> >> > > callback, on big endian platforms, and with
a possibly transitional
>> >> >> > > device is the following. The verify()
callback is called between
>> >> >> > > config->get_features() and
virtio_finalize_features(). That we have a
>> >> >> > > device that offered F_VERSION_1 then we
have the following options
>> >> >> > > either the device is transitional, and then
it has to present the legacy
>> >> >> > > interface, i.e. a big endian config space
until F_VERSION_1 is
>> >> >> > > negotiated, or we have a non-transitional
device, which makes
>> >> >> > > F_VERSION_1 mandatory, and only implements
the non-legacy interface and
>> >> >> > > thus presents a little endian config space.
Because at this point we
>> >> >> > > can't know if the device is
transitional or non-transitional, we can't
>> >> >> > > know do we need to byte swap or not.  
>> >> >> > 
>> >> >> > Hmm which transport does this refer to?
>> >> >> 
>> >> >> It is the same with virtio-ccw and virtio-pci. I see
the same problem
>> >> >> with both on s390x. I didn't try with
virtio-blk-pci-non-transitional
>> >> >> yet (have to figure out how to do that with libvirt)
for pci I used
>> >> >> virtio-blk-pci.
>> >> >> 
>> >> >> > Distinguishing between legacy and modern drivers
is transport
>> >> >> > specific.  PCI presents
>> >> >> > legacy and modern at separate addresses so
distinguishing
>> >> >> > between these two should be no trouble.
>> >> >> 
>> >> >> You mean the device id? Yes that is bolted down in
the spec, but
>> >> >> currently we don't exploit that information.
Furthermore there
>> >> >> is a fat chance that with QEMU even the allegedly
non-transitional
>> >> >> devices only present a little endian config space
after VERSION_1
>> >> >> was negotiated. Namely get_config for virtio-blk is
implemented in
>> >> >> virtio_blk_update_config() which does
virtio_stl_p(vdev,
>> >> >> &blkcfg.blk_size, blk_size) and in there we
don't care
>> >> >> about transitional or not:
>> >> >> 
>> >> >> static inline bool
virtio_access_is_big_endian(VirtIODevice *vdev)
>> >> >> {
>> >> >> #if defined(LEGACY_VIRTIO_IS_BIENDIAN)
>> >> >>     return virtio_is_big_endian(vdev);
>> >> >> #elif defined(TARGET_WORDS_BIGENDIAN)
>> >> >>     if (virtio_vdev_has_feature(vdev,
VIRTIO_F_VERSION_1)) {
>> >> >>         /* Devices conforming to VIRTIO 1.0 or later
are always LE. */
>> >> >>         return false;
>> >> >>     }
>> >> >>     return true;
>> >> >> #else
>> >> >>     return false;
>> >> >> #endif
>> >> >> }
>> >> >> 
>> >> >
>> >> > ok so that's a QEMU bug. Any virtio 1.0 and up
>> >> > compatible device must use LE.
>> >> > It can also present a legacy config space where the
>> >> > endian depends on the guest.
>> >> 
>> >> So, how is the virtio core supposed to determine this? A
>> >> transport-specific callback?
>> >
>> > I'd say a field in VirtIODevice is easiest.
>> 
>> The transport needs to set this as soon as it has figured out whether
>> we're using legacy or not.
>
> Basically on each device config access?
Prior to the first one, I think. It should not change again, should it?
>
>> I guess we also need to fence off any
>> accesses respectively error out the device if the driver tries any
>> read/write operations that would depend on that knowledge?
>> 
>> And using a field in VirtIODevice would probably need some care when
>> migrating. Hm...
>
> It's just a shorthand to minimize changes. No need to migrate I think.
If we migrate in from an older QEMU, we don't know whether we are
dealing with legacy or not, until feature negotiation is already
done... don't we have to ask the transport?

Michael S. Tsirkin

2021-Oct-04 19:17 UTC

head link

[RFC PATCH 1/1] virtio: write back features before verify

On Mon, Oct 04, 2021 at 05:50:44PM +0200, Cornelia Huck
wrote:> On Mon, Oct 04 2021, "Michael S. Tsirkin" <mst at
redhat.com> wrote:
> 
> > On Mon, Oct 04, 2021 at 04:33:21PM +0200, Cornelia Huck wrote:
> >> On Mon, Oct 04 2021, "Michael S. Tsirkin" <mst at
redhat.com> wrote:
> >> 
> >> > On Mon, Oct 04, 2021 at 02:19:55PM +0200, Cornelia Huck
wrote:
> >> >> 
> >> >> [cc:qemu-devel]
> >> >> 
> >> >> On Sat, Oct 02 2021, "Michael S. Tsirkin"
<mst at redhat.com> wrote:
> >> >> 
> >> >> > On Fri, Oct 01, 2021 at 09:21:25AM +0200, Halil
Pasic wrote:
> >> >> >> On Thu, 30 Sep 2021 07:12:21 -0400
> >> >> >> "Michael S. Tsirkin" <mst at
redhat.com> wrote:
> >> >> >> 
> >> >> >> > On Thu, Sep 30, 2021 at 03:20:49AM +0200,
Halil Pasic wrote:
> >> >> >> > > This patch fixes a regression
introduced by commit 82e89ea077b9
> >> >> >> > > ("virtio-blk: Add validation for
block size in config space") and
> >> >> >> > > enables similar checks in verify() on
big endian platforms.
> >> >> >> > > 
> >> >> >> > > The problem with checking multi-byte
config fields in the verify
> >> >> >> > > callback, on big endian platforms, and
with a possibly transitional
> >> >> >> > > device is the following. The verify()
callback is called between
> >> >> >> > > config->get_features() and
virtio_finalize_features(). That we have a
> >> >> >> > > device that offered F_VERSION_1 then
we have the following options
> >> >> >> > > either the device is transitional, and
then it has to present the legacy
> >> >> >> > > interface, i.e. a big endian config
space until F_VERSION_1 is
> >> >> >> > > negotiated, or we have a
non-transitional device, which makes
> >> >> >> > > F_VERSION_1 mandatory, and only
implements the non-legacy interface and
> >> >> >> > > thus presents a little endian config
space. Because at this point we
> >> >> >> > > can't know if the device is
transitional or non-transitional, we can't
> >> >> >> > > know do we need to byte swap or not.  
> >> >> >> > 
> >> >> >> > Hmm which transport does this refer to?
> >> >> >> 
> >> >> >> It is the same with virtio-ccw and virtio-pci. I
see the same problem
> >> >> >> with both on s390x. I didn't try with
virtio-blk-pci-non-transitional
> >> >> >> yet (have to figure out how to do that with
libvirt) for pci I used
> >> >> >> virtio-blk-pci.
> >> >> >> 
> >> >> >> > Distinguishing between legacy and modern
drivers is transport
> >> >> >> > specific.  PCI presents
> >> >> >> > legacy and modern at separate addresses so
distinguishing
> >> >> >> > between these two should be no trouble.
> >> >> >> 
> >> >> >> You mean the device id? Yes that is bolted down
in the spec, but
> >> >> >> currently we don't exploit that information.
Furthermore there
> >> >> >> is a fat chance that with QEMU even the
allegedly non-transitional
> >> >> >> devices only present a little endian config
space after VERSION_1
> >> >> >> was negotiated. Namely get_config for virtio-blk
is implemented in
> >> >> >> virtio_blk_update_config() which does
virtio_stl_p(vdev,
> >> >> >> &blkcfg.blk_size, blk_size) and in there we
don't care
> >> >> >> about transitional or not:
> >> >> >> 
> >> >> >> static inline bool
virtio_access_is_big_endian(VirtIODevice *vdev)
> >> >> >> {
> >> >> >> #if defined(LEGACY_VIRTIO_IS_BIENDIAN)
> >> >> >>     return virtio_is_big_endian(vdev);
> >> >> >> #elif defined(TARGET_WORDS_BIGENDIAN)
> >> >> >>     if (virtio_vdev_has_feature(vdev,
VIRTIO_F_VERSION_1)) {
> >> >> >>         /* Devices conforming to VIRTIO 1.0 or
later are always LE. */
> >> >> >>         return false;
> >> >> >>     }
> >> >> >>     return true;
> >> >> >> #else
> >> >> >>     return false;
> >> >> >> #endif
> >> >> >> }
> >> >> >> 
> >> >> >
> >> >> > ok so that's a QEMU bug. Any virtio 1.0 and up
> >> >> > compatible device must use LE.
> >> >> > It can also present a legacy config space where the
> >> >> > endian depends on the guest.
> >> >> 
> >> >> So, how is the virtio core supposed to determine this? A
> >> >> transport-specific callback?
> >> >
> >> > I'd say a field in VirtIODevice is easiest.
> >> 
> >> The transport needs to set this as soon as it has figured out
whether
> >> we're using legacy or not.
> >
> > Basically on each device config access?
> 
> Prior to the first one, I think. It should not change again, should it?
Well yes but we never prohibited someone from poking at both ..
Doing it on each access means we don't have state to migrate.
> >
> >> I guess we also need to fence off any
> >> accesses respectively error out the device if the driver tries any
> >> read/write operations that would depend on that knowledge?
> >> 
> >> And using a field in VirtIODevice would probably need some care
when
> >> migrating. Hm...
> >
> > It's just a shorthand to minimize changes. No need to migrate I
think.
> 
> If we migrate in from an older QEMU, we don't know whether we are
> dealing with legacy or not, until feature negotiation is already
> done... don't we have to ask the transport?
Right but the only thing that can happen is config access.
Well and for legacy a kick I guess.

-- 
MST

Linux Virtualization - Oct 2021 - [RFC PATCH 1/1] virtio: write back features before verify

[RFC PATCH 1/1] virtio: write back features before verify

[RFC PATCH 1/1] virtio: write back features before verify