Willem de Bruijn
2023-Feb-22 15:14 UTC
[patch net-next v2] net: virtio_net: implement exact header length guest feature
Jiri Pirko wrote:> Tue, Feb 21, 2023 at 05:12:33PM CET, willemdebruijn.kernel at gmail.com wrote: > >Jiri Pirko wrote: > >> Tue, Feb 21, 2023 at 04:11:53PM CET, willemdebruijn.kernel at gmail.com wrote: > >> >Jiri Pirko wrote: > >> >> From: Jiri Pirko <jiri at nvidia.com> > >> >> > >> >> Virtio spec introduced a feature VIRTIO_NET_F_GUEST_HDRLEN which when > >> >> set implicates that the driver provides the exact size of the header. > >> >> > >> >> Quoting the original virtio spec: > >> >> "hdr_len is a hint to the device as to how much of the header needs to > >> >> be kept to copy into each packet" > >> >> > >> >> "a hint" might not be clear for the reader what does it mean, if it is > >> >> "maybe like that" of "exactly like that". This feature just makes it > >> >> crystal clear and let the device count on the hdr_len being filled up > >> >> by the exact length of header. > >> >> > >> >> Also note the spec already has following note about hdr_len: > >> >> "Due to various bugs in implementations, this field is not useful > >> >> as a guarantee of the transport header size." > >> >> > >> >> Without this feature the device needs to parse the header in core > >> >> data path handling. Accurate information helps the device to eliminate > >> >> such header parsing and directly use the hardware accelerators > >> >> for GSO operation. > >> >> > >> >> virtio_net_hdr_from_skb() fills up hdr_len to skb_headlen(skb). > >> >> The driver already complies to fill the correct value. Introduce the > >> >> feature and advertise it. > >> >> > >> >> Note that virtio spec also includes following note for device > >> >> implementation: > >> >> "Caution should be taken by the implementation so as to prevent > >> >> a malicious driver from attacking the device by setting > >> >> an incorrect hdr_len." > >> >> > >> >> There is a plan to support this feature in our emulated device. > >> >> A device of SolidRun offers this feature bit. They claim this feature > >> >> will save the device a few cycles for every GSO packet. > >> >> > >> >> Signed-off-by: Jiri Pirko <jiri at nvidia.com> > >> >> --- > >> >> v1->v2: > >> >> - extended patch description > >> > > >> >Is the expectation that in-kernel devices support this feature, and > >> >if so how would it affect them? If I read the spec correctly, devices > >> > >> Well, the tap driver actually trusts the hdr_len to be of correct header > >> size nowadays. > > > >tap_get_user performs basic bounds checking on the length passed. > > Sure. It trusts the hdr_len, but it sanitizes the input. > > > > > >> > >> >still need to be careful against malicious drivers, so cannot assume > >> >much beyond what they do today (i.e., a hint). > >> > >> Malicious how? There is upper limit of size in tap which is checked. > >> I assume that for hw implementation, that would be the same. > > > >A device cannot blindly trust a hdr_len passed from a driver. We have > >had bugs in the kernel with this before, such as the one fixed in > >commit 57031eb79490 ("packet: round up linear to header len"). > > > >> But anyway, this discussion would be rather part of the spec/device > >> patch, don't you think? > > > >I disagree. If it's not much effort to make a commit self-documenting > >that is preferable. And if not, then an explicit reference to an > >authoratitive external reference is preferable over "it is trivial to > >look it up". > > Sorry, I don't follow. What exactly do you want me to do?Either including the link that Michael shared or quoting the relevant part verbatim in the commit message would help, thanks. Thinking it over, my main concern is that the prescriptive section in the spec does not state what to do when the value is clearly garbage, as we have seen with syzkaller. Having to sanitize input, by dropping if < ETH_HLEN or > length, to me means that the device cannot trust the field, as the spec says it should. Sanitization is harder in the kernel, because it has to support all kinds of link layers, including variable length. Perhaps that's a discussion for the spec rather than this commit. But it's a point to clarify as we add support to the code.
Michael S. Tsirkin
2023-Feb-22 16:11 UTC
[patch net-next v2] net: virtio_net: implement exact header length guest feature
On Wed, Feb 22, 2023 at 10:14:21AM -0500, Willem de Bruijn wrote:> Either including the link that Michael shared or quoting the relevant > part verbatim in the commit message would help, thanks. > > Thinking it over, my main concern is that the prescriptive section in > the spec does not state what to do when the value is clearly garbage, > as we have seen with syzkaller. > > Having to sanitize input, by dropping if < ETH_HLEN or > length, to > me means that the device cannot trust the field, as the spec says it > should.Right. I think the implication is that if device detects and illegal value it's OK for it to just drop the packet or reset or enter a broken mode until reset. By contrast without the feature bit the header size can be used as a hint e.g. to size allocations but you must recover if it's incorrect. And yes tap seems to break if you make it too small or if you make it huge so it does not really follow the spec in this regard. Setting the flag will not fix tap because we can't really affort breaking all drivers who don't set it. But it will prepare the ground for when tens of years from now we actually look back and say all drivers set it, no problem. So that's a good reason to ack this patch. However if someone is worried about this then fixing tap so it recovers from incorrect header length without packet loss is a good idea.> Sanitization is harder in the kernel, because it has to support all > kinds of link layers, including variable length. > > Perhaps that's a discussion for the spec rather than this commit. But > it's a point to clarify as we add support to the code.-- MST
Possibly Parallel Threads
- [patch net-next v2] net: virtio_net: implement exact header length guest feature
- [PATCH net-next v2 0/5] virtio-net tx napi
- [PATCH net-next v2 0/5] virtio-net tx napi
- [PATCH net-next v3] virtio_net: Introduce skb_vnet_common_hdr to avoid typecasting
- [PATCH net-next] virtio_net: implement VIRTIO_CONFIG_S_NEEDS_RESET