Linus Lüssing
2016-Feb-15 02:07 UTC
[Bridge] [PATCH] net: fix bridge multicast packet checksum validation
We need to update the skb->csum after pulling the skb, otherwise checksum validation will fail. This fixes multicast packet loss for bridges and splats like the following: [...] [ 43.986968] eth0: hw csum failure [ 43.990344] CPU: 3 PID: 0 Comm: swapper/3 Not tainted 4.4.0 #2 [ 43.996193] Hardware name: BCM2709 [ 43.999647] [<800204e0>] (unwind_backtrace) from [<8001cf14>] (show_stack+0x10/0x14) [ 44.007432] [<8001cf14>] (show_stack) from [<801ab614>] (dump_stack+0x80/0x90) [ 44.014695] [<801ab614>] (dump_stack) from [<802e4548>] (__skb_checksum_complete+0x6c/0xac) [ 44.023090] [<802e4548>] (__skb_checksum_complete) from [<803a055c>] (ipv6_mc_validate_checksum+0x104/0x178) [ 44.032959] [<803a055c>] (ipv6_mc_validate_checksum) from [<802e111c>] (skb_checksum_trimmed+0x130/0x188) [ 44.042565] [<802e111c>] (skb_checksum_trimmed) from [<803a06e8>] (ipv6_mc_check_mld+0x118/0x338) [ 44.051501] [<803a06e8>] (ipv6_mc_check_mld) from [<803b2c98>] (br_multicast_rcv+0x5dc/0xd00) [ 44.060077] [<803b2c98>] (br_multicast_rcv) from [<803aa510>] (br_handle_frame_finish+0xac/0x51c) [...] Fixes: 9afd85c9e455 ("net: Export IGMP/MLD message validation code") Reported-by: ?lvaro Fern?ndez Rojas <noltari at gmail.com> Signed-off-by: Linus L?ssing <linus.luessing at c0d3.blue> --- Steinar, can you check whether this fixes the bridge issues you reported on bugzilla #99081? Not quite sure whether it is the same as yours as you do not seem to have any such call traces. I am not super happy with how this patch looks, but there is no "skb_push_rcsum" available and skb_pull_rcsum() seems non-reversible as is. Alternative suggestions always welcome. net/core/skbuff.c | 19 ++++++++++++++++++- 1 file changed, 18 insertions(+), 1 deletion(-) diff --git a/net/core/skbuff.c b/net/core/skbuff.c index 5bf88f5..6c34ef6 100644 --- a/net/core/skbuff.c +++ b/net/core/skbuff.c @@ -4076,6 +4076,11 @@ struct sk_buff *skb_checksum_trimmed(struct sk_buff *skb, struct sk_buff *skb_chk; unsigned int offset = skb_transport_offset(skb); __sum16 ret; + int ip_summed; + int csum_valid; + int csum_level; + int csum_bad; + __wsum csum; skb_chk = skb_checksum_maybe_trim(skb, transport_len); if (!skb_chk) @@ -4084,10 +4089,22 @@ struct sk_buff *skb_checksum_trimmed(struct sk_buff *skb, if (!pskb_may_pull(skb_chk, offset)) goto err; - __skb_pull(skb_chk, offset); + ip_summed = skb->ip_summed; + csum_valid = skb->csum_valid; + csum_level = skb->csum_level; + csum_bad = skb->csum_bad; + csum = skb->csum; + + skb_pull_rcsum(skb_chk, offset); ret = skb_chkf(skb_chk); __skb_push(skb_chk, offset); + skb->ip_summed = ip_summed; + skb->csum_valid = csum_valid; + skb->csum_level = csum_level; + skb->csum_bad = csum_bad; + skb->csum = csum; + if (ret) goto err; -- 1.7.10.4
Steinar H. Gunderson
2016-Feb-18 12:51 UTC
[Bridge] [PATCH] net: fix bridge multicast packet checksum validation
On Mon, Feb 15, 2016 at 03:07:06AM +0100, Linus L?ssing wrote:> Steinar, can you check whether this fixes the bridge issues you reported on > bugzilla #99081? Not quite sure whether it is the same as yours as you > do not seem to have any such call traces.It doesn't immediately sound like the same problem; why would promisc change anything if the problem is the checksumming? I don't have any reboots scheduled for this machine right now, but I'll see what I can do wrt. testing. /* Steinar */ -- Homepage: https://www.sesse.net/
Linus Lüssing
2016-Feb-18 13:36 UTC
[Bridge] [PATCH] net: fix bridge multicast packet checksum validation
On Thu, Feb 18, 2016 at 01:51:34PM +0100, Steinar H. Gunderson wrote:> On Mon, Feb 15, 2016 at 03:07:06AM +0100, Linus L?ssing wrote: > > Steinar, can you check whether this fixes the bridge issues you reported on > > bugzilla #99081? Not quite sure whether it is the same as yours as you > > do not seem to have any such call traces. > > It doesn't immediately sound like the same problem; why would promisc change > anything if the problem is the checksumming?The mdb you provided in the bugzilla ticket misses reports, so it was unable to parse reports. Which could point to a checksumming problem. Enabling promisc probably did not fix the parsing for you, but instead promisc forces to forward packets upstream on your interface independent of the mdb. I would assume that even with promisc, your output from "bridge mdb show" looks rather empty. Can you check?> > I don't have any reboots scheduled for this machine right now, but I'll see > what I can do wrt. testing.Thanks :).
David Miller
2016-Feb-18 20:35 UTC
[Bridge] [PATCH] net: fix bridge multicast packet checksum validation
From: Linus L?ssing <linus.luessing at c0d3.blue> Date: Mon, 15 Feb 2016 03:07:06 +0100> @@ -4084,10 +4089,22 @@ struct sk_buff *skb_checksum_trimmed(struct sk_buff *skb, > if (!pskb_may_pull(skb_chk, offset)) > goto err; > > - __skb_pull(skb_chk, offset); > + ip_summed = skb->ip_summed; > + csum_valid = skb->csum_valid; > + csum_level = skb->csum_level; > + csum_bad = skb->csum_bad; > + csum = skb->csum; > + > + skb_pull_rcsum(skb_chk, offset); > ret = skb_chkf(skb_chk); > __skb_push(skb_chk, offset); > > + skb->ip_summed = ip_summed; > + skb->csum_valid = csum_valid; > + skb->csum_level = csum_level; > + skb->csum_bad = csum_bad; > + skb->csum = csum; > +There really has to be a better way to fix this :-/