Antonio Quartulli
2013-Sep-26 20:19 UTC
[Bridge] [RFC net] bridge: clean the nf_bridge status when forwarding the skb
From: Antonio Quartulli <antonio at open-mesh.com>
Even if enslaving a bridge interface into another bridge is
forbidden, it is still possible to create a chain of
virtual interfaces including two distinct bridges.
In this case, the skb entering the second bridge could have
the nf_bridge field already set due to a previous operation
and consequently lead to a wrong processing of the packet
itself.
To prevent this behaviour release and set to NULL the
nf_bridge field of the skb when exiting the bridge interface.
Signed-off-by: Antonio Quartulli <antonio at open-mesh.com>
---
I am not sure if this is a wanted behaviour or a real BUG. I found this
"misbehaviour" while testing batman-adv with the following
configuration:
- br0 (bridge interface) having bat0 and eth0 as slaves
- bat0 (which is a virtual interface provided by the batman-adv module and that
works similarly to a bridge - to some extends) having br1 as slave
- br1 (second bridge interface) having eth1 as slave
Then follow these events:
- a broadcast packet arrives on eth0
- the skb enters br0 and skb->nf_bridge gets initialised and used
- the skb enters bat0 and the packet *gets encapsulated in the batman-adv packet
which adds a batman-adv header and another Ethernet header*
- the skb enters br1 and gets ruined because nf_bridge_maybe_copy_header() (in
br_dev_queue_push_xmit()) will try to restore an header that does not make
sense anymore.
With this patch the nf_bridge gets de-initialised before exiting br0 and
therefore it is processed properly inside br1: nf_bridge_maybe_copy_header()
does not take place at all because nf_bridge is never initialised (the packet is
non-IP since it is a batman-adv packet)
To the developers of the bridge module I would like to ask:
1) is skb->nf_bridge allowed to be non NULL when entering br_dev_xmit() ? If
so,
when is this supposed to happen?
2) do you think this patch is logically correct but the nf_bridge release should
be done in batman-adv since it is the one re-encapsulating the packet?
I hope I have made the problem clear.
Best regards,
net/bridge/br_forward.c | 5 +++++
1 file changed, 5 insertions(+)
diff --git a/net/bridge/br_forward.c b/net/bridge/br_forward.c
index 4b81b14..65864bc 100644
--- a/net/bridge/br_forward.c
+++ b/net/bridge/br_forward.c
@@ -49,6 +49,11 @@ int br_dev_queue_push_xmit(struct sk_buff *skb)
} else {
skb_push(skb, ETH_HLEN);
br_drop_fake_rtable(skb);
+
+ /* clean the NF bridge data */
+ nf_bridge_put(skb->nf_bridge);
+ skb->nf_bridge = NULL;
+
dev_queue_xmit(skb);
}
--
1.8.1.5
Stephen Hemminger
2013-Sep-26 21:08 UTC
Re: [RFC net] bridge: clean the nf_bridge status when forwarding the skb
On Thu, 26 Sep 2013 22:19:50 +0200 Antonio Quartulli <antonio@meshcoding.com> wrote:> From: Antonio Quartulli <antonio@open-mesh.com> > > Even if enslaving a bridge interface into another bridge is > forbidden, it is still possible to create a chain of > virtual interfaces including two distinct bridges. > > In this case, the skb entering the second bridge could have > the nf_bridge field already set due to a previous operation > and consequently lead to a wrong processing of the packet > itself. > > To prevent this behaviour release and set to NULL the > nf_bridge field of the skb when exiting the bridge interface. > > Signed-off-by: Antonio Quartulli <antonio@open-mesh.com> > --- > > I am not sure if this is a wanted behaviour or a real BUG. I found this > "misbehaviour" while testing batman-adv with the following configuration: > > - br0 (bridge interface) having bat0 and eth0 as slaves > - bat0 (which is a virtual interface provided by the batman-adv module and that > works similarly to a bridge - to some extends) having br1 as slave > - br1 (second bridge interface) having eth1 as slave > > Then follow these events: > - a broadcast packet arrives on eth0 > - the skb enters br0 and skb->nf_bridge gets initialised and used > - the skb enters bat0 and the packet *gets encapsulated in the batman-adv packet > which adds a batman-adv header and another Ethernet header* > - the skb enters br1 and gets ruined because nf_bridge_maybe_copy_header() (in > br_dev_queue_push_xmit()) will try to restore an header that does not make > sense anymore. > > With this patch the nf_bridge gets de-initialised before exiting br0 and > therefore it is processed properly inside br1: nf_bridge_maybe_copy_header() > does not take place at all because nf_bridge is never initialised (the packet is > non-IP since it is a batman-adv packet) > > To the developers of the bridge module I would like to ask: > 1) is skb->nf_bridge allowed to be non NULL when entering br_dev_xmit() ? If so, > when is this supposed to happen? > > 2) do you think this patch is logically correct but the nf_bridge release should > be done in batman-adv since it is the one re-encapsulating the packet? > > > I hope I have made the problem clear. > > Best regards, > > > net/bridge/br_forward.c | 5 +++++ > 1 file changed, 5 insertions(+) > > diff --git a/net/bridge/br_forward.c b/net/bridge/br_forward.c > index 4b81b14..65864bc 100644 > --- a/net/bridge/br_forward.c > +++ b/net/bridge/br_forward.c > @@ -49,6 +49,11 @@ int br_dev_queue_push_xmit(struct sk_buff *skb) > } else { > skb_push(skb, ETH_HLEN); > br_drop_fake_rtable(skb); > + > + /* clean the NF bridge data */ > + nf_bridge_put(skb->nf_bridge); > + skb->nf_bridge = NULL; > + > dev_queue_xmit(skb); > } >You need to bracket this with CONFIG_BRIDGE_NETFILTER since nf_bridge is only in skbuff if that option is enabled.
Stephen Hemminger
2013-Sep-26 21:08 UTC
[Bridge] [RFC net] bridge: clean the nf_bridge status when forwarding the skb
On Thu, 26 Sep 2013 22:19:50 +0200 Antonio Quartulli <antonio at meshcoding.com> wrote:> From: Antonio Quartulli <antonio at open-mesh.com> > > Even if enslaving a bridge interface into another bridge is > forbidden, it is still possible to create a chain of > virtual interfaces including two distinct bridges. > > In this case, the skb entering the second bridge could have > the nf_bridge field already set due to a previous operation > and consequently lead to a wrong processing of the packet > itself. > > To prevent this behaviour release and set to NULL the > nf_bridge field of the skb when exiting the bridge interface. > > Signed-off-by: Antonio Quartulli <antonio at open-mesh.com> > --- > > I am not sure if this is a wanted behaviour or a real BUG. I found this > "misbehaviour" while testing batman-adv with the following configuration: > > - br0 (bridge interface) having bat0 and eth0 as slaves > - bat0 (which is a virtual interface provided by the batman-adv module and that > works similarly to a bridge - to some extends) having br1 as slave > - br1 (second bridge interface) having eth1 as slave > > Then follow these events: > - a broadcast packet arrives on eth0 > - the skb enters br0 and skb->nf_bridge gets initialised and used > - the skb enters bat0 and the packet *gets encapsulated in the batman-adv packet > which adds a batman-adv header and another Ethernet header* > - the skb enters br1 and gets ruined because nf_bridge_maybe_copy_header() (in > br_dev_queue_push_xmit()) will try to restore an header that does not make > sense anymore. > > With this patch the nf_bridge gets de-initialised before exiting br0 and > therefore it is processed properly inside br1: nf_bridge_maybe_copy_header() > does not take place at all because nf_bridge is never initialised (the packet is > non-IP since it is a batman-adv packet) > > To the developers of the bridge module I would like to ask: > 1) is skb->nf_bridge allowed to be non NULL when entering br_dev_xmit() ? If so, > when is this supposed to happen? > > 2) do you think this patch is logically correct but the nf_bridge release should > be done in batman-adv since it is the one re-encapsulating the packet? > > > I hope I have made the problem clear. > > Best regards, > > > net/bridge/br_forward.c | 5 +++++ > 1 file changed, 5 insertions(+) > > diff --git a/net/bridge/br_forward.c b/net/bridge/br_forward.c > index 4b81b14..65864bc 100644 > --- a/net/bridge/br_forward.c > +++ b/net/bridge/br_forward.c > @@ -49,6 +49,11 @@ int br_dev_queue_push_xmit(struct sk_buff *skb) > } else { > skb_push(skb, ETH_HLEN); > br_drop_fake_rtable(skb); > + > + /* clean the NF bridge data */ > + nf_bridge_put(skb->nf_bridge); > + skb->nf_bridge = NULL; > + > dev_queue_xmit(skb); > } >You need to bracket this with CONFIG_BRIDGE_NETFILTER since nf_bridge is only in skbuff if that option is enabled.
Stephen Hemminger
2013-Sep-26 21:10 UTC
[Bridge] [RFC net] bridge: clean the nf_bridge status when forwarding the skb
On Thu, 26 Sep 2013 22:19:50 +0200 Antonio Quartulli <antonio at meshcoding.com> wrote:> From: Antonio Quartulli <antonio at open-mesh.com> > > Even if enslaving a bridge interface into another bridge is > forbidden, it is still possible to create a chain of > virtual interfaces including two distinct bridges. > > In this case, the skb entering the second bridge could have > the nf_bridge field already set due to a previous operation > and consequently lead to a wrong processing of the packet > itself. > > To prevent this behaviour release and set to NULL the > nf_bridge field of the skb when exiting the bridge interface. > > Signed-off-by: Antonio Quartulli <antonio at open-mesh.com> > --- > > I am not sure if this is a wanted behaviour or a real BUG. I found this > "misbehaviour" while testing batman-adv with the following configuration: > > - br0 (bridge interface) having bat0 and eth0 as slaves > - bat0 (which is a virtual interface provided by the batman-adv module and that > works similarly to a bridge - to some extends) having br1 as slave > - br1 (second bridge interface) having eth1 as slave > > Then follow these events: > - a broadcast packet arrives on eth0 > - the skb enters br0 and skb->nf_bridge gets initialised and used > - the skb enters bat0 and the packet *gets encapsulated in the batman-adv packet > which adds a batman-adv header and another Ethernet header* > - the skb enters br1 and gets ruined because nf_bridge_maybe_copy_header() (in > br_dev_queue_push_xmit()) will try to restore an header that does not make > sense anymore. > > With this patch the nf_bridge gets de-initialised before exiting br0 and > therefore it is processed properly inside br1: nf_bridge_maybe_copy_header() > does not take place at all because nf_bridge is never initialised (the packet is > non-IP since it is a batman-adv packet) > > To the developers of the bridge module I would like to ask: > 1) is skb->nf_bridge allowed to be non NULL when entering br_dev_xmit() ? If so, > when is this supposed to happen? > > 2) do you think this patch is logically correct but the nf_bridge release should > be done in batman-adv since it is the one re-encapsulating the packet? > > > I hope I have made the problem clear. > > Best regards, > > > net/bridge/br_forward.c | 5 +++++ > 1 file changed, 5 insertions(+) > > diff --git a/net/bridge/br_forward.c b/net/bridge/br_forward.c > index 4b81b14..65864bc 100644 > --- a/net/bridge/br_forward.c > +++ b/net/bridge/br_forward.c > @@ -49,6 +49,11 @@ int br_dev_queue_push_xmit(struct sk_buff *skb) > } else { > skb_push(skb, ETH_HLEN); > br_drop_fake_rtable(skb); > + > + /* clean the NF bridge data */ > + nf_bridge_put(skb->nf_bridge); > + skb->nf_bridge = NULL; > + > dev_queue_xmit(skb); > } >I think the header will also be garbage if bridge on bridge with netfilter is used. See nf_bridge_save_header.