Michael S. Tsirkin
2013-Nov-19 21:53 UTC
[PATCH net] virtio-net: fix page refcnt leaking when fail to allocate frag skb
On Tue, Nov 19, 2013 at 01:36:36PM -0800, Eric Dumazet wrote:> On Tue, 2013-11-19 at 22:49 +0200, Michael S. Tsirkin wrote: > > On Tue, Nov 19, 2013 at 06:03:48AM -0800, Eric Dumazet wrote: > > > On Tue, 2013-11-19 at 16:05 +0800, Jason Wang wrote: > > > > We need to drop the refcnt of page when we fail to allocate an skb for frag > > > > list, otherwise it will be leaked. The bug was introduced by commit > > > > 2613af0ed18a11d5c566a81f9a6510b73180660a ("virtio_net: migrate mergeable rx > > > > buffers to page frag allocators"). > > > > > > > > Cc: Michael Dalton <mwdalton at google.com> > > > > Cc: Eric Dumazet <edumazet at google.com> > > > > Cc: Rusty Russell <rusty at rustcorp.com.au> > > > > Cc: Michael S. Tsirkin <mst at redhat.com> > > > > Signed-off-by: Jason Wang <jasowang at redhat.com> > > > > --- > > > > The patch was needed for 3.12 stable. > > > > > > Good catch, but if we return from receive_mergeable() in the 'middle' > > > of the frags we would need for the current skb, who will > > > call the virtqueue_get_buf() to flush the remaining frags ? > > > > > > Don't we also need to call virtqueue_get_buf() like > > > > > > while (--num_buf) { > > > buf = virtqueue_get_buf(rq->vq, &len); > > > if (!buf) > > > break; > > > put_page(virt_to_head_page(buf)); > > > } > > > > > > ? > > > > > > > > > > > > Let me explain what worries me in your suggestion: > > > > struct sk_buff *nskb = alloc_skb(0, GFP_ATOMIC); > > if (unlikely(!nskb)) { > > head_skb->dev->stats.rx_dropped++; > > return -ENOMEM; > > } > > > > is this the failure case we are talking about? > > I thought Jason patch was about this, no ? > > > > > I think this is a symprom of a larger problem > > introduced by 2613af0ed18a11d5c566a81f9a6510b73180660a, > > namely that we now need to allocate memory in the > > middle of processing a packet. > > > > > > I think discarding a completely valid and well-formed > > packet from the receive queue because we are unable > > to allocate new memory with GFP_ATOMIC > > for future packets is not a good idea. > > How is it different with NIC processing in RX path ?Which NIC? Virtio? Prior to 2613af0ed18a11d5c566a81f9a6510b73180660a it didn't drop packets received from host as far as I can tell. virtio is more like a pipe than a real NIC in this respect.> > > > It certainly violates the principle of least surprize: > > when one sees host pass packet to guest, one expects > > the packet to get into the networking stack, not get > > dropped by the driver internally. > > Guest stack can do with the packet what it sees fit. > > > > We actually wake up a thread if we can't fill up the queue, > > that will fill it up in GFP_KERNEL context. > > > > So I think we should find a way to pre-allocate if necessary and avoid > > error paths where allocating new memory is a required to avoid drops. > > > > Really, under ATOMIC context, there is no way you can avoid dropping > packets if you cannot allocate memory. If you cannot allocate sk_buff > (256 bytes !!), you wont be able to allocate the 1500+ bytes to hold the > payload of next packets anyway.that's why we do: if (!try_fill_recv(rq, GFP_ATOMIC)) schedule_delayed_work(&vi->refill, 0); the queues are large enough for a single failure not to be an immediate problem.> Same problem on a real NIC. > > Under memory pressure we _do_ packet drops. > Nobody really complained. > > Sure, you can add yet another cache of pre-allocated skbs and pay the > price of managing yet another cache layer, but still need to trop > packets under stress.We don't need a cache even. Just enough to avoid dropping packets if allocation failed in the middle so we don't dequeue a buffer and then drop it. Once we use this reserved skb, we stop processing the queue until refill gives it back.> Pre-allocating skb on real NIC has a performance cost, because we clear > sk_buff way ahead of time. By the time skb is finally received, cpu has > to bring back into its cache memory cache lines. >Alternatively we can pre-allocate the memory but avoid clearing it maybe? -- MST
Eric Dumazet
2013-Nov-19 22:00 UTC
[PATCH net] virtio-net: fix page refcnt leaking when fail to allocate frag skb
On Tue, 2013-11-19 at 23:53 +0200, Michael S. Tsirkin wrote:> Which NIC? Virtio? Prior to 2613af0ed18a11d5c566a81f9a6510b73180660a > it didn't drop packets received from host as far as I can tell. > virtio is more like a pipe than a real NIC in this respect.Prior/after to this patch, you were not posting buffers, so if packets were received on a physical NIC, you were dropping the packets anyway. It makes no difference at all, adding a cushion might make you feel better, but its really not worth it. Under memory stress, it makes better sense to drop a super big GRO packet (The one needing frag_list extension ...) It gives a better signal to the sender to reduce its pressure, and gives opportunity to free more of your memory.
Michael Dalton
2013-Nov-20 01:34 UTC
[PATCH net] virtio-net: fix page refcnt leaking when fail to allocate frag skb
Hi, After further reflection I think we're looking at two related issues: (a) a memory leak that Jason has identified that occurs when a memory allocation fails in receive_mergeable. Jasons commit solves this issue. (b) virtio-net does not dequeue all buffers for a packet in the case that an error occurs on receive and mergeable receive buffers is enabled. For (a), this bug is new and due to changes in 2613af0ed18a, and the net impact is memory leak on the physical page. However, I believe (b) has always been possible in some form because if page_to_skb() returns NULL (e.g., due to SKB allocation failure), receive_mergeable is never called. AFAICT this is also the behavior prior to 2613af0ed18a. The net impact of (b) would be that virtio-net would interpret a packet buffer that is in the middle of a mergeable packet as the start of a new packet, which is definitely also a bug (and the buffer contents could contain bytes that resembled a valid virtio-net header). A solution for (b) will require handling both the page_to_skb memory allocation failures and the memory allocation failures in receive_mergeable introduced by 2613af0ed18a. Best, Mike
Michael S. Tsirkin
2013-Nov-20 08:58 UTC
[PATCH net] virtio-net: fix page refcnt leaking when fail to allocate frag skb
On Tue, Nov 19, 2013 at 02:00:11PM -0800, Eric Dumazet wrote:> On Tue, 2013-11-19 at 23:53 +0200, Michael S. Tsirkin wrote: > > > Which NIC? Virtio? Prior to 2613af0ed18a11d5c566a81f9a6510b73180660a > > it didn't drop packets received from host as far as I can tell. > > virtio is more like a pipe than a real NIC in this respect. > > Prior/after to this patch, you were not posting buffers, so if packets > were received on a physical NIC, you were dropping the packets anyway. > > It makes no difference at all, adding a cushion might make you feel > better, but its really not worth it. > > Under memory stress, it makes better sense to drop a super big GRO > packet (The one needing frag_list extension ...) > > It gives a better signal to the sender to reduce its pressure, and gives > opportunity to free more of your memory. >OK, but in that case one wonders whether we should do more to free memory? E.g. imagine that we dropped a packet of a specific TCP flow because we couldn't allocate a new packet. What happens now is that the old packet is freed as well. So quite likely the next packet in queue will get processed since it will reuse the memory we have just freed. The next packet and the next after it etc all will have to go through the net stack until they get at the socket and are dropped then because we missed a segment. Even worse, GRO gets disabled so the load on receiver goes up instead of down. Sounds like a problem doesn't it? GRO actually detects it's the same flow and can see packet is out of sequence. Why doesn't it drop the packet then? Alternatively, we could (for example using the pre-allocated skb like I suggested) notify GRO that it should start dropping packets of this flow. What do you think? -- MST
Seemingly Similar Threads
- [PATCH net] virtio-net: fix page refcnt leaking when fail to allocate frag skb
- [PATCH net] virtio-net: fix page refcnt leaking when fail to allocate frag skb
- [PATCH net] virtio-net: fix page refcnt leaking when fail to allocate frag skb
- [PATCH net] virtio-net: fix page refcnt leaking when fail to allocate frag skb
- [PATCH net] virtio-net: fix page refcnt leaking when fail to allocate frag skb