tcp: add tcp_add_backlog()

When TCP operates in lossy environments (between 1 and 10 % packet losses), many SACK blocks can be exchanged, and I noticed we could drop them on busy senders, if these SACK blocks have to be queued into the socket backlog. While the main cause is the poor performance of RACK/SACK processing, we can try to avoid these drops of valuable information that can lead to spurious timeouts and retransmits. Cause of the drops is the skb->truesize overestimation caused by : - drivers allocating ~2048 (or more) bytes as a fragment to hold an Ethernet frame. - various pskb_may_pull() calls bringing the headers into skb->head might have pulled all the frame content, but skb->truesize could not be lowered, as the stack has no idea of each fragment truesize. The backlog drops are also more visible on bidirectional flows, since their sk_rmem_alloc can be quite big. Let's add some room for the backlog, as only the socket owner can selectively take action to lower memory needs, like collapsing receive queues or partial ofo pruning. Signed-off-by: Eric Dumazet <edumazet@google.com> Cc: Yuchung Cheng <ycheng@google.com> Cc: Neal Cardwell <ncardwell@google.com> Acked-by: Neal Cardwell <ncardwell@google.com> Signed-off-by: David S. Miller <davem@davemloft.net>
author: Eric Dumazet <edumazet@google.com> 2016-08-27 07:37:54 -0700
committer: David S. Miller <davem@davemloft.net> 2016-08-29 00:20:24 -0400
commit: c9c3321257e1b95be9b375f811fb250162af8d39 (patch)
tree: b3c1a7773a51c0076cc8213a86a47ea6e06d1024 /net/ipv6/tcp_ipv6.c
parent: 1a8ff8f52faebe28d145324884357fcdf33322c0 (diff)
1 files changed, 1 insertions, 4 deletions
diff --git a/net/ipv6/tcp_ipv6.c b/net/ipv6/tcp_ipv6.c
index e4f55683af31..5bf460bd299f 100644
--- a/net/ipv6/tcp_ipv6.c
+++ b/net/ipv6/tcp_ipv6.c
@@ -1467,10 +1467,7 @@ process:
 	if (!sock_owned_by_user(sk)) {
 		if (!tcp_prequeue(sk, skb))
 			ret = tcp_v6_do_rcv(sk, skb);
-	} else if (unlikely(sk_add_backlog(sk, skb,
-					   sk->sk_rcvbuf + sk->sk_sndbuf))) {
-		bh_unlock_sock(sk);
-		__NET_INC_STATS(net, LINUX_MIB_TCPBACKLOGDROP);
+	} else if (tcp_add_backlog(sk, skb)) {
 		goto discard_and_relse;
 	}
 	bh_unlock_sock(sk);
author	Eric Dumazet <edumazet@google.com>	2016-08-27 07:37:54 -0700
committer	David S. Miller <davem@davemloft.net>	2016-08-29 00:20:24 -0400
commit	c9c3321257e1b95be9b375f811fb250162af8d39 (patch)
tree	b3c1a7773a51c0076cc8213a86a47ea6e06d1024 /net/ipv6/tcp_ipv6.c
parent	1a8ff8f52faebe28d145324884357fcdf33322c0 (diff)