Date: Tue, 8 Mar 2016 10:29:32 +0900 From: Yongmin Cho <yongmincho82@gmail.com> To: hiren panchasara <hiren@strugglingcoder.info> Cc: transport@freebsd.org Subject: Re: In TCP recovery state, problem of computing the pipe(amount of data in flight). Message-ID: <20160308012930.GA24199@yongmincho-All-Series> In-Reply-To: <20160229175453.GA82027@strugglingcoder.info> References: <20160225112625.GA5680@yongmincho-All-Series> <20160227031604.GP31665@strugglingcoder.info> <20160229084045.GA21079@yongmincho-All-Series> <20160229175453.GA82027@strugglingcoder.info>
next in thread | previous in thread | raw e-mail | index | archive | help
--oyUTqETQ0mS9luUI
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
On Mon, Feb 29, 2016 at 09:54:53AM -0800, hiren panchasara wrote:
> On 02/29/16 at 05:40P, Yongmin Cho wrote:
> > Thanks very much for the quick reply.
> >
> > I'm sorry, I didn't make patch file.
> > First, I wanted to discuss my opinion is right.
> >
> > Let me give you example, please check this.
> >
> > <- ACK (cumack = 1, sack[3-4], sack[6-7], sack[9-10])
> > * here segment/byte 2, 5, and 8 are missing.
> > <- ACK (cumack = 1, sack[12-13], sack[15-16], sack[18-19])
> > * here segment/bytes 11, 14 and 17 are also reported missing.
> > <- ACK (cumack = 1, sack[18-19], sack[21-24], sack[27, 28])
> > * here segment/bytes 20 and 25, 26 are missing. (triggered fast
> > retransmission)
> > <- ACK.....(cumack = 1, sack[21-24], sack[27, 28], sack[32, 33])
> > <- ACK.....(cumack = 1, sack[34-35], sack[37, 40], sack[42, 44])
> > <- ACK.....(cumack = 1, sack[34-35], sack[37, 40], sack[42, 45])
> > <- ACKs.....(many duplication acks, and new sacked blocks)
> >
> > In the fast recovery phase, the pipe is caculated like below, If the
> > net.inet.tcp.rfc6675_pipe is turned on.
> > pipe = snd_max - snd_una + sack_bytes_rexmit(1 MSS size) -
> > sacked_bytes(10 = 34-35, 37-40, 42-45 tcp_sack.c:390)
> > One segment is sended(sack_bytes_rexmit), when triggered fast
> > retransmission.
> > Because the snd_cwnd was set 1 mss size. (tcp_input.c:2609)
> > In the fast recovery phase, The sender can send data,
> > If this condition is right(awnd < tp->snd_ssthresh tcp_input.c:2568).
> > When in the network, It still has many in flight packets,
> > snd_max and snd_una will not be changed, and sack_bytes_rexmit is one MSS
> > size, and sacked_bytes is caculated by last ACK that has three SACK
> > blocks(34-35, 37-40, 42-45).
> > So, sometimes(In my test environment) the awnd(pipe) value can't go
> > down less than the snd_ssthresh, while receiving each ACKs
> > in fast recovery phase.
> > You know, If the awnd value can't go down less than the snd_ssthresh,
> > The sender can't send data that is included snd_holes.
> >
> > So, I think, the sacked_bytes should be caculated by all of sacked
> > blocks that is greater than snd_una, like below.
> > pipe = snd_max - snd_una + sack_bytes_rexmit - sacked_bytes(3-4, 6-7,
> > 9-10, 12-13, 15-16, 18-19, 21-24, 27-28, ...).
> >
> > But on current implementation, the sacked_bytes is caculated by three(or four)
> > sacked blocks that is in last ACK, like below.
> > pipe = snd_max - snd_una + sack_bytes_rexmit - sacked_bytes(34-35,
> > 37-40, 42-45 -> sacked blocks of last ACK).
> >
> > My opinion may not be right. Just I want to check implementation of
> > computing pipe.
>
> Your opinion seems correct to me. If you can create a patch based on
> this, that'd be great. As I cannot spend time on this until next week.
>
> Cheers,
> Hiren
I've created a patch based on this issue.
You know, The sack_blocks array in tcp_sack_doack function(tcp_sack.c)
have only 3 or 4 sacked blocks.
But We need to know all the data that has been sacked.
The snd_holes queue(in tcp_cb structure) is updated every time we get
an ACK with sack blocks.
So, the snd_holes queue know all of hole information.
And, We can get the sacked_bytes from the snd_holes.
So, I calculated the sacked_bytes every time when updating the
snd_holes.
I've tested this patch in my test environment, And I think, this
implementation is working well.
Please check this patch. any feedback will be welcome.
Thanks.
--oyUTqETQ0mS9luUI
Content-Type: text/x-diff; charset=us-ascii
Content-Disposition: attachment; filename="tcp_sack.diff"
Index: sys/netinet/tcp_sack.c
===================================================================
--- sys/netinet/tcp_sack.c (revision 296480)
+++ sys/netinet/tcp_sack.c (working copy)
@@ -355,12 +355,13 @@ tcp_sack_doack(struct tcpcb *tp, struct tcpopt *to
{
struct sackhole *cur, *temp;
struct sackblk sack, sack_blocks[TCP_MAX_SACK + 1], *sblkp;
- int i, j, num_sack_blks, sack_changed;
+ int i, j, num_sack_blks, sack_changed, sacked_bytes;
INP_WLOCK_ASSERT(tp->t_inpcb);
num_sack_blks = 0;
sack_changed = 0;
+ sacked_bytes = 0;
/*
* If SND.UNA will be advanced by SEG.ACK, and if SACK holes exist,
* treat [SND.UNA, SEG.ACK) as if it is a SACK block.
@@ -374,7 +375,6 @@ tcp_sack_doack(struct tcpcb *tp, struct tcpopt *to
* received new blocks from the other side.
*/
if (to->to_flags & TOF_SACK) {
- tp->sackhint.sacked_bytes = 0; /* reset */
for (i = 0; i < to->to_nsacks; i++) {
bcopy((to->to_sacks + i * TCPOLEN_SACK),
&sack, sizeof(sack));
@@ -387,8 +387,6 @@ tcp_sack_doack(struct tcpcb *tp, struct tcpopt *to
SEQ_GT(sack.end, tp->snd_una) &&
SEQ_LEQ(sack.end, tp->snd_max)) {
sack_blocks[num_sack_blks++] = sack;
- tp->sackhint.sacked_bytes +=
- (sack.end-sack.start);
}
}
}
@@ -446,6 +444,7 @@ tcp_sack_doack(struct tcpcb *tp, struct tcpopt *to
temp = tcp_sackhole_insert(tp, tp->snd_fack,sblkp->start,NULL);
if (temp != NULL) {
tp->snd_fack = sblkp->end;
+ sacked_bytes += (sblkp->end - sblkp->start);
/* Go to the previous sack block. */
sblkp--;
sack_changed = 1;
@@ -462,11 +461,14 @@ tcp_sack_doack(struct tcpcb *tp, struct tcpopt *to
SEQ_LT(tp->snd_fack, sblkp->start))
sblkp--;
if (sblkp >= sack_blocks &&
- SEQ_LT(tp->snd_fack, sblkp->end))
+ SEQ_LT(tp->snd_fack, sblkp->end)) {
+ sacked_bytes += (sblkp->end - tp->snd_fack);
tp->snd_fack = sblkp->end;
+ }
}
} else if (SEQ_LT(tp->snd_fack, sblkp->end)) {
/* fack is advanced. */
+ sacked_bytes += (sblkp->end - tp->snd_fack);
tp->snd_fack = sblkp->end;
sack_changed = 1;
}
@@ -503,6 +505,10 @@ tcp_sack_doack(struct tcpcb *tp, struct tcpopt *to
/* Data acks at least the beginning of hole. */
if (SEQ_GEQ(sblkp->end, cur->end)) {
/* Acks entire hole, so delete hole. */
+ if (SEQ_GT(cur->start, th_ack))
+ sacked_bytes += (cur->end - cur->start);
+ else
+ sacked_bytes -= (sblkp->end - cur->end);
temp = cur;
cur = TAILQ_PREV(cur, sackhole_head, scblink);
tcp_sackhole_remove(tp, temp);
@@ -514,6 +520,8 @@ tcp_sack_doack(struct tcpcb *tp, struct tcpopt *to
continue;
} else {
/* Move start of hole forward. */
+ if (SEQ_GT(sblkp->start, th_ack))
+ sacked_bytes += (sblkp->end - cur->start);
cur->start = sblkp->end;
cur->rxmit = SEQ_MAX(cur->rxmit, cur->start);
}
@@ -521,6 +529,7 @@ tcp_sack_doack(struct tcpcb *tp, struct tcpopt *to
/* Data acks at least the end of hole. */
if (SEQ_GEQ(sblkp->end, cur->end)) {
/* Move end of hole backward. */
+ sacked_bytes += (cur->end - sblkp->start);
cur->end = sblkp->start;
cur->rxmit = SEQ_MIN(cur->rxmit, cur->end);
} else {
@@ -528,6 +537,7 @@ tcp_sack_doack(struct tcpcb *tp, struct tcpopt *to
* ACKs some data in middle of a hole; need
* to split current hole
*/
+ sacked_bytes += (sblkp->end - sblkp->start);
temp = tcp_sackhole_insert(tp, sblkp->end,
cur->end, cur);
if (temp != NULL) {
@@ -554,6 +564,7 @@ tcp_sack_doack(struct tcpcb *tp, struct tcpopt *to
else
sblkp--;
}
+ tp->sackhint.sacked_bytes += sacked_bytes;
return (sack_changed);
}
--oyUTqETQ0mS9luUI--
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20160308012930.GA24199>
