Skip site navigation (1)Skip section navigation (2)
Date:      Mon, 20 Feb 2017 19:36:47 +1100
From:      Lawrence Stewart <lstewart@freebsd.org>
To:        Steven Hartland <steven.hartland@multiplay.co.uk>, freebsd-transport@freebsd.org
Subject:   Re: TCP Receive buffer scaling without Timestamps
Message-ID:  <9d615fdb-c507-b6ef-2abe-e464a7849748@freebsd.org>
In-Reply-To: <2cc38c5d-78a9-4ce1-902e-70f2e55f14a2@multiplay.co.uk>
References:  <b9906198-a026-1403-73e6-279066b4e197@multiplay.co.uk> <be22c783-26d8-321d-d7b1-1edeb27d11e8@freebsd.org> <2cc38c5d-78a9-4ce1-902e-70f2e55f14a2@multiplay.co.uk>

next in thread | previous in thread | raw e-mail | index | archive | help
On 19/02/2017 12:37, Steven Hartland wrote:
> On 18/02/2017 00:33, Lawrence Stewart wrote:
>> On 18/02/2017 09:37, Steven Hartland wrote:
>> [snip]
>>> So the questions:
>>>
>>> 1. Has anyone looked at this issue before or is working on it?
>>> 2. Do people think we could add support for RTT estimates and enable
>>>    receive scaling without major tcp stack changes?
>> We already have a non-TS-enabled RTT estimator in place c.f. the
>> relevant code in tcp_input.c:
>>
>> if ((to.to_flags & TOF_TS) != 0 &&
>>     to.to_tsecr) {
>> 	uint32_t t;
>>
>> 	t = tcp_ts_getticks() - to.to_tsecr;
>> 	if (!tp->t_rttlow || tp->t_rttlow > t)
>> 		tp->t_rttlow = t;
>> 	tcp_xmit_timer(tp,
>> 	    TCP_TS_TO_TICKS(t) + 1);
>> } else if (tp->t_rtttime &&
>>     SEQ_GT(th->th_ack, tp->t_rtseq)) {
>> 	if (!tp->t_rttlow ||
>> 	    tp->t_rttlow > ticks - tp->t_rtttime)
>> 		tp->t_rttlow = ticks - tp->t_rtttime;
>> 	tcp_xmit_timer(tp,
>> 			ticks - tp->t_rtttime);
>> }
>>
>>
>> The autoscaling implementation just needs to have its insistence on
>> using timestamps stuffed into a cannon and shot in the direction of the
>> sun. This is the relevant block of rcvbuf code from tcp_input.c:
>>
>>
>> if (V_tcp_do_autorcvbuf &&
>>     (to.to_flags & TOF_TS) &&
>>     to.to_tsecr &&
>>     (so->so_rcv.sb_flags & SB_AUTOSIZE)) {
>> 	if (TSTMP_GT(to.to_tsecr, tp->rfbuf_ts) &&
>> 	    to.to_tsecr - tp->rfbuf_ts < hz) {
>> 		if (tp->rfbuf_cnt >
>> 		    (so->so_rcv.sb_hiwat / 8 * 7) &&
>> 		    so->so_rcv.sb_hiwat <
>> 		    V_tcp_autorcvbuf_max) {
>> 			newsize =
>> 			    min(so->so_rcv.sb_hiwat +
>> 			    V_tcp_autorcvbuf_inc,
>> 			    V_tcp_autorcvbuf_max);
>> 		}
>> 		/* Start over with next RTT. */
>> 		tp->rfbuf_ts = 0;
>> 		tp->rfbuf_cnt = 0;
>> 	} else
>> 		tp->rfbuf_cnt += tlen;  /* add up */
>> }
>>
>> It's pretty trivial to fix by anyone so inclined.
> Thanks for the pointers Lawrence, I'm not very familiar with network
> stack but I've had a stab at this, after reading through the current
> code in the blocks you pointed out, along side some experimentation ;-)

Nice, thanks for running with it.

> The results seem pretty encouraging, with S3 downloads with a ~17ms
> latency on a 1Gbps jumping from ~3MB/s to ~80MB/s.

Amazing what a bit of tuning can do :)

> I'd appreciate feedback on the review which can be found here:
> https://reviews.freebsd.org/D9668

ACK, will follow up there.

Cheers,
Lawrence



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?9d615fdb-c507-b6ef-2abe-e464a7849748>