From owner-freebsd-net Thu Nov 29 21:18:39 2001 Delivered-To: freebsd-net@freebsd.org Received: from c527597-a.cstvl1.sfba.home.com (c527597-a.cstvl1.sfba.home.com [24.176.204.87]) by hub.freebsd.org (Postfix) with ESMTP id 3AA5637B416; Thu, 29 Nov 2001 21:18:36 -0800 (PST) Received: (from bmah@localhost) by c527597-a.cstvl1.sfba.home.com (8.11.6/8.11.6) id fAU5IXx11078; Thu, 29 Nov 2001 21:18:33 -0800 (PST) (envelope-from bmah) Message-Id: <200111300518.fAU5IXx11078@c527597-a.cstvl1.sfba.home.com> X-Mailer: exmh version 2.5 07/13/2001 with nmh-1.0.4 To: Garrett Wollman Cc: Greg Lehey , net@FreeBSD.ORG Subject: TCP anomalies (was Re: FreeBSD performing worse than Linux?) In-Reply-To: <200111290113.fAT1DnH04474@khavrinen.lcs.mit.edu> References: <20011128102241.6887B380A@overcee.netplex.com.au> <20011128112006.195983808@overcee.netplex.com.au> <20011129105321.C74413@monorchid.lemis.com> <200111290113.fAT1DnH04474@khavrinen.lcs.mit.edu> Comments: In-reply-to Garrett Wollman message dated "Wed, 28 Nov 2001 20:13:49 -0500." From: "Bruce A. Mah" Reply-To: bmah@FreeBSD.ORG X-Face: g~c`.{#4q0"(V*b#g[i~rXgm*w;:nMfz%_RZLma)UgGN&=j`5vXoU^@n5v4:OO)c["!w)nD/!!~e4Sj7LiT'6*wZ83454H""lb{CC%T37O!!'S$S&D}sem7I[A 2V%N&+ X-Image-Url: http://www.employees.org/~bmah/Images/bmah-cisco-small.gif X-Url: http://www.employees.org/~bmah/ Mime-Version: 1.0 Content-Type: multipart/signed; boundary="==_Exmh_1808650111P"; micalg=pgp-sha1; protocol="application/pgp-signature" Content-Transfer-Encoding: 7bit Date: Thu, 29 Nov 2001 21:18:33 -0800 Sender: owner-freebsd-net@FreeBSD.ORG Precedence: bulk List-ID: List-Archive: (Web Archive) List-Help: (List Instructions) List-Subscribe: List-Unsubscribe: X-Loop: FreeBSD.org --==_Exmh_1808650111P Content-Type: text/plain; charset=us-ascii If memory serves me right, Garrett Wollman wrote: > Each trace shows a single large file transfer from a 4.4-stable > machine to my -current desktop over a local-area network. I'm pretty rusty at debugging TCP implementations, but I'll try to contribute something... Your 4.4-STABLE machine, is it from before or after rev 1.107.2.18 of sys/netinet/tcp_input.c? (Mon Nov 12 22:11:24 2001 UTC) I'm not sure how relevant this point is, but some of the anomalies I noticed seem related to fast retransmit (see below). Also...where did you do the trace (i.e. sender, receiver, or a third machine)? > test4 was > aborted about 10% into the transfer so that you have a chance at > looking at the whole thing in xplot. There are multiple pathologies > visible in the results, but a good place to start would be around > :56.44 in test4. test4 was the only trace I looked at. One thing that caught my eye is that the receiver seems to be sending a bunch of dupacks (in some cases, many more than needed to trigger fast retransmit) but no retransmit happens. In *most* cases, the receiver somehow gets the missing data because you can later see it acking later sequence numbers. The first place I saw this was at :41.504152. This looks a little odd, but it *could* be explained by data segments getting misordered somewhere and the dupacks getting lost. Another place to look is the large number of consecutive dupacks starting around :41.978767. I don't know what's happening here, but after a long time (about a second?!?) the sender finally gives up and sends the receiver what it wants. Cheers, Bruce. --==_Exmh_1808650111P Content-Type: application/pgp-signature -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.0.6 (FreeBSD) Comment: Exmh version 2.3.1+ 05/14/2001 iD8DBQE8Bxap2MoxcVugUsMRApAMAKDvOa1CwUxLt/XYWS+/eatsEX1N/QCfUwbt ce2ewwfnFV79kzsG6xRYgQA= =ha30 -----END PGP SIGNATURE----- --==_Exmh_1808650111P-- To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-net" in the body of the message