From owner-freebsd-current@FreeBSD.ORG Sun Nov 11 22:23:27 2007 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id CAC9A16A41A for ; Sun, 11 Nov 2007 22:23:27 +0000 (UTC) (envelope-from andre@freebsd.org) Received: from c00l3r.networx.ch (c00l3r.networx.ch [62.48.2.2]) by mx1.freebsd.org (Postfix) with ESMTP id 33C5613C4A3 for ; Sun, 11 Nov 2007 22:23:26 +0000 (UTC) (envelope-from andre@freebsd.org) Received: (qmail 91176 invoked from network); 11 Nov 2007 21:58:15 -0000 Received: from c00l3r.networx.ch (HELO [127.0.0.1]) ([62.48.2.2]) (envelope-sender ) by c00l3r.networx.ch (qmail-ldap-1.03) with SMTP for ; 11 Nov 2007 21:58:15 -0000 Message-ID: <473780DB.2040705@freebsd.org> Date: Sun, 11 Nov 2007 23:23:23 +0100 From: Andre Oppermann User-Agent: Thunderbird 1.5.0.13 (Windows/20070809) MIME-Version: 1.0 To: Gregory Wright References: <46B41421-3112-40C6-84D9-094FA771F93E@antiope.com> <4735CE3A.7020905@freebsd.org> In-Reply-To: Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Cc: freebsd-current@freebsd.org Subject: Re: excessive TCP dulplicate acks revisted X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 11 Nov 2007 22:23:27 -0000 Gregory Wright wrote: > > On Nov 10, 2007, at 10:28 AM, Andre Oppermann wrote: > >> Gregory Wright wrote: >>> (Note: long message) >>> Hi, >>> The tcp duplicate ACK attack is back. >>> Last March, there was a thread on duplicate TCP acks in -CURRENT. >>> I have been able to reproduce the problem on 7.0-BETA2 (amd64) >>> and have some new information that might help locate the bug. >>> Background: I first noticed problems with tcp connections dropping >>> when Bacula was running on our backup server. The hardware was >>> a dual Opteron 244, 2 GB RAM, running FreeBSD 6.2-RELEASE-p2. >>> The ethernet NIC was a bge. >>> We went through an extensive process to rule out hardware and cabling >>> problems: the server hardware was replaced with a single Opteron 270 >>> (dual core) on a Tyan S2882-D motherboard. The memory was replaced >>> as well. The NICs are still bge. This machine is "hardtack". >> >> This may be a TSO bug in the bge hardware. Please repeat your tests in >> the original setup with TSO disabled on the bge interfaces ("ifconfig >> bgeX -tso"). >> Second if you've got another network card with something else than bge >> please put it into the same box and run the tests as well. >> >> --Andre >> > > Hi Andre, > > I also took a look at the bge (4) driver in 7.0-BETA2. As far as I can > tell, > it does not support TSO (there is no ioctl supporting TSO enable/disable > as there is for the em(4) driver). OK. > Might the chip --- a BCM5704_B0 --- not be completely initialized? This > might explain why the machine with the BCM5714_B3 chips works, while > the other machine shows the duplicate ACK bug. Perhaps. Do you see the duplicate ACKs in a tcpdump on both the sender and the receiver? If you see it on the sender too, then it must be a bug in our network stack or the driver (by requeuing the same packet over and over again). -- Andre