From owner-freebsd-current@FreeBSD.ORG Fri Jan 18 21:13:24 2008 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 0C05C16A417; Fri, 18 Jan 2008 21:13:24 +0000 (UTC) (envelope-from sgk@troutmask.apl.washington.edu) Received: from troutmask.apl.washington.edu (troutmask.apl.washington.edu [128.208.78.105]) by mx1.freebsd.org (Postfix) with ESMTP id D73DF13C465; Fri, 18 Jan 2008 21:13:23 +0000 (UTC) (envelope-from sgk@troutmask.apl.washington.edu) Received: from troutmask.apl.washington.edu (localhost.apl.washington.edu [127.0.0.1]) by troutmask.apl.washington.edu (8.14.2/8.14.2) with ESMTP id m0ILDRnT051350; Fri, 18 Jan 2008 13:13:27 -0800 (PST) (envelope-from sgk@troutmask.apl.washington.edu) Received: (from sgk@localhost) by troutmask.apl.washington.edu (8.14.2/8.14.2/Submit) id m0ILDRup051349; Fri, 18 Jan 2008 13:13:27 -0800 (PST) (envelope-from sgk) Date: Fri, 18 Jan 2008 13:13:27 -0800 From: Steve Kargl To: Andre Oppermann Message-ID: <20080118211327.GA50720@troutmask.apl.washington.edu> References: <1199966437.1545.27.camel@localhost> <20080110175347.GA68673@troutmask.apl.washington.edu> <4790F680.1090204@freebsd.org> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <4790F680.1090204@freebsd.org> User-Agent: Mutt/1.4.2.3i Cc: Tom Evans , freebsd-current@freebsd.org Subject: Re: Regular bge watchdog timeouts on 7.0-PRERELEASE X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 18 Jan 2008 21:13:24 -0000 On Fri, Jan 18, 2008 at 07:57:04PM +0100, Andre Oppermann wrote: > Steve Kargl wrote: > >On Thu, Jan 10, 2008 at 12:00:37PM +0000, Tom Evans wrote: > > > >>I am encountering regular watchdog timeouts on bge: > >> > >>Jan 9 08:36:11 zoot kernel: bge0: watchdog timeout -- resetting > >>Jan 9 08:36:11 zoot kernel: bge0: link state changed to DOWN > >>Jan 9 08:36:13 zoot kernel: bge0: link state changed to UP > > > >Add the following to /etc/sysctl.conf > > > >net.inet.tcp.sendspace=131072 > >net.inet.tcp.recvspace=131072 > > In 7.0 these are automatically tuning and can be left at the default > settings. I started using the above before automatic tuning was available, and I haven't revisited whether these are still needed. "If it works, why fix it?" motto. > >net.inet.tcp.path_mtu_discovery=0 > > You should not disable path MTU discovery. It'll most likely break the > internet for you when you encounter for example PPPoE links. This is on a intranet. A small cluster used for MPI computations. I won't run into PPPoE issues, but it's good to know that problems can occur. > >net.inet.udp.recvspace=65536 > >net.inet.raw.recvspace=16384 > >kern.ipc.nmbclusters=50000 > >kern.ipc.shm_use_phys=1 > >net.inet.tcp.rexmit_min=30 > > These changes do not really have much influence on the bge problem > (at least theoretically). The first 3 are needed to make NFS happy on my cluster. The shm change is needed for MPICH2's nemesis device. I don't remember why I set rexmit_min. See motto above. -- Steve