Date: Tue, 28 Sep 2010 14:44:38 -0700
From: Pyun YongHyeon <pyunyh@gmail.com>
To: a.smith@ukgrid.net
Cc: freebsd-net@freebsd.org
Subject: Re: bge watchdog timeout errors FreeBSD 7.3
Message-ID: <20100928214438.GC1252@michelle.cdnetworks.com>
In-Reply-To: <20100928132445.72052m5iw9f41ns4@webmail2.ukgrid.net>
References: <20100923154054.21153ulpaucsnocg@webmail2.ukgrid.net>
 <20100924021115.GI15014@michelle.cdnetworks.com>
 <20100924123938.80702gxrzyfpury0@webmail2.ukgrid.net>
 <20100924165452.GA19036@michelle.cdnetworks.com>
 <20100927122713.12822br1odth4sro@webmail2.ukgrid.net>
 <20100927165129.GA1435@michelle.cdnetworks.com>
 <20100928132445.72052m5iw9f41ns4@webmail2.ukgrid.net>

On Tue, Sep 28, 2010 at 01:24:45PM +0100, a.smith@ukgrid.net wrote:
> Quoting Pyun YongHyeon <pyunyh@gmail.com>:
>
> >Oops, sorry. I forgot one more chunk. You need to apply this one in
> >addition to two patches.
> >http://svn.freebsd.org/viewvc/base/stable/7/sys/dev/bge/if_bgereg.h?r1=202861&r2=208995&view=patch
> >
>
> Hi,
>
> Ok I have installed the patches, and rebuilt the kernel.
> Unfortunately the errors persist,
>
>
> Sep 28 12:27:58 vcomm kernel: bge0: watchdog timeout -- resetting
> Sep 28 12:27:58 vcomm kernel: bge0: link state changed to DOWN
> Sep 28 12:28:00 vcomm kernel: bge0: link state changed to UP
>
> Although prior to the installation of the patch I tried to copy some
> backup files off the server via scp. Copying a large file ~2GB caused
> the network connection to drop and the copy to fail. Testing after
> applying the patch shows that this is now improved, I have ran a few
> copies without any problems...
>
> Where does that leave things?
>

Ok, thanks for testing. It seems you have another issue which is not
correctly handled in bge(4). I'm not sure you're actually seeing a
controller erratum, but could you try the patch at the following URL?
http://freefall.freebsd.org/~yongari/bge/bge.7.3R.post.diff

The patch includes all the patches I suggested, so please back out the
previous patches before applying it. The patch was written to get
better RX performance under high network load, and it also includes a
fix for a known hardware erratum. But it's highly experimental, and
it's not for non-MSI bge(4) controllers, because the patch may trigger
other locking issues due to the greatly increased rate of RX BD updates
in firmware for controllers that use a shared interrupt. It seems your
controller uses MSI, so you don't have to worry about that issue.
However, don't apply the patch to a production box.