From owner-freebsd-net@FreeBSD.ORG Thu Jul 5 17:27:26 2012 Return-Path: Delivered-To: freebsd-net@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id AC969106566C for ; Thu, 5 Jul 2012 17:27:26 +0000 (UTC) (envelope-from emz@norma.perm.ru) Received: from elf.hq.norma.perm.ru (unknown [IPv6:2001:470:1f09:14c0::2]) by mx1.freebsd.org (Postfix) with ESMTP id 592778FC08 for ; Thu, 5 Jul 2012 17:27:25 +0000 (UTC) Received: from [192.168.248.32] ([192.168.248.32]) by elf.hq.norma.perm.ru (8.14.5/8.14.5) with ESMTP id q65HRNKa056822 (version=TLSv1/SSLv3 cipher=DHE-RSA-CAMELLIA256-SHA bits=256 verify=NO) for ; Thu, 5 Jul 2012 23:27:23 +0600 (YEKT) (envelope-from emz@norma.perm.ru) Message-ID: <4FF5CE74.9060005@norma.perm.ru> Date: Thu, 05 Jul 2012 23:27:16 +0600 From: "Eugene M. Zheganin" User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:13.0) Gecko/20120614 Thunderbird/13.0.1 MIME-Version: 1.0 To: freebsd-net@freebsd.org Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.2.7 (elf.hq.norma.perm.ru [192.168.3.10]); Thu, 05 Jul 2012 23:27:23 +0600 (YEKT) X-Spam-Status: No hits=-101.0 bayes=0.5 testhits ALL_TRUSTED=-1, USER_IN_WHITELIST=-100 autolearn=unavailable version=3.3.2 X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on elf.hq.norma.perm.ru Subject: bge watchdog timeout - resetting X-BeenThere: freebsd-net@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Networking and TCP/IP with FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 05 Jul 2012 17:27:26 -0000 Hi, I'm having troubles with one FreeBSD server, running 8.2-STABLE. Randomly I get 'bge watchdog timeout - resetting' errors on it's console. It can run 1-2 months without problem, then I can get these errors like twice per day. They are appearing in bunches, and the system becomes irresponsive: I can see it's alive on the console, I can input the text, but I cannot login. Then I have to reset it. The system is also partially dropping the traffic (I mean partially receiveing and processing) - I have an identical box with carp running, but it cannot become MASTER, as it seems like some part of the traffic is handled by the troubled device. This box is an IBM x3250m2 server, I have like two dozens of this, and this is like the most distrurbing one. I'm also getting these messages on another box, also running 8.2-STABLE, but others are doing just fine (they are running mostly 8.2-RELEASE or even 7.x FreeBSDs, but I also have a couple of STABLEs running just fine). What can be done here (can something be done) ? Is it worth to change the cable or the catalyst port (did it help someone ?) ? Would it help to turn on the hw.bge.allow_asf ? the bge0 is (as the pciconf reports is): Broadcom NetXtreme BCM5722 Gigabit (94309) I can also say that it's running one of the recent firmwares, as I updated it already from IBM bomc, in order to resolve the situation, but didn't help much. Thanks. Eugene.