Date: Sat, 19 Apr 2008 22:12:45 +0900
From: "George V. Neville-Neil"
To: "Jack Vogel"
Cc: jfv@freebsd.org, stable@freebsd.org, Jeremy Chadwick
Subject: Re: Problems with em0 failing to initialize on stable?

At Thu, 17 Apr 2008 10:32:33 -0700, Jack Vogel wrote:
>
> On Thu, Apr 17, 2008 at 8:28 AM, Ben Stuyts wrote:
> >
> > On 17 apr 2008, at 16:11, Jeremy Chadwick wrote:
> >
> > > On Thu, Apr 17, 2008 at 02:02:51PM +0200, Ben Stuyts wrote:
> > > > On 17 apr 2008, at 13:20, Jeremy Chadwick wrote:
> > > > > On Thu, Apr 17, 2008 at 06:32:32PM +0900, gnn@freebsd.org wrote:
> > > > > > I am running 7-STABLE with machines that are net booted. On occasion,
> > > > > > that is not with any level of predictability, this happens:
> > > > > >
> > > > > > em0: Hardware Initialization Failed
> > > > > > em0: Unable to initialize the hardware
> > > > > >
> > > > > > which of course stops the machine in its tracks. A normal dmesg is
> > > > > > also included.
> > > > > >
> > > > > > Any steps I should take to help debug this?
> > > > >
> > > > > George, can you provide the following?
> > > > >
> > > > > * Motherboard type and model (a URL to the board would be good)
> > > > > * kenv | grep smbios
> > > > > * pciconf -lv
> > > >
> > > > I have seen this too. A couple of times on a new server here. Maybe one in
> > > > two or three reboots. So far, it only happens after a reboot, but not after
> > > > a hard reset or power cycle. This board has 3 em network if's. Two on the
> > > > board itself, and one on a daughter card on the ipmi add-on card. It is
> > > > always em0 that is giving trouble. The board is a Supermicro X7DBP-i:
> > >
> > > I'm wondering if the IPMI add-on is what's doing it. However, the IPMI
> > > cards compatible with the X7DBP all have a dedicated NIC (versus
> > > piggybacking on top of the existing mainboards' NICs, which almost
> > > always causes problems of the mysterious sort). I'm not sure, but I
> > > don't think em0 will be that NIC.
> >
> > Correct, em0 and em1 are on the mainboard. em2 is on the
> > daughter-daughtercard and also single use for the OS. And then there's a 4th
> > nic which is the ipmi interface. It is completely separate. (I have the
> > SIM1U-3B & SIM1U-3D daughter cards:
> >
> > > Both you and George have boards that use the 82563EB. jv@ will have to
> > > help with this one. I wonder if it's a BIOS bug of some kind, where
> > > something on the NIC isn't getting reset by the BIOS on a soft boot...
> >
> > Let me know if there's anything I can do to help. I can test patches during
> > evening hours when this server is mostly idle.
> >
> > Ben
>
> There is a fix in the shared code that is checked into CURRENT that
> addresses this, however, yesterday evening a problem cropped up that
> might still be unfixed with that code, I'm looking into that
> today...

Once that's fixed can it be MFC'd?

Best,
George