Date: Mon, 3 Nov 2008 23:30:42 +0100
From: Marius Strobl
To: Mark Linimon
Cc: freebsd-sparc64@freebsd.org
Subject: Re: Free Ultra2 in Silicon Valley, USA
Message-ID: <20081103223042.GB8256@alchemy.franken.de>
In-Reply-To: <20081031131827.GA9613@soaustin.net>

On Fri, Oct 31, 2008 at 08:18:27AM -0500, Mark Linimon wrote:
> On Fri, Oct 31, 2008 at 06:02:28AM -0700, mdh wrote:
> > A dual CPU Ultra2 is going to be a lot more powerful than an Ultra5.
>
> Hmm, ok, didn't know that. If no-one else claims it first, perhaps
> I can claim it for the build cluster and pull one of the Ultra 5s.
> I intend to be out in .ca.us for meetBSD.
>
> > E4500's can be relatively beefy.
>
> We could have gotten our hands on some more of them in .ca.us but the
> problem is who wants to pay for the power :-( Really, their time has
> come and gone.
>
> > OK, this is probably way over my head, but I'll bite - what exactly
> > happens if you don't breakpoint through it?
>
> http://people.freebsd.org/~linimon/studies/dmesgs/dmesg.netra_1_t200.txt .
>
> This appears to be some kind of race condition; my guess from fooling
> around with it is that some interrupt is enabled, and then fires, before
> the setup to handle it is finished. (Note that the same kernel runs
> fine on the 100s). By stepping through it, you can see it fail at
> different locations; without stepping through it, it is always at
> the same.
>
> Unfortunately my notes are at home and that machine is unreachable ATM.
>

It's more likely that a device is exceeding the mapping provided, which
causes the uncorrectable DMA error interrupt. That interrupt in turn
fires at different locations depending on how far the CPU has progressed
since the transfer request was issued to the device. Anyway, the panic
message provided isn't enough info to even guess what the real cause is.

I think the easiest way to proceed would be to remove the remaining NIC
(is there a reason you disabled gem(4) for the on-board ones?) and mass
storage controller drivers one by one and see when the panic goes away.
I'd begin with just disabling ATAPI DMA (meanwhile done by the sparc64
loader by default) though, as ata(4) has a known issue causing data
corruption with the ALI M5229 and ATAPI DMA on sparc64, which might be
related to your problem.

That said, my T1 AC200 is running fine and I've never seen such a
problem with it...

Marius