From owner-freebsd-stable@FreeBSD.ORG Tue Sep 28 09:27:20 2004 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 3CF8116A4CE for ; Tue, 28 Sep 2004 09:27:20 +0000 (GMT) Received: from anduin.net (anduin.net [212.12.46.226]) by mx1.FreeBSD.org (Postfix) with ESMTP id EE40243D1F for ; Tue, 28 Sep 2004 09:27:19 +0000 (GMT) (envelope-from ltning@anduin.net) Received: from mailnull by anduin.net with dspam-scanned (Exim 4.42 (FreeBSD)) id 1CCEE9-000IrX-RZ for stable@freebsd.org; Tue, 28 Sep 2004 11:24:53 +0200 Received: from mailnull by anduin.net with spamassassin-scanned (Exim 4.42 (FreeBSD)) id 1CCEE6-000IrJ-UA for stable@freebsd.org; Tue, 28 Sep 2004 11:24:53 +0200 Received: from eirik.unicore.no ([213.225.74.166] helo=[10.0.16.10]) by anduin.net with esmtp (Exim 4.42 (FreeBSD)) id 1CCEE2-000Ir9-Hq; Tue, 28 Sep 2004 11:24:46 +0200 From: Eirik Oeverby To: Doug White In-Reply-To: <20040925132827.G65556@carver.gumbysoft.com> References: <41519A92.3070503@anduin.net> <20040922191436.A36124@carver.gumbysoft.com> <5390CF52-0D22-11D9-862C-000D9335BCEC@anduin.net> <20040925132827.G65556@carver.gumbysoft.com> Content-Type: text/plain Date: Tue, 28 Sep 2004 11:23:21 +0200 Message-Id: <1096363401.77054.1.camel@eirik.unicore.no> Mime-Version: 1.0 X-Mailer: Evolution 2.0.0FreeBSD GNOME Team Port Content-Transfer-Encoding: 7bit X-Spam-Checker-Version: SpamAssassin 2.64 (2004-01-11) on anduin.net X-Spam-Level: X-Spam-Status: No, hits=-4.1 required=7.5 tests=AWL,BAYES_00 autolearn=no version=2.64 cc: stable@freebsd.org Subject: Re: Obscure errors in dmsg, system instability X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 28 Sep 2004 09:27:20 -0000 On Sat, 2004-09-25 at 13:29 -0700, Doug White wrote: > On Thu, 23 Sep 2004, [ISO-8859-1] Eirik verby wrote: > > > On 23. Sep 2004, at 04:15, Doug White wrote: > > > > > Is something sharing an interrupt with that device? > > > > > > PCI bus errors are generally Bad News .. either some device or the > > > mobo is inroducing errors. > > > > Well.. Yes, there is some interrupt sharing. Relevant parts of dmsg: > > > > [ltning@carnen] ~$ dmesg | grep "irq 2" > > IOAPIC #0 intpin 19 -> irq 2 > > uhci0: port 0xd400-0xd41f irq > > 2 at device 4.2 on pci0 > > ahc0: port 0xd000-0xd0ff mem > > 0xe2000000-0xe2000fff irq 2 at device 6.0 on pci0 > > amr0: mem 0xe3000000-0xe300ffff irq 2 at device 9.1 > > on pci0 > > Apparently one of these devices doesn't like getting an interrupt when > there's no data pending. It might be a FreeBSD driver bug, but being a > 3-way share it'll make it hard to untangle. Got some more info here. It happened again, and while in the kernel debugger (serial console), typing 'help' offers me the following functions in addition to the standard stuff: ahd_set_unit ahd_pause ahd_unpause ahd_in ahd_out Any idea what I could do with these? It's a rather strong suggestion that the Adaptec is the problem here - though I have no idea what to do next. Haven't managed to take the box out to disable Adaptec yet. > > > I don't like the fact that the LSI and the Adaptec are sharing IRQs, > > given that the LSI is the main system drive controller (which is why I > > don't use the Adaptec at all - and it cannot be disabled in BIOS I > > think!)... > > > > I should perhaps try to reallocate some of the IRQs, but I don't really > > have a clue how to do that, since I have no VGA in that box.. Ohwell, I > > guess I just have to rip it open ;) > > Yah .. rearrange the cards in the slots and see what you can convince it > to do. Will do, but the box is 1000km away, so I need help for that. Thanks, /Eirik