From owner-freebsd-current@FreeBSD.ORG Wed Jul 6 17:09:52 2005 Return-Path: X-Original-To: freebsd-current@freebsd.org Delivered-To: freebsd-current@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 25ABD16A41C; Wed, 6 Jul 2005 17:09:52 +0000 (GMT) (envelope-from scottl@samsco.org) Received: from pooker.samsco.org (pooker.samsco.org [168.103.85.57]) by mx1.FreeBSD.org (Postfix) with ESMTP id 9B3ED43D49; Wed, 6 Jul 2005 17:09:49 +0000 (GMT) (envelope-from scottl@samsco.org) Received: from [192.168.254.21] (rat.samsco.home [192.168.254.21]) (authenticated bits=0) by pooker.samsco.org (8.13.3/8.13.3) with ESMTP id j66HGILh025433; Wed, 6 Jul 2005 11:16:18 -0600 (MDT) (envelope-from scottl@samsco.org) Message-ID: <42CC1085.6090504@samsco.org> Date: Wed, 06 Jul 2005 11:10:29 -0600 From: Scott Long User-Agent: Mozilla/5.0 (X11; U; FreeBSD amd64; en-US; rv:1.7.5) Gecko/20050321 X-Accept-Language: en-us, en MIME-Version: 1.0 To: Mike Tancsa References: <70e8236f05070208212e36c375@mail.gmail.com> <42C6DA5F.9070303@gneto.com> <6.2.1.2.0.20050703212843.07889088@64.7.153.2> <20050705011650.R80892@lexi.siliconlandmark.com> <70e8236f050706002655cd9a0c@mail.gmail.com> <42CBE7F4.9040106@samsco.org> <83fb4207210a3f028b8ee2d2289573c4@xcllnt.net> <42CBFC36.1040406@samsco.org> <6.2.1.2.0.20050706115146.07a59d08@64.7.153.2> <6.2.1.2.0.20050706115824.07a58588@64.7.153.2> In-Reply-To: <6.2.1.2.0.20050706115824.07a58588@64.7.153.2> Content-Type: text/plain; charset=us-ascii; format=flowed Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-2.8 required=3.8 tests=ALL_TRUSTED autolearn=failed version=3.0.2 X-Spam-Checker-Version: SpamAssassin 3.0.2 (2004-11-16) on pooker.samsco.org Cc: freebsd-current@freebsd.org Subject: Re: 6.0-CURRENT SNAP004 hangs on amr (long) X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 06 Jul 2005 17:09:52 -0000 Mike Tancsa wrote: > At 11:53 AM 06/07/2005, Mike Tancsa wrote: > >> At 11:43 AM 06/07/2005, Scott Long wrote: >> >>> According to the original dmesg, the hang happens well after bus >>> enumeration is complete and interrupts have been enabled. It's >>> happening on a taste I/O from GEOM. >> >> >> Here is a boot -v that is a little more upto date. I am just >> netbooting with various kernel configs to try and sort out whats going >> on. > > > .... > >> atapci0: Lazy allocation of 0x10 bytes rid 0x20 type 4 at 0 >> ata0: on atapci0 >> atapci0: Reserved 0x8 bytes for rid 0x10 type 4 at 0x1f0 >> atapci0: Reserved 0x1 bytes for rid 0x14 type 4 at 0x3f6 >> ata0: reset tp1 mask=00 ostat0=ff ostat1=ff >> ata0: [MPSAFE] >> ata1: on atapci0 >> atapci0: Reserved 0x8 bytes for rid 0x18 type 4 at 0x170 >> atapci0: Reserved 0x1 bytes for rid 0x1c type 4 at 0x376 >> >> It totally hangs here and I cant even break into debugger. Its almost >> as if the thing goes into suspend mode ? > > > OK, some more details. I removed the ata code, and it no longer sends > the box to "sleep" or whatever weird state its in. > > Now its stuck again, but I can break into the debugger from the serial > console > I wonder if the AMR interrupt is getting routed to the ata interrupt pins. With the ata driver enabled, the OS gets stuck in an infinite loop of trying to service what it thinks in an ata interrupt. With the ata driver disabled, the ata interrupt lines stay disabled and the OS sees nothing. Would it be possible to send an NMI to the machine while it's hung with the ata driver enabled? If not, we can probably drop some simple printf into the ata interrupt handler. Scott