From owner-freebsd-stable@FreeBSD.ORG Thu Jan 11 21:51:42 2007 Return-Path: X-Original-To: freebsd-stable@freebsd.org Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id 5BFC316A412; Thu, 11 Jan 2007 21:51:42 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from server.baldwin.cx (66-23-211-162.clients.speedfactory.net [66.23.211.162]) by mx1.freebsd.org (Postfix) with ESMTP id DAB7413C455; Thu, 11 Jan 2007 21:51:41 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from localhost.corp.yahoo.com (john@localhost [127.0.0.1]) (authenticated bits=0) by server.baldwin.cx (8.13.6/8.13.6) with ESMTP id l0BLpdQd099976; Thu, 11 Jan 2007 16:51:39 -0500 (EST) (envelope-from jhb@freebsd.org) From: John Baldwin To: Jeremy Chadwick Date: Thu, 11 Jan 2007 16:47:17 -0500 User-Agent: KMail/1.9.1 References: <20070102153608.GA78405@icarus.home.lan> <200701111340.48708.jhb@freebsd.org> <20070111211300.GA30702@icarus.home.lan> In-Reply-To: <20070111211300.GA30702@icarus.home.lan> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200701111647.18735.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH authentication, not delayed by milter-greylist-2.0.2 (server.baldwin.cx [127.0.0.1]); Thu, 11 Jan 2007 16:51:39 -0500 (EST) X-Virus-Scanned: ClamAV 0.88.3/2436/Thu Jan 11 06:48:19 2007 on server.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-4.4 required=4.2 tests=ALL_TRUSTED,AWL,BAYES_00 autolearn=ham version=3.1.3 X-Spam-Checker-Version: SpamAssassin 3.1.3 (2006-06-01) on server.baldwin.cx Cc: freebsd-stable@freebsd.org Subject: Re: Interrupt (SCSI?) hang on 4.x X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 11 Jan 2007 21:51:42 -0000 On Thursday 11 January 2007 16:13, Jeremy Chadwick wrote: > On Thu, Jan 11, 2007 at 01:40:48PM -0500, John Baldwin wrote: > > On Tuesday 02 January 2007 10:36, Jeremy Chadwick wrote: > > > Yes, I know 4.11 is EOL'd at the end of this month, but hopefully > > > someone can shed some light on this problem anyways. I simply don't > > > have the knowledge of what's going on on a low-level to determine > > > the cause. > > > > > > I do have serial console on this box, and after enabling some > > > debugging for the ahc(4) driver a few months back, was able to > > > get something intelligent out of the system regarding SCBs this > > > morning. This may not be useful (or the cause), though. I also > > > cannot enable drop-to-DDB-on-serial-break because our Portmaster 2 > > > has been known to send a serial break on rare occasion. :-( > > > > > > Every so often (sometimes hours, sometimes months -- usually months), > > > the 4.11 box we have "locks up" in the sense that both NICs on the > > > box stop working, and the SCSI controller also appears hung. This > > > problem has existed for a couple years; it's not specific to 4.11 > > > (versus 4.10 or 4.9). > > > > Can you try a RELENG_4 kernel? I know of at least one fix (in > > sys/i386/isa/clock.c) since 4.11 was released that might help you out. > > John, > > Thanks for the tip. > > The kernel I'm using was built on the following date (thus, src-all > for release=cvs tag=RELENG_4 was cvsup'd about an hour prior to this): > > FreeBSD pentarou.parodius.com 4.11-STABLE FreeBSD 4.11-STABLE #0: Sun May 28 12:02:42 PDT 2006 root@pentarou.parodius.com:/usr/obj/usr/src/sys/PENTAROU i386 > > Which revision of sys/i386/isa/clock.c might have fixed the problem? > I only see 1 commit between May 28th 2006 and present: It was the 1.149.2.7, so you probably have that already. I wasn't sure when you said "4.11" kernel if you meant RELENG_4_11 or RELENG_4. -- John Baldwin