From owner-freebsd-stable@FreeBSD.ORG Thu Jan 11 21:13:00 2007 Return-Path: X-Original-To: freebsd-stable@freebsd.org Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id EA97D16A416 for ; Thu, 11 Jan 2007 21:13:00 +0000 (UTC) (envelope-from jdc@koitsu.dyndns.org) Received: from rwcrmhc12.comcast.net (rwcrmhc12.comcast.net [204.127.192.82]) by mx1.freebsd.org (Postfix) with ESMTP id D4D1813C455 for ; Thu, 11 Jan 2007 21:13:00 +0000 (UTC) (envelope-from jdc@koitsu.dyndns.org) Received: from icarus.home.lan (c-71-198-0-135.hsd1.ca.comcast.net[71.198.0.135]) by comcast.net (rwcrmhc12) with ESMTP id <20070111211300m1200ak4b9e>; Thu, 11 Jan 2007 21:13:00 +0000 Received: by icarus.home.lan (Postfix, from userid 1000) id 24BD41FA037; Thu, 11 Jan 2007 13:13:00 -0800 (PST) Date: Thu, 11 Jan 2007 13:13:00 -0800 From: Jeremy Chadwick To: John Baldwin Message-ID: <20070111211300.GA30702@icarus.home.lan> Mail-Followup-To: John Baldwin , freebsd-stable@freebsd.org References: <20070102153608.GA78405@icarus.home.lan> <200701111340.48708.jhb@freebsd.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <200701111340.48708.jhb@freebsd.org> X-PGP-Key: http://jdc.parodius.com/pubkey.asc User-Agent: Mutt/1.5.13 (2006-08-11) Cc: freebsd-stable@freebsd.org Subject: Re: Interrupt (SCSI?) hang on 4.x X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 11 Jan 2007 21:13:01 -0000 On Thu, Jan 11, 2007 at 01:40:48PM -0500, John Baldwin wrote: > On Tuesday 02 January 2007 10:36, Jeremy Chadwick wrote: > > Yes, I know 4.11 is EOL'd at the end of this month, but hopefully > > someone can shed some light on this problem anyways. I simply don't > > have the knowledge of what's going on on a low-level to determine > > the cause. > > > > I do have serial console on this box, and after enabling some > > debugging for the ahc(4) driver a few months back, was able to > > get something intelligent out of the system regarding SCBs this > > morning. This may not be useful (or the cause), though. I also > > cannot enable drop-to-DDB-on-serial-break because our Portmaster 2 > > has been known to send a serial break on rare occasion. :-( > > > > Every so often (sometimes hours, sometimes months -- usually months), > > the 4.11 box we have "locks up" in the sense that both NICs on the > > box stop working, and the SCSI controller also appears hung. This > > problem has existed for a couple years; it's not specific to 4.11 > > (versus 4.10 or 4.9). > > Can you try a RELENG_4 kernel? I know of at least one fix (in > sys/i386/isa/clock.c) since 4.11 was released that might help you out. John, Thanks for the tip. The kernel I'm using was built on the following date (thus, src-all for release=cvs tag=RELENG_4 was cvsup'd about an hour prior to this): FreeBSD pentarou.parodius.com 4.11-STABLE FreeBSD 4.11-STABLE #0: Sun May 28 12:02:42 PDT 2006 root@pentarou.parodius.com:/usr/obj/usr/src/sys/PENTAROU i386 Which revision of sys/i386/isa/clock.c might have fixed the problem? I only see 1 commit between May 28th 2006 and present: http://www.freebsd.org/cgi/cvsweb.cgi/src/sys/i386/isa/clock.c?f=u&only_with_tag=RELENG_4&logsort=date -- | Jeremy Chadwick jdc at parodius.com | | Parodius Networking http://www.parodius.com/ | | UNIX Systems Administrator Mountain View, CA, USA | | Making life hard for others since 1977. PGP: 4BD6C0CB |