From: Andrew Gallatin
Date: Wed, 25 Aug 2004 08:35:16 -0400 (EDT)
To: Chris Dillon
cc: freebsd-current@freebsd.org
Subject: Re: 5.3-BETA1 hard lockups (HP DL380 G3)
Message-ID: <16684.34692.113251.838400@grasshopper.cs.duke.edu>
In-Reply-To: <20040824235426.U93926@duey.wolves.k12.mo.us>

Chris Dillon writes:
>
> Anybody else experiencing hard lockups with a recent 5.3-BETA?
> I can manage to build and install a kernel if I hold my breath while
> doing it, but I can't make it through a buildworld without the system
> locking up hard enough to require a manual reset (thank goodness for
> iLO and its virtual reset button :-).
>
> The hardware I'm working on is a HP Proliant DL380 G3, with dual
> 2.8GHz Xeons, 2GB RAM, and using the on-board SmartArray 5i+
> controller with six 72GB 10KRPM disks in a RAID5 configuration.
> Everything worked fine at first, even went through a few
> kernels+worlds and plenty of port builds over the last month, but
> within the last week or so these hard lockups started happening during
> kernel/world builds (probably any moderate load would do). I thought
> maybe something might have gone wrong with the hardware so I ran the
> HP Diagnostics on it for 24 hours in a continuous loop and everything
> went fine. Any clues?

Me too. 4x Opteron 844 w/AMD64 kernel, 4GB RAM, mpt SCSI. Building
kernels is fine, but a buildworld always locked up hard until I backed
down to a UP kernel.

Do you have Doug White's IPI deadlock patch applied? (I do.) You
might also have a better chance of getting into ddb if you set
debug.kdb.stop_cpus=0.

There's also MP_WATCHDOG for you, since you're on i386.

Drew
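
A rough sketch of the two suggestions above, assuming a custom i386 kernel config that already builds. The KDB and DDB lines are an assumption about what is needed to reach ddb at all; they are not taken from the message.

```
# /etc/sysctl.conf (or run "sysctl debug.kdb.stop_cpus=0" as root)
# Don't try to stop the other CPUs when entering the debugger.
debug.kdb.stop_cpus=0

# Kernel configuration file (i386); rebuild and install the kernel afterwards.
options 	KDB		# assumption: debugger framework
options 	DDB		# assumption: ddb backend
options 	MP_WATCHDOG	# watchdog mentioned above
```

Check NOTES for the release in use before relying on any of these names.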