From owner-freebsd-stable@FreeBSD.ORG Tue Jan 18 21:46:56 2011 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 052A81065672 for ; Tue, 18 Jan 2011 21:46:56 +0000 (UTC) (envelope-from jdc@koitsu.dyndns.org) Received: from qmta14.emeryville.ca.mail.comcast.net (qmta14.emeryville.ca.mail.comcast.net [76.96.27.212]) by mx1.freebsd.org (Postfix) with ESMTP id DA0F08FC13 for ; Tue, 18 Jan 2011 21:46:55 +0000 (UTC) Received: from omta23.emeryville.ca.mail.comcast.net ([76.96.30.90]) by qmta14.emeryville.ca.mail.comcast.net with comcast id x0dV1f00A1wfjNsAE9mvrD; Tue, 18 Jan 2011 21:46:55 +0000 Received: from koitsu.dyndns.org ([98.248.34.134]) by omta23.emeryville.ca.mail.comcast.net with comcast id x9mu1f00G2tehsa8j9muhq; Tue, 18 Jan 2011 21:46:55 +0000 Received: by icarus.home.lan (Postfix, from userid 1000) id 48B499B422; Tue, 18 Jan 2011 13:46:54 -0800 (PST) Date: Tue, 18 Jan 2011 13:46:54 -0800 From: Jeremy Chadwick To: Lev Serebryakov Message-ID: <20110118214654.GA15398@icarus.home.lan> References: <1321946168.20110119001248@serebryakov.spb.ru> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <1321946168.20110119001248@serebryakov.spb.ru> User-Agent: Mutt/1.5.21 (2010-09-15) Cc: freebsd-stable@freebsd.org, "Vogel, Jack" Subject: Re: 8-STABLE/amd64 semi-regular crash with "kernel trap 12 with interrupts disabled" in "process 12 (swi4: clock)" X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 18 Jan 2011 21:46:56 -0000 On Wed, Jan 19, 2011 at 12:12:48AM +0300, Lev Serebryakov wrote: > Hello, Freebsd-stable. > > One of my servers crashes about once a week, with always same > diagnostics: "kernel trap 12 with interrupts disabled" and in same > process: "swi4: clock" > > It doesn't look as memory failure, as memtest86+ can not find any > errors in 8 passes. > > Also, after this crash server refuse to auto-reboot, last message on > console is "cpu_reset: Stopping other CPUs", and it hangs. > > Kernel config, booting dmesg & results of "savecore" are attached > (bzipped). CC'ing Jack Vogel of Intel, as this looks like it could be something the em(4) driver might be tickling. I do see it in the stack trace shortly before the crash. In the interim, can you please provide output from the following command: # pciconf -lbcv And include only the entries relevant to your emX devices. As for the "the server refuses to auto-reboot": that may be a separate problem. You might try toggling the hw.acpi.disable_on_reboot and hw.acpi.handle_reboot sysctls (check what values they have on your system first) to see if there's any improvement. For Jack -- the core/stack trace, and dmesg are at the below URL as attachments (and bzip2 compressed): http://lists.freebsd.org/pipermail/freebsd-stable/2011-January/061168.html -- | Jeremy Chadwick jdc@parodius.com | | Parodius Networking http://www.parodius.com/ | | UNIX Systems Administrator Mountain View, CA, USA | | Making life hard for others since 1977. PGP 4BD6C0CB |