From owner-freebsd-bugs@FreeBSD.ORG Mon Nov 26 02:30:03 2007 Return-Path: Delivered-To: freebsd-bugs@hub.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 00EDB16A41B for ; Mon, 26 Nov 2007 02:30:03 +0000 (UTC) (envelope-from gnats@FreeBSD.org) Received: from freefall.freebsd.org (freefall.freebsd.org [IPv6:2001:4f8:fff6::28]) by mx1.freebsd.org (Postfix) with ESMTP id D8A4513C4E7 for ; Mon, 26 Nov 2007 02:30:02 +0000 (UTC) (envelope-from gnats@FreeBSD.org) Received: from freefall.freebsd.org (gnats@localhost [127.0.0.1]) by freefall.freebsd.org (8.14.2/8.14.2) with ESMTP id lAQ2U190000304 for ; Mon, 26 Nov 2007 02:30:01 GMT (envelope-from gnats@freefall.freebsd.org) Received: (from gnats@localhost) by freefall.freebsd.org (8.14.2/8.14.1/Submit) id lAQ2U1RP000303; Mon, 26 Nov 2007 02:30:01 GMT (envelope-from gnats) Resent-Date: Mon, 26 Nov 2007 02:30:01 GMT Resent-Message-Id: <200711260230.lAQ2U1RP000303@freefall.freebsd.org> Resent-From: FreeBSD-gnats-submit@FreeBSD.org (GNATS Filer) Resent-To: freebsd-bugs@FreeBSD.org Resent-Reply-To: FreeBSD-gnats-submit@FreeBSD.org, Jeremy Chadwick Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 927F216A417 for ; Mon, 26 Nov 2007 02:29:18 +0000 (UTC) (envelope-from jdc@parodius.com) Received: from mx01.sc1.parodius.com (mx01.sc1.parodius.com [72.20.106.3]) by mx1.freebsd.org (Postfix) with ESMTP id 8555513C448 for ; Mon, 26 Nov 2007 02:29:18 +0000 (UTC) (envelope-from jdc@parodius.com) Received: by mx01.sc1.parodius.com (Postfix, from userid 1000) id DBBF11CC07B; Sun, 25 Nov 2007 18:29:14 -0800 (PST) Message-Id: <20071126022914.DBBF11CC07B@mx01.sc1.parodius.com> Date: Sun, 25 Nov 2007 18:29:14 -0800 (PST) From: Jeremy Chadwick To: FreeBSD-gnats-submit@FreeBSD.org X-Send-Pr-Version: 3.113 Cc: Subject: conf/118255: savecore never finding kernel core dumps (rcorder problem) X-BeenThere: freebsd-bugs@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: Jeremy Chadwick List-Id: Bug reports List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 26 Nov 2007 02:30:03 -0000 >Number: 118255 >Category: conf >Synopsis: savecore never finding kernel core dumps (rcorder problem) >Confidential: no >Severity: serious >Priority: medium >Responsible: freebsd-bugs >State: open >Quarter: >Keywords: >Date-Required: >Class: sw-bug >Submitter-Id: current-users >Arrival-Date: Mon Nov 26 02:30:01 UTC 2007 >Closed-Date: >Last-Modified: >Originator: Jeremy Chadwick >Release: FreeBSD 6.3-PRERELEASE i386 >Organization: >Environment: System: FreeBSD eos.sc1.parodius.com 6.3-PRERELEASE FreeBSD 6.3-PRERELEASE #0: Wed Nov 7 13:14:24 PST 2007 root@eos.sc1.parodius.com:/usr/obj/usr/src/sys/EOS i386 >Description: One of our production systems has begun kernel panic'ing for reasons unknown; we're in the process of figuring out why that's happening. On the other hand, none of our kernel panics (which are being written to disk when doing "panic" from DDB) are being dropped into /var/crash when savecore runs. Details of our configuration and what actually happens were posted to freebsd-stable. It shows that a kernel core dump is indeed written to the correct device (/dev/ad0s1b), but savecore never detects the cores: http://lists.freebsd.org/pipermail/freebsd-stable/2007-November/038069.html http://lists.freebsd.org/pipermail/freebsd-stable/2007-November/038569.html I believe the problem is that /etc/rc.d/swap1 (which does `swapon -a`) is being called _before_ /etc/rc.d/savecore, thus clobbering/stomping over any core dumps that exist. See the 2nd URL above for some additional details. I'm marking this serious/medium because people being able to get vmcore images after a kernel panic is important. :-) >How-To-Repeat: Set dumpdev and dumpdir in /etc/rc.conf, panic system, and see. >Fix: I believe the issue can be fixed by adjusting some of the rcorder(8) values so that savecore gets run *before* swap1. I'm not familiar with what needs to be changed to make this work. >Release-Note: >Audit-Trail: >Unformatted: