From: Mel
To: freebsd-questions@freebsd.org
Cc: Peter Steele
Date: Tue, 24 Feb 2009 10:30:17 -0900
Subject: Re: What is correct way to enable watchdog?
Message-Id: <200902241030.17937.fbsd.questions@rachie.is-a-geek.net>
In-Reply-To: <435616.681235485535608.JavaMail.HALO$@halo>

On Tuesday 24 February 2009 05:25:36 Peter Steele wrote:
> >No, meaning, if a system is unresponsive for 300 seconds, action will be
> >taken. watchdogd will not prevent proper reboots, panics or power
> >failures.
>
> Bad wording on my part. What you said is what I meant, and I assume the
> default action is to reboot the system?

If -e cmd is not specified, the daemon will perform a trivial file system
check instead.

> >Panic, or overheating. Check the dumpdev/dumpdir variables in rc.conf(5).
>
> We don't have dumpdev/dumpdir configured in rc.conf. I'll do that. What
> makes us suspicious is that we have been running this stress test on
> systems for months without any reboots. We then enable the 300 second
> watchdog and two systems spontaneously reboot. We've turned it off again
> and have restarted the stress test and so far no reboots. What we want to
> know is are these reboots occurring as a result of a watchdog reboot? Is
> any kind of system log created when the watchdog reboots a system?

This smells more like a bug in watchdog. If that's the case, the crash dumps
should point right at it, at which point I'd take it to freebsd-stable or
-current, whichever applies to the OS version.

-- 
Mel

Problem with today's modular software: they start with the modules and never
get to the software part.
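
For reference, a minimal rc.conf(5) sketch of the setup discussed in this
thread. The 300 second timeout is the value mentioned above; the exact flags
Peter used are not given in the thread, so the watchdogd_flags line is an
assumption, as are the dumpdev and dumpdir values (a common choice and the
usual default location). Check watchdogd(8) and rc.conf(5) on your release
before relying on it.

```shell
# /etc/rc.conf fragment (sketch, values are assumptions, see above)

# Start watchdogd(8) at boot.
watchdogd_enable="YES"
# Assumed flags: -t sets the watchdog timeout in seconds (300 as in the thread).
watchdogd_flags="-t 300"

# Let the kernel write a crash dump on panic; AUTO picks a configured swap device.
dumpdev="AUTO"
# Directory where savecore(8) stores the dump on the next boot.
dumpdir="/var/crash"
```

With dumpdev set, a panic should leave a dump that savecore(8) copies into
dumpdir at the next boot, which is what would show whether the watchdog is
involved.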