Date: Tue, 24 Feb 2009 10:30:17 -0900
From: Mel <fbsd.questions@rachie.is-a-geek.net>
To: freebsd-questions@freebsd.org
Cc: Peter Steele <psteele@maxiscale.com>
Subject: Re: What is correct way to enable watchdog?
Message-ID: <200902241030.17937.fbsd.questions@rachie.is-a-geek.net>
In-Reply-To: <435616.681235485535608.JavaMail.HALO$@halo>
References: <435616.681235485535608.JavaMail.HALO$@halo>

On Tuesday 24 February 2009 05:25:36 Peter Steele wrote:
> >No, meaning, if a system is unresponsive for 300 seconds, action will be
> >taken. watchdogd will not prevent proper reboots, panics or power
> > failures.
>
> Bad wording on my part. What you said is what I meant, and I assume the
> default action is to reboot the system?

If -e cmd is not specified, the daemon will perform a trivial file system
check instead.

> >Panic, or overheating. Check the dumpdev/dumpdir variables in rc.conf(5).
>
> We don't have dumpdev/dumpdir configured in rc.conf. I'll do that. What
> makes us suspicious is that we have been running this stress test on
> systems for months without any reboots. We then enable the 300 second
> watchdog and two systems spontaneously reboot. We've turned it off again
> and have restarted the stress test and so far no reboots. What we want to
> know is are these reboots occurring as a result of a watchdog reboot? Is
> any kind of system log created when the watchdog reboots a system?

This smells more like a bug in watchdog. If that's the case, the crash dumps
should point right at it, at which point I'd take it to freebsd-stable or
-current, whichever applies to the OS version.

-- 
Mel

Problem with today's modular software: they start with the modules and never
get to the software part.
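
For reference, the rc.conf(5) settings discussed in this thread might look roughly like the fragment below. This is only a sketch: the values are illustrative assumptions, not taken from Peter's systems. It assumes a FreeBSD release where dumpdev="AUTO" is supported, and it reuses the 300 second timeout mentioned above.

```sh
# /etc/rc.conf (illustrative values, adjust for the actual system)

# Let the rc scripts pick a swap device for kernel crash dumps.
dumpdev="AUTO"

# Directory where savecore(8) stores the dump on the next boot.
# /var/crash is the usual default; it needs enough free space.
dumpdir="/var/crash"

# Start watchdogd(8) at boot with a 300 second timeout (assumed value
# from this thread). Without -e, watchdogd falls back to its trivial
# file system check, as quoted above.
watchdogd_enable="YES"
watchdogd_flags="-t 300"
```

With something like this in place, a panic should leave a dump under dumpdir after the reboot, which is what would show whether the watchdog is involved.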