From owner-freebsd-hardware@FreeBSD.ORG Sun Mar 5 07:56:17 2006 Return-Path: X-Original-To: freebsd-hardware@freebsd.org Delivered-To: freebsd-hardware@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 5809D16A420 for ; Sun, 5 Mar 2006 07:56:17 +0000 (GMT) (envelope-from gcorcoran@rcn.com) Received: from smtp02.mrf.mail.rcn.net (smtp02.mrf.mail.rcn.net [207.172.4.62]) by mx1.FreeBSD.org (Postfix) with ESMTP id 02DAA43D55 for ; Sun, 5 Mar 2006 07:56:13 +0000 (GMT) (envelope-from gcorcoran@rcn.com) Received: from 207-172-241-254.c3-0.tlg-ubr1.atw-tlg.pa.cable.rcn.com (HELO [10.56.78.130]) ([207.172.241.254]) by smtp02.mrf.mail.rcn.net with ESMTP; 05 Mar 2006 02:56:14 -0500 X-IronPort-AV: i="4.02,164,1139202000"; d="scan'208"; a="211285855:sNHT28100096" Message-ID: <440A9B0A.4050303@rcn.com> Date: Sun, 05 Mar 2006 03:02:18 -0500 From: Gary Corcoran User-Agent: Thunderbird 1.5 (Windows/20051201) MIME-Version: 1.0 To: FreeBSD References: <1141517664.1407@swaggi.com> In-Reply-To: <1141517664.1407@swaggi.com> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Cc: me@carrollkong.com, freebsd-hardware@freebsd.org Subject: Re: FreeBSD shutting down unexpectedly X-BeenThere: freebsd-hardware@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: General discussion of FreeBSD hardware List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 05 Mar 2006 07:56:17 -0000 FreeBSD wrote: > Hi everyone, > > I'm having a problem with my FreeBSD server shutting down for no reason. This started happening recently (within the last month) so I'm not sure if the hardware is dying or if there's another underlying problem. The server is a rackmountable 1U chassis at a remote location connected to an APC UPS. When I lose connectivity and go to physically inspect the server, it's powered off completely as if there was a power outage. However, each time I can confirm that the UPS did not lose power and there were no voltage spikes (the APC can be managed via telnet and SNMP so there are logs I can look at). So far the only way I've been able to reproduce the problem is by running "portsdb -Uu" to update the ports DB after a cvsup of the ports tree. I suppose this is a CPU/disk intensive task so maybe I have a dying hard drive? Running this command used to take about 10 minutes for me, now when I run it about 5 minutes later the box powers off mysteriously. There's nothing in /var/log/me ssa > ges and after I bring it back up, dmesg does not show anything unusual. I setup a console server to monitor this server's console but there was absolutely nothing on the console during the last such "crash". > I'm looking for some suggestions on how to troubleshoot this problem, perhaps I can enable crash dump files or some sort of debugging? Here's some output: > Any suggestions would be welcome. You may have a failing, or underpowered, power supply, which could cause flakiness when you have large amounts of CPU and/or disk activity. Or, sorry to say, you may be yet another victim of the "bad capacitors" that have plagued many motherboards, even from quality makers, during the past few years. Do a google search, or try this site that google just showed me, for what to look for: http://www.badcaps.net/ Gary