From owner-freebsd-questions@FreeBSD.ORG Sun Sep 7 13:02:34 2003 Return-Path: Delivered-To: freebsd-questions@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 943CE16A4BF for ; Sun, 7 Sep 2003 13:02:34 -0700 (PDT) Received: from pop017.verizon.net (pop017pub.verizon.net [206.46.170.210]) by mx1.FreeBSD.org (Postfix) with ESMTP id 840CC43F75 for ; Sun, 7 Sep 2003 13:02:32 -0700 (PDT) (envelope-from cswiger@mac.com) Received: from mac.com ([68.237.14.199]) by pop017.verizon.net (InterMail vM.5.01.05.33 201-253-122-126-133-20030313) with ESMTP id <20030907200231.RZVK27671.pop017.verizon.net@mac.com>; Sun, 7 Sep 2003 15:02:31 -0500 Message-ID: <3F5B8ED0.4090500@mac.com> Date: Sun, 07 Sep 2003 16:02:24 -0400 From: Chuck Swiger Organization: The Courts of Chaos User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.4) Gecko/20030624 X-Accept-Language: en-us, en MIME-Version: 1.0 To: "Jack L. Stone" References: <3.0.5.32.20030907102900.01393408@sage-one.net> <3.0.5.32.20030907102900.01393408@sage-one.net> <3.0.5.32.20030907142124.013a9ef0@sage-one.net> In-Reply-To: <3.0.5.32.20030907142124.013a9ef0@sage-one.net> X-Enigmail-Version: 0.76.5.0 X-Enigmail-Supports: pgp-inline, pgp-mime Content-Type: text/plain; charset=us-ascii; format=flowed Content-Transfer-Encoding: 7bit X-Authentication-Info: Submitted using SMTP AUTH at pop017.verizon.net from [68.237.14.199] at Sun, 7 Sep 2003 15:02:31 -0500 cc: freebsd-questions@freebsd.org Subject: Re: Random crash and/or reboots X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 07 Sep 2003 20:02:34 -0000 Jack L. Stone wrote: [ ... ] > Except, I doubt if those 2 nighttime reboots had the same problem....that's > why I said always triggered by login to root.... forget the 2 unrelated ones. How many unexplained crashes do you think your system should have? :-) Seriously, if you're running a release version of the OS, or are tracking the security branch, your machines should stay up until the power goes out and the UPS dies, or you reboot them. You should be seeing hundred-day uptimes, unless you have hardware problems. Yes, I read that you've swapped most of the hardware out without result; can you set up a second machine running the same software and configuration and see whether it crashes in a similar fashion (or at all)? [ ... ] > http://www.sageweb.net/tmp/1-lsof.txt > http://www.sageweb.net/tmp/2-lsof.txt > http://www.sageweb.net/tmp/3-lsof.txt > http://www.sageweb.net/tmp/4-lsof.txt > http://www.sageweb.net/tmp/5-lsof.txt > http://www.sageweb.net/tmp/6-lsof.txt These don't provide any information that seems particularly relevant to diagnosing the problem. [ The data says what services you're running-- apache, perl, sendmail, spamassassin, and suggest that your machine was idle or under a light load when the crashes happened. By weak inference, that suggests against a thermal problem like poor CPU cooling, but I wouldn't be certain of even that. ] -- -Chuck