From: Robert Slade
To: Olivier Nicole
Cc: "freebsd-questions@freebsd.org"
Subject: Re: Mysterious reboot, suspecting hardware issue
Date: Tue, 28 Feb 2006 09:08:17 +0000
Message-Id: <1141117697.8351.19.camel@lmail.bathnetworks.co.uk>
In-Reply-To: <200602280425.k1S4P9pK098441@banyan.cs.ait.ac.th>
References: <200602280425.k1S4P9pK098441@banyan.cs.ait.ac.th>
List-Id: User questions

On Tue, 2006-02-28 at 04:25, Olivier Nicole wrote:
> Hi,
>
> Running an Amanda server on a dual P III, with 256 MB ram and dual
> Symbios SCSI adapter. Part of the hard disk are mounted in RAID with
> vinum on the first SCSI adapter. The Tandberg tape drive is on the
> second SCSI adapter. FreeBSD is:
>
> FreeBSD amanda.cs.ait.ac.th 4.11-RELEASE-p14 FreeBSD 4.11-RELEASE-p14 #3: Thu Feb 23 14:03:05 ICT 2006 root@amanda.cs.ait.ac.th:/usr/obj/usr/src/sys/SMALL i386
>
> Recently (mid February) it started rebooting around 01:20, that is
> about 40 minutes after amanda started the nightly dumps of the various
> servers on my network. It's about time when amanda reaches full speed
> and the machine may be busy doing some high gzip compress.
>
> The first reboot occured while I was running 4.10-RELEASE-p19 so it
> cannot be related to the version of FreeBSD.
>
> That machine has been running like that for about one year without
> any problem.
>
> I set dumpdev="/dev/rda0s1b" in /etc/rc.conf, expecting to get a
> kernel dump in time of the crash, but I got nothing.
>
> - did I missed something?
>
> - how to prevent the machine from rebooting when it crashes?
>
> Other idea?
>
> Best regards,
>
> Olivier

Olivier,

There have been problems with the SMP kernel in the past, but they were
supposed to be fixed around 5.4. From memory, there was a patch for 5.3
which was incorporated into 5.4 and 6.0. I say "supposed" because there
have been reports of random reboots with 5.4 and 6.0 running SMP; see
the archives. The usual response is that it is hardware related, but I
am not so sure.

In my case it was a quad-processor machine with dual redundant power
supplies etc., which ran the diagnostics with no problem for several
days. It looks like a load problem to me.

I ran out of time trying to fix the machine and had to go down a
different route.

Hope this helps a little.

Rob