From owner-freebsd-stable@FreeBSD.ORG Thu Apr 1 11:20:17 2004 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id E64DA16A4CF for ; Thu, 1 Apr 2004 11:20:17 -0800 (PST) Received: from smtp1.mc.surewest.net (smtp1.mc.surewest.net [66.60.130.50]) by mx1.FreeBSD.org (Postfix) with SMTP id C670643D4C for ; Thu, 1 Apr 2004 11:20:15 -0800 (PST) (envelope-from dislists@updegrove.net) Received: (qmail 10045 invoked from network); 1 Apr 2004 19:19:13 -0000 Received: from unknown (HELO updegrove.net) (64.30.97.117) by smtp1.mc.surewest.net with SMTP; 1 Apr 2004 19:19:13 -0000 Received: (qmail 47964 invoked by uid 98); 1 Apr 2004 19:20:16 -0000 Received: from dislists@updegrove.net by smeagol.purgatory by uid 1008 with qmail-scanner-1.20 Clear:RC:1(64.166.46.10):. Processed in 0.148462 secs); 01 Apr 2004 19:20:16 -0000 X-Qmail-Scanner-Mail-From: dislists@updegrove.net via smeagol.purgatory X-Qmail-Scanner: 1.20 (Clear:RC:1(64.166.46.10):. Processed in 0.148462 secs) Received: from adsl-64-166-46-10.dsl.scrm01.pacbell.net (HELO updegrove.net) (64.166.46.10) by updegrove.net with SMTP; 1 Apr 2004 19:20:15 -0000 Message-ID: <406C6CD5.8050000@updegrove.net> Date: Thu, 01 Apr 2004 11:26:13 -0800 From: Rick Updegrove User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.0; en-US; rv:1.6) Gecko/20040113 X-Accept-Language: en-us, en MIME-Version: 1.0 To: freebsd-stable@freebsd.org, freebsd-smp@freebsd.org Content-Type: text/plain; charset=us-ascii; format=flowed Content-Transfer-Encoding: 7bit X-TST: smtp1 SNWK3 0.31-41 ip=64.30.97.117 Subject: upgrade from 4.8 SMP to 4.9 SMP causes unexplained rebooting! X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 01 Apr 2004 19:20:18 -0000 (yes sorry for the cross-posting but I haven't been making much progress with this problem and my poor disks are taking a beating from all this rebooting) A 4.8-STABLE machine I have been running with no problems for over 130 days straight uptime is now having unexplained reboots AFTER upgrading to 4.9 STABLE. The reboots are not every single day, or predictable, but they are happening almost every other day. It is a low traffic qmail-scanner machine (7k messages a day) and the only reason I even upgraded was due to to http://www.freebsd.org/releases/4.9R/errata.html Now I am sort of wishing I did not : ) I lost all the uptime and now the unexplained rebooting... I hesitate reporting this because most people point their fingers at the hardware. I am tempted to abandon this machine for another but if anyone is interested in taking a look please advise. I have the following: #/etc/rc.conf #rebooting dumpdev=YES savecore=YES dumpdir="/var/crash" I know ths is not quite enough - I need to configure a dump devide but I have no tape drive. Is there another way? I rebuilt this kernel with: makeoptions DEBUG=-g #Build kernel with gdb(1) debug symbols Is there anything else I should do? Right now I do not have the ability to attach a serial console to the crashing system and set the system to serial console. And even if I did have physical access I am not sure how to do that exactly... Is there another way to accomplish the debugging of this? I have been running FreeBSD so long with no problems I am sort of rusty at tracking them down, especially the elusive ones. So please point me in the right direction. Thanks! Rick P.S. dmesg follows Copyright (c) 1992-2003 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD 4.9-STABLE #1: Wed Mar 24 08:06:56 PST 2004 root@govmail.ca.gov:/usr/obj/usr/src/sys/SMP Timecounter "i8254" frequency 1193182 Hz CPU: Pentium III/Pentium III Xeon/Celeron (499.15-MHz 686-class CPU) Origin = "GenuineIntel" Id = 0x673 Stepping = 3 Features=0x387fbff real memory = 536870912 (524288K bytes) avail memory = 519516160 (507340K bytes) Programming 24 pins in IOAPIC #0 IOAPIC #0 intpin 2 -> irq 0 FreeBSD/SMP: Multiprocessor motherboard: 2 CPUs cpu0 (BSP): apic id: 1, version: 0x00040011, at 0xfee00000 cpu1 (AP): apic id: 0, version: 0x00040011, at 0xfee00000 io0 (APIC): apic id: 2, version: 0x00170011, at 0xfec00000 Preloaded elf kernel "kernel" at 0xc0327000. Pentium Pro MTRR support enabled md0: Malloc disk Using $PIR table, 14 entries at 0xc00fdee0 npx0: on motherboard npx0: INT 16 interface pcib0: on motherboard IOAPIC #0 intpin 19 -> irq 2 IOAPIC #0 intpin 17 -> irq 16 pci0: on pcib0 isab0: at device 4.0 on pci0 isa0: on isab0 atapci0: port 0xfcd0-0xfcdf at device 4.1 on pci0 ata0: at 0x1f0 irq 14 on atapci0 ata1: at 0x170 irq 15 on atapci0 pci0: at 4.2 irq 2 Timecounter "PIIX" frequency 3579545 Hz chip1: port 0x2180-0x218f at device 4.3 on pci0 pcib1: at device 7.0 on pci0 IOAPIC #0 intpin 16 -> irq 17 pci1: on pcib1 ahc0: port 0xe800-0xe8ff mem 0xfebfe000-0xfebfefff irq 17 at device 4.0 on pci1 aic7880: Ultra Wide Channel A, SCSI Id=7, 16/253 SCBs pci1: (vendor=0x1000, dev=0x000c) at 7.0 irq 18 amr0: mem 0xf0000000-0xf7ffffff irq 16 at device 7.1 on pci0 amr0: Firmware D.02.05, BIOS B.01.04, 16MB RAM pcib2: at device 8.0 on pci0 pci2: on pcib2 fxp0: port 0xdce0-0xdcff mem 0xfe900000-0xfe9fffff,0xefffe000-0xefffefff irq 16 at device 2.0 on pci2 fxp0: Ethernet address 00:90:27:b7:09:76 inphy0: on miibus0 inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto pci0: (vendor=0x103c, dev=0x10c1) at 11.0 pci0: at 13.0 orm0: