From owner-freebsd-bugs Fri Mar 30 2:20: 9 2001 Delivered-To: freebsd-bugs@hub.freebsd.org Received: from freefall.freebsd.org (freefall.freebsd.org [216.136.204.21]) by hub.freebsd.org (Postfix) with ESMTP id 9A33737B719 for ; Fri, 30 Mar 2001 02:20:02 -0800 (PST) (envelope-from gnats@FreeBSD.org) Received: (from gnats@localhost) by freefall.freebsd.org (8.11.1/8.11.1) id f2UAK2706707; Fri, 30 Mar 2001 02:20:02 -0800 (PST) (envelope-from gnats) Date: Fri, 30 Mar 2001 02:20:02 -0800 (PST) Message-Id: <200103301020.f2UAK2706707@freefall.freebsd.org> To: freebsd-bugs@FreeBSD.org Cc: From: cjm88@home.com Subject: Re: kern/26161: Kernel Panic on Dual Processor System during heavy disk IO Reply-To: cjm88@home.com Sender: owner-freebsd-bugs@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.org The following reply was made to PR kern/26161; it has been noted by GNATS. From: cjm88@home.com To: Peter Pentchev Cc: freebsd-gnats-submit@FreeBSD.org Subject: Re: kern/26161: Kernel Panic on Dual Processor System during heavy disk IO Date: Fri, 30 Mar 2001 05:16:12 -0500 OK, I followed the first suggestion and initially it seemed that the system was more stable (i.e. it took longer for it to panic). When it paniced I built a debug versino of the kernel and ran the tests again. this time it took even longer for the system to crash. So I ran the tests a few more times with increasing intensity. It seemed that the time required to crash the system was inversely proportional to the intensity of the disk IO that the system was subjected to. Here is what the gdb session (run by a newbie... i.e. 'me' ) showed... I probably need some further direction from someone more experienced to extract more useful information. u2# gdb -k GNU gdb 4.18 Copyright 1998 Free Software Foundation, Inc. GDB is free software, covered by the GNU General Public License, and you are welcome to change it and/or distribute copies of it under certain conditions. Type "show copying" to see the conditions. There is absolutely no warranty for GDB. Type "show warranty" for details. This GDB was configured as "i386-unknown-freebsd". (kgdb) symbol-file kernel.debug Reading symbols from kernel.debug...done. (kgdb) exec-file kernel (kgdb) core-file /usr/wrk/vmcore.0 SMP 2 cpus IdlePTD 3461120 initial pcb at 2bdae0 panicstr: NMI indicates hardware failure panic messages: --- --- #0 0xc0158eae in dumpsys () (kgdb) where #0 0xc0158eae in dumpsys () #1 0xc0158ccf in boot () #2 0xc0159080 in poweroff_wait () #3 0xc0256d90 in trap (frame={tf_fs = 47, tf_es = 47, tf_ds = 47, tf_edi = 134533120, tf_esi = 69, tf_ebp = -1077937120, tf_isp = -931377196, tf_ebx = 183304184, tf_edx = -1077937252, tf_ecx = 672025592, tf_eax = 15, tf_trapno = 19, tf_err = 0, tf_eip = 134514178, tf_cs = 31, tf_eflags = 514, tf_esp = -1077937160, tf_ss = 47}) at ../../i386/i386/trap.c:396 #4 0x8048602 in ?? () #5 0x80484bd in ?? () (kgdb) OK... :) so now what do I do??? Thanks for your help. C Peter Pentchev wrote: > On Tue, Mar 27, 2001 at 06:31:21PM -0800, cjm88@home.com wrote: > > > > >Number: 26161 > > >Category: kern > > >Synopsis: Kernel Panic on Dual Processor System during heavy disk IO > > >Originator: Christophe Michel > > >Release: 4.2-RELEASE > > >Organization: > > >Environment: > > FreeBSD u2 4.2-RELEASE FreeBSD 4.2-RELEASE #1: Sat Mar 24 21:27:43 EST 2001 > > root@u2:/usr/src/sys/compile/U2 i386 > > > > >Description: > > The system panics when subjected to heavy disk IO. The system is an Intel > > altserver with two Pentium 166 processors on a mother board supprting SMP > > 1.4. I'm using the on-board adaptec SCSI controller with 2G Seagate drive. > > It is quite stable until something requires heavy disk IO and then > > crashes within 15 to 30 minutes. The behavior is the same whether the IO > > is for swapping or just heavy file access. I managed to photograph the > > console just after the panic on two occasions and can forward via e-mail, > > those jpgs to whoever would be interested in looking at this problem. > > > > I tried to replicate the problem on two other FreeBSD platforms but they > > were single-cpu boxes. The problem did not occur even after extended > > disk pounding (over 24 hours). > > This is all very nice :) But, can you either: > > 1. update your system to 4.2-stable (which is actually 4.3-RC now), or > > 2. follow the instructions on http://www.FreeBSD.org/handbook/kerneldebug.html > to build a debugging kernel, run dumpon, have the kernel panic again, > this time storing the core dump, then run savecore and examine > the kernel crash dump, posting more information about the dump? > > G'luck, > Peter > > -- > Nostalgia ain't what it used to be. To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-bugs" in the body of the message