Date: Tue, 23 Oct 2001 12:07:43 +0930 From: Greg Lehey <grog@FreeBSD.org> To: Bjarne Wichmann Petersen <mekanix@privat.dk> Cc: freebsd-stable@freebsd.org Subject: Re: [HARD CRASH] gdb output - what is it saying? Message-ID: <20011023120743.B27668@wantadilla.lemis.com> In-Reply-To: <20011022200156.PTLK23247.fepA.post.tele.dk@there>; from mekanix@privat.dk on Mon, Oct 22, 2001 at 10:04:46PM %2B0200 References: <20011018125533.OPYG1982.fepF.post.tele.dk@there> <01101913591302.00703@proxy.the-i-pa.com> <20011021181228.FFZX2863.fepZ.post.tele.dk@there> <20011022200156.PTLK23247.fepA.post.tele.dk@there>
next in thread | previous in thread | raw e-mail | index | archive | help
On Monday, 22 October 2001 at 22:04:46 +0200, Bjarne Wichmann Petersen wrote:
> I've included the latest gdb-out. I have no clue to what it all means, so if
> someone with a clue would help me locate what is causing my 4.4-STABLE to
> crash I'd be very happy.
If I could read it, it would help. You shouldn't wrap computer
output. If you follow up, please send the output as it comes, and
make the output in hex.
> (kgdb) symbol-file kernel.debug
> Reading symbols from kernel.debug...done.
> (kgdb) exec-file /var/crash/kernel.4
> (kgdb) core-file /var/crash/vmcore.4
>
> (kgdb) where
You've had two traps in a row, separated by an interrupt. Both traps
are in timer code.
(rearranging)
> at ../../i386/i386/trap.c:849
> #16 0xc0357ca7 in trap (frame={tf_fs = 16, tf_es = 16, tf_ds = 16, tf_edi =
> -880967808,
> tf_esi = -880967900, tf_ebp = -880967916, tf_isp = -880967952, tf_ebx =
> -1058816640,
> tf_edx = -1069680192, tf_ecx = -1069680192, tf_eax = 1381192787,
> tf_trapno = 12,
> tf_err = 0, tf_eip = 1381192787, tf_cs = 8, tf_eflags = 66178, tf_esp =
> -1071952601,
> tf_ss = -1058816640}) at ../../i386/i386/trap.c:448
Here's the first one. You've had a trap 12 (page fault in kernel
mode). The IP register (instruction pointer) was pointing to
1381192787. It's a lot easier to read this if you set your
output-radix to 16, where the address will show as 0x52535453. This
is not only not a valid kernel address, it represents the text "STSR",
which suggests to me that something has been overwriting the stack.
It's not worth looking at this frame any more.
> #17 0x52535453 in ?? ()
I'm not sure what this is, but clearly the stack has been trashed
(recognize that address?).
> #18 0xc01c6472 in gettimeofday (p=0xcb684ea0, uap=0xcb7d7f80) at
> ../../kern/kern_time.c:307
This should be a call to microtime(). Somehow it didn't get there.
> #19 0xc03586c1 in syscall2 (frame={tf_fs = 47, tf_es = 47, tf_ds = 47, tf_edi
> = 137846784,
> tf_esi = 46815, tf_ebp = -1077938116, tf_isp = -880967724, tf_ebx =
> 842094169,
> tf_edx = -1077938308, tf_ecx = 678132992, tf_eax = 116, tf_trapno = 0,
> tf_err = 2,
> tf_eip = 677671868, tf_cs = 31, tf_eflags = 643, tf_esp = -1077938160,
> tf_ss = 47})
> at ../../i386/i386/trap.c:1155
> #20 0xc0349ce5 in Xint0x80_syscall ()
> #21 0x8068b2c in ?? ()
> #22 0x8064540 in ?? ()
> #23 0x8061f7d in ?? ()
Clock interrupt. So far, so good.
> #9 0xc035cbb7 in clkintr (frame={cf_vec = 0, cf_ppl = 0, cf_fs = 16, cf_es =
> 16,
> cf_ds = 16, cf_edi = -880968264, cf_esi = -880968260, cf_ebp =
> -880968248,
> -880968308, cf_ebx = -881756544, cf_edx = -880968264, cf_ecx =
> -881756544,
> cf_eax = -881982624, 0, 0, cf_eip = -1071714030, cf_cs = 8, cf_eflags =
> 582,
> cf_esp = -1055106048, cf_ss = 0}) at ../../i386/isa/clock.c:216
> #10 0xc01ef112 in vfs_msync (mp=0xc11c5c00, flags=2) at
> ../../kern/vfs_subr.c:2536
> #11 0xc01f00e0 in sync (p=0xc043d760, uap=0x0) at
> ../../kern/vfs_syscalls.c:544
> #12 0xc01c0bd2 in boot (howto=256) at ../../kern/kern_shutdown.c:234
> #13 0xc01c11c0 in poweroff_wait (junk=0xc03cb42c, howto=-1069764785)
> at ../../kern/kern_shutdown.c:581
> #14 0xc0358416 in trap_fatal (frame=0xcb7d7ec4, eva=1381192787)
> at ../../i386/i386/trap.c:956
> #15 0xc03580e9 in trap_pfault (frame=0xcb7d7ec4, usermode=0, eva=1381192787)
> #0 dumpsys () at ../../kern/kern_shutdown.c:473
> #1 0xc01c0df3 in boot (howto=260) at ../../kern/kern_shutdown.c:313
> #2 0xc01c11c0 in poweroff_wait (junk=0xc03cb42c, howto=-1069764785)
> at ../../kern/kern_shutdown.c:581
> #3 0xc0358416 in trap_fatal (frame=0xcb7d7cc4, eva=1381192787)
> at ../../i386/i386/trap.c:956
> #4 0xc03580e9 in trap_pfault (frame=0xcb7d7cc4, usermode=0, eva=1381192787)
> at ../../i386/i386/trap.c:849
Second trap. Same address as the first.
> #5 0xc0357ca7 in trap (frame={tf_fs = 16, tf_es = 16, tf_ds = 16, tf_edi =
> -1058817060,
> tf_esi = -1058816548, tf_ebp = -880968424, tf_isp = -880968464, tf_ebx
> = -1058817152,
> tf_edx = -1058817024, tf_ecx = 0, tf_eax = 1381192787, tf_trapno = 12,
> tf_err = 0,
> tf_eip = 1381192787, tf_cs = 8, tf_eflags = 66050, tf_esp = -1071951411,
> tf_ss = -1058817152}) at ../../i386/i386/trap.c:448
> #6 0x52535453 in ?? ()
Recognize this bogus address again?
> #7 0xc01b5224 in tco_forward (force=0) at ../../kern/kern_clock.c:761
This should be a call to sync_other_counter().
> #8 0xc01b49b4 in hardclock (frame=0xcb7d7d58) at ../../kern/kern_clock.c:236
This is a puzzling dump. Have you any specialized timer hardware or
software on your machine? Is the dump repeatable?
Greg
--
See complete headers for address and phone numbers
To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-stable" in the body of the message
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20011023120743.B27668>
