Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 23 Oct 2001 12:07:43 +0930
From:      Greg Lehey <grog@FreeBSD.org>
To:        Bjarne Wichmann Petersen <mekanix@privat.dk>
Cc:        freebsd-stable@freebsd.org
Subject:   Re: [HARD CRASH] gdb output - what is it saying?
Message-ID:  <20011023120743.B27668@wantadilla.lemis.com>
In-Reply-To: <20011022200156.PTLK23247.fepA.post.tele.dk@there>; from mekanix@privat.dk on Mon, Oct 22, 2001 at 10:04:46PM %2B0200
References:  <20011018125533.OPYG1982.fepF.post.tele.dk@there> <01101913591302.00703@proxy.the-i-pa.com> <20011021181228.FFZX2863.fepZ.post.tele.dk@there> <20011022200156.PTLK23247.fepA.post.tele.dk@there>

next in thread | previous in thread | raw e-mail | index | archive | help
On Monday, 22 October 2001 at 22:04:46 +0200, Bjarne Wichmann Petersen wrote:
> I've included the latest gdb-out. I have no clue to what it all means, so if
> someone with a clue would help me locate what is causing my 4.4-STABLE to
> crash I'd be very happy.

If I could read it, it would help.  You shouldn't wrap computer
output.  If you follow up, please send the output as it comes, and
make the output in hex.

> (kgdb) symbol-file kernel.debug
> Reading symbols from kernel.debug...done.
> (kgdb) exec-file /var/crash/kernel.4
> (kgdb) core-file /var/crash/vmcore.4
>
> (kgdb) where

You've had two traps in a row, separated by an interrupt.  Both traps
are in timer code.

(rearranging)

>     at ../../i386/i386/trap.c:849
> #16 0xc0357ca7 in trap (frame={tf_fs = 16, tf_es = 16, tf_ds = 16, tf_edi =
> -880967808,
>       tf_esi = -880967900, tf_ebp = -880967916, tf_isp = -880967952, tf_ebx =
> -1058816640,
>       tf_edx = -1069680192, tf_ecx = -1069680192, tf_eax = 1381192787,
> tf_trapno = 12,
>       tf_err = 0, tf_eip = 1381192787, tf_cs = 8, tf_eflags = 66178, tf_esp =
> -1071952601,
>       tf_ss = -1058816640}) at ../../i386/i386/trap.c:448

Here's the first one.  You've had a trap 12 (page fault in kernel
mode).  The IP register (instruction pointer) was pointing to
1381192787.  It's a lot easier to read this if you set your
output-radix to 16, where the address will show as 0x52535453.  This
is not only not a valid kernel address, it represents the text "STSR",
which suggests to me that something has been overwriting the stack.
It's not worth looking at this frame any more.

> #17 0x52535453 in ?? ()

I'm not sure what this is, but clearly the stack has been trashed
(recognize that address?).

> #18 0xc01c6472 in gettimeofday (p=0xcb684ea0, uap=0xcb7d7f80) at
> ../../kern/kern_time.c:307

This should be a call to microtime().  Somehow it didn't get there.

> #19 0xc03586c1 in syscall2 (frame={tf_fs = 47, tf_es = 47, tf_ds = 47, tf_edi
> = 137846784,
>       tf_esi = 46815, tf_ebp = -1077938116, tf_isp = -880967724, tf_ebx =
> 842094169,
>       tf_edx = -1077938308, tf_ecx = 678132992, tf_eax = 116, tf_trapno = 0,
> tf_err = 2,
>       tf_eip = 677671868, tf_cs = 31, tf_eflags = 643, tf_esp = -1077938160,
> tf_ss = 47})
>     at ../../i386/i386/trap.c:1155
> #20 0xc0349ce5 in Xint0x80_syscall ()
> #21 0x8068b2c in ?? ()
> #22 0x8064540 in ?? ()
> #23 0x8061f7d in ?? ()

Clock interrupt.  So far, so good.

> #9  0xc035cbb7 in clkintr (frame={cf_vec = 0, cf_ppl = 0, cf_fs = 16, cf_es =
> 16,
>       cf_ds = 16, cf_edi = -880968264, cf_esi = -880968260, cf_ebp =
> -880968248,
>       -880968308, cf_ebx = -881756544, cf_edx = -880968264, cf_ecx =
> -881756544,
>       cf_eax = -881982624, 0, 0, cf_eip = -1071714030, cf_cs = 8, cf_eflags =
> 582,
>       cf_esp = -1055106048, cf_ss = 0}) at ../../i386/isa/clock.c:216
> #10 0xc01ef112 in vfs_msync (mp=0xc11c5c00, flags=2) at
> ../../kern/vfs_subr.c:2536
> #11 0xc01f00e0 in sync (p=0xc043d760, uap=0x0) at
> ../../kern/vfs_syscalls.c:544
> #12 0xc01c0bd2 in boot (howto=256) at ../../kern/kern_shutdown.c:234
> #13 0xc01c11c0 in poweroff_wait (junk=0xc03cb42c, howto=-1069764785)
>     at ../../kern/kern_shutdown.c:581
> #14 0xc0358416 in trap_fatal (frame=0xcb7d7ec4, eva=1381192787)
>     at ../../i386/i386/trap.c:956
> #15 0xc03580e9 in trap_pfault (frame=0xcb7d7ec4, usermode=0, eva=1381192787)


> #0  dumpsys () at ../../kern/kern_shutdown.c:473
> #1  0xc01c0df3 in boot (howto=260) at ../../kern/kern_shutdown.c:313
> #2  0xc01c11c0 in poweroff_wait (junk=0xc03cb42c, howto=-1069764785)
>     at ../../kern/kern_shutdown.c:581
> #3  0xc0358416 in trap_fatal (frame=0xcb7d7cc4, eva=1381192787)
>     at ../../i386/i386/trap.c:956
> #4  0xc03580e9 in trap_pfault (frame=0xcb7d7cc4, usermode=0, eva=1381192787)
>     at ../../i386/i386/trap.c:849

Second trap.  Same address as the first.

> #5  0xc0357ca7 in trap (frame={tf_fs = 16, tf_es = 16, tf_ds = 16, tf_edi =
> -1058817060,
>       tf_esi = -1058816548, tf_ebp = -880968424, tf_isp = -880968464, tf_ebx
> = -1058817152,
>       tf_edx = -1058817024, tf_ecx = 0, tf_eax = 1381192787, tf_trapno = 12,
> tf_err = 0,
>       tf_eip = 1381192787, tf_cs = 8, tf_eflags = 66050, tf_esp = -1071951411,
>       tf_ss = -1058817152}) at ../../i386/i386/trap.c:448
> #6  0x52535453 in ?? ()

Recognize this bogus address again?

> #7  0xc01b5224 in tco_forward (force=0) at ../../kern/kern_clock.c:761

This should be a call to sync_other_counter().

> #8  0xc01b49b4 in hardclock (frame=0xcb7d7d58) at ../../kern/kern_clock.c:236

This is a puzzling dump.  Have you any specialized timer hardware or
software on your machine?  Is the dump repeatable?

Greg
--
See complete headers for address and phone numbers

To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-stable" in the body of the message




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20011023120743.B27668>