From owner-freebsd-amd64@FreeBSD.ORG Tue Feb 3 16:08:02 2009 Return-Path: Delivered-To: freebsd-amd64@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id AE076106564A; Tue, 3 Feb 2009 16:08:02 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from cyrus.watson.org (cyrus.watson.org [65.122.17.42]) by mx1.freebsd.org (Postfix) with ESMTP id 6C5D78FC17; Tue, 3 Feb 2009 16:08:02 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from server.baldwin.cx (pool-98-109-39-197.nwrknj.fios.verizon.net [98.109.39.197]) by cyrus.watson.org (Postfix) with ESMTPSA id D91CF46B09; Tue, 3 Feb 2009 11:08:01 -0500 (EST) Received: from localhost (john@localhost [127.0.0.1]) (authenticated bits=0) by server.baldwin.cx (8.14.3/8.14.3) with ESMTP id n13G7thR050843; Tue, 3 Feb 2009 11:07:55 -0500 (EST) (envelope-from jhb@freebsd.org) From: John Baldwin To: Roar Pettersen Date: Tue, 3 Feb 2009 10:20:32 -0500 User-Agent: KMail/1.9.7 References: <200901311529.n0VFTj73014198@www.freebsd.org> <200902021059.28556.jhb@freebsd.org> In-Reply-To: MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200902031020.33158.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH authentication, not delayed by milter-greylist-2.0.2 (server.baldwin.cx [127.0.0.1]); Tue, 03 Feb 2009 11:07:55 -0500 (EST) X-Virus-Scanned: ClamAV 0.94.2/8946/Tue Feb 3 07:32:04 2009 on server.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-4.4 required=4.2 tests=ALL_TRUSTED,AWL,BAYES_00 autolearn=ham version=3.1.3 X-Spam-Checker-Version: SpamAssassin 3.1.3 (2006-06-01) on server.baldwin.cx Cc: freebsd-gnats-submit@freebsd.org, freebsd-amd64@freebsd.org Subject: Re: amd64/131209: 7.1-STABLE amd64 crash X-BeenThere: freebsd-amd64@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Porting FreeBSD to the AMD64 platform List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 03 Feb 2009 16:08:03 -0000 On Tuesday 03 February 2009 2:34:04 am Roar Pettersen wrote: > Hello John ! > > > Is your source tree out of date wrt your kernel? The kernel messages clearly > > show a page fault, not a TSS fault as Xtss() would indicate. Also, if > > vattr_null() was passed a NULL pointer, it should have faulted at the start > > of its routine rather than halfway through it. > > Yes, forgot that I had done a buildworld and build kernel to get all new > patches installed. > > > No crash yet, but each time we do a "shutdown -r now" because the system > get unstable/unusable after some hours (4-6), we now get a dump each time > : > > # kgdb kernel.debug /var/crash/vmcore.6 > GNU gdb 6.1.1 [FreeBSD] > Copyright 2004 Free Software Foundation, Inc. > GDB is free software, covered by the GNU General Public License, and you > are > welcome to change it and/or distribute copies of it under certain > conditions. > Type "show copying" to see the conditions. > There is absolutely no warranty for GDB. Type "show warranty" for > details. > This GDB was configured as "amd64-marcel-freebsd"... > > Unread portion of the kernel message buffer: > <118>Feb 3 07:27:50 proxy-gw syslogd: exiting on signal 15 > Waiting (max 60 seconds) for system process `vnlru' to stop...done > WaitSiynncing dgi s(kmsa,x v6n0o dseesc ornedmsa)i nfionrg .s.y.st6e m > process `syncer' to stop...5 1 2 2 1 1 0 0 0 done > Waiting (max 60 seconds) for system process `bufdaemon' to stop...done > All buffers synced. > Uptime: 7h8m11s > > > Fatal trap 12: page fault while in kernel mode > cpuid = 1; apic id = 01 > fault virtual address = 0x10 > fault code = supervisor read data, page not present > instruction pointer = 0x8:0xffffffff8021d746 > stack pointer = 0x10:0xfffffffef7cf3b20 > frame pointer = 0x10:0x12000 > code segment = base 0x0, limit 0xfffff, type 0x1b > = DPL 0, pres 1, long 1, def32 0, gran 1 > processor eflags = interrupt enabled, resume, IOPL = 0 > current process = 29 (irq257: bce1) > trap number = 12 > panic: page fault > cpuid = 1 > Uptime: 7h8m11s > Physical memory: 4087 MB > Dumping 362 MB: 347 331 315 299 283 267 251 235 219 203 187 171 155 139 > 123 107 91 75 59 43 27 11 > > #0 doadump () at pcpu.h:195 > 195 __asm __volatile("movq %%gs:0,%0" : "=r" (td)); > (kgdb) > (kgdb) backtrace > #0 doadump () at pcpu.h:195 > #1 0x0000000000000004 in ?? () > #2 0xffffffff802fae39 in boot (howto=260) at > /usr/src/sys/kern/kern_shutdown.c:418 > #3 0xffffffff802fb242 in panic (fmt=0x104
) > at /usr/src/sys/kern/kern_shutdown.c:574 > #4 0xffffffff804fbd63 in trap_fatal (frame=0xffffff0001559000, > eva=Variable "eva" is not available. > ) at /usr/src/sys/amd64/amd64/trap.c:764 > #5 0xffffffff804fc135 in trap_pfault (frame=0xfffffffef7cf3a70, > usermode=0) > at /usr/src/sys/amd64/amd64/trap.c:680 > #6 0xffffffff804fca78 in trap (frame=0xfffffffef7cf3a70) at > /usr/src/sys/amd64/amd64/trap.c:449 > #7 0xffffffff804e24be in calltrap () at > /usr/src/sys/amd64/amd64/exception.S:209 > #8 0xffffffff8021d746 in bce_intr (xsc=Variable "xsc" is not available. > ) at /usr/src/sys/dev/bce/if_bce.c:5748 Looks to be a bug here. Can you do 'frame 8' followed by 'l' in kgdb? -- John Baldwin