From owner-freebsd-stable@FreeBSD.ORG Tue Jul 14 17:39:18 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 316CF1065698 for ; Tue, 14 Jul 2009 17:39:18 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from cyrus.watson.org (cyrus.watson.org [65.122.17.42]) by mx1.freebsd.org (Postfix) with ESMTP id E41E48FC23 for ; Tue, 14 Jul 2009 17:39:17 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from bigwig.baldwin.cx (66.111.2.69.static.nyinternet.net [66.111.2.69]) by cyrus.watson.org (Postfix) with ESMTPSA id 7B33546B5C; Tue, 14 Jul 2009 13:39:17 -0400 (EDT) Received: from jhbbsd.hudson-trading.com (unknown [209.249.190.8]) by bigwig.baldwin.cx (Postfix) with ESMTPA id 655148A099; Tue, 14 Jul 2009 13:39:16 -0400 (EDT) From: John Baldwin To: freebsd-stable@freebsd.org Date: Tue, 14 Jul 2009 13:00:16 -0400 User-Agent: KMail/1.9.7 References: <20090714145101.60515vh49s7o7c4k@10.248.192.16> In-Reply-To: <20090714145101.60515vh49s7o7c4k@10.248.192.16> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200907141300.16634.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.0.1 (bigwig.baldwin.cx); Tue, 14 Jul 2009 13:39:16 -0400 (EDT) X-Virus-Scanned: clamav-milter 0.95.1 at bigwig.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-2.5 required=4.2 tests=AWL,BAYES_00,RDNS_NONE autolearn=no version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on bigwig.baldwin.cx Cc: Subject: Re: trap 12 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 14 Jul 2009 17:39:19 -0000 On Tuesday 14 July 2009 9:51:01 am Ian J Hart wrote: > Quoting John Baldwin : > > > On Tuesday 07 July 2009 5:51:03 am Ian J Hart wrote: > >> Quoting Ian J Hart : > >> > >>> Quoting Ian J Hart : > >>> > >>>> Is this likely to be hardware? Details will follow if not. > >>>> > >>>> [copied from a screen dump] > >>>> > >>>> Fatal trap 12: page fault while in kernel mode > >>>> cpuid = 1; apic id = 01 > >>>> fault virtual address = 0x0 > >>>> fault code = supervisor write data, page not present > >>>> instruction pointer = 0x8:0xffffffff807c6c12 > >>>> stack pointer = 0x10:0xffffffff510e7890 > >>>> frame pointer = 0x10:0xffffff00054a6c90 > >>>> code segment = base 0x0, limit 0xfffff, type 0x1b > >>>> = DPL 0, pres 1, long 1 def32 0, gran 1 > >>>> processor eflags = interrupt enabled, resume, IOPL = 0 > >>>> current process = 75372 (printf) > >>>> trap number = 12 > >>>> panic: page fault > >>>> cpuid = 1 > >>>> uptime: 8m2s > >>>> Cannot dump. No dump device defined. > >>>> > >>>> > >>> Ran crashinfo, now have much more info than I need ;) > >>> > >>> Starting another portupgrade run now to see how reproducable this is. > >>> > >>> Later BIOS waiting in USB floppy. > >>> > >> [snip dmesg] > >> > >> It took 2 runs of portupgrade -af.Some corruption in the dbs may have > >> to pkg_delete -a. > >> > >> FreeBSD * 7.2-RELEASE-p1 FreeBSD 7.2-RELEASE-p1 #0: Tue Jun 16 > >> 18:03:10 BST 2009 *@*:/usr/obj/usr/src/sys/GENERIC amd64 > >> > >> panic: page fault > >> > >> GNU gdb 6.1.1 [FreeBSD] > >> Copyright 2004 Free Software Foundation, Inc. > >> GDB is free software, covered by the GNU General Public License, and you are > >> welcome to change it and/or distribute copies of it under certain > > conditions. > >> Type "show copying" to see the conditions. > >> There is absolutely no warranty for GDB. Type "show warranty" for details. > >> This GDB was configured as "amd64-marcel-freebsd"... > >> > >> Unread portion of the kernel message buffer: > >> > >> > >> Fatal trap 12: page fault while in kernel mode > >> cpuid = 1; apic id = 01 > >> fault virtual address = 0xfffffffff5555570 > >> fault code = supervisor write data, page not present > >> instruction pointer = 0x8:0xffffffff807c429b > >> stack pointer = 0x10:0xffffffff511e4710 > >> frame pointer = 0x10:0x20 > >> code segment = base 0x0, limit 0xfffff, type 0x1b > >> = DPL 0, pres 1, long 1, def32 0, gran 1 > >> processor eflags = interrupt enabled, resume, IOPL = 0 > >> current process = 69996 (mkdir) > >> trap number = 12 > >> panic: page fault > > > > This one does look like a hardware issue from the stack trace. It's hard to > > know if the first panic you saw was a hardware issue as well without the > > stack trace information. > > > >> #7 0xffffffff807b706e in calltrap () > >> at /usr/src/sys/amd64/amd64/exception.S:209 > >> #8 0xffffffff807c429b in free_pv_entry (pmap=0xffffffff80b66c80, > >> pv=Variable "pv" is not available. > >> ) > >> at /usr/src/sys/amd64/amd64/pmap.c:1905 > >> #9 0xffffffff807c4403 in pmap_remove_entry (pmap=Variable "pmap" is > >> not available. > >> ) > >> at /usr/src/sys/amd64/amd64/pmap.c:2131 > >> #10 0xffffffff807c6447 in pmap_remove_pte (pmap=0xffffffff80b66c80, > >> ptq=0xaaaaaaa8, va=18446744070506639360, ptepde=23601251, > >> free=0xffffffff511e4790) at /usr/src/sys/amd64/amd64/pmap.c:2366 > >> #11 0xffffffff807cab87 in pmap_remove (pmap=0xffffffff80b66c80, > >> sva=18446744070506639360, eva=18446744070506909696) > >> at /usr/src/sys/amd64/amd64/pmap.c:2510 > > > > -- > > John Baldwin > > > > The remote backup continues to run so there was definitely some issue > there. No more reboots, but it wasn't doing that regularly without > some additional load. > > Hopefully I can swap parts around until I find the offending item. > > Thanks for your input. I would try running memtest86 to check your RAM. -- John Baldwin