From owner-freebsd-questions@FreeBSD.ORG Sun Dec 30 05:42:01 2007 Return-Path: Delivered-To: freebsd-questions@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 37FB816A419 for ; Sun, 30 Dec 2007 05:42:01 +0000 (UTC) (envelope-from iain@workingsoftware.com.au) Received: from gulfstream.scoastnet.com.au (gulfstream.scoastnet.com.au [202.177.214.10]) by mx1.freebsd.org (Postfix) with ESMTP id B5F4913C45B for ; Sun, 30 Dec 2007 05:42:00 +0000 (UTC) (envelope-from iain@workingsoftware.com.au) Received: from socata.scoastnet.com.au (socata.iaindooley.com [202.167.244.104]) by gulfstream.scoastnet.com.au (8.13.3/8.13.6) with ESMTP id lBU5fwvM076287; Sun, 30 Dec 2007 16:41:58 +1100 (EST) (envelope-from iain@workingsoftware.com.au) Received: by socata.scoastnet.com.au (Postfix, from userid 1001) id 6827C22B16; Sun, 30 Dec 2007 13:27:11 +1100 (EST) Received: from localhost (localhost [127.0.0.1]) by socata.scoastnet.com.au (Postfix) with ESMTP id 6706A22B15; Sun, 30 Dec 2007 13:27:11 +1100 (EST) Date: Sun, 30 Dec 2007 13:27:11 +1100 (EST) From: Iain Dooley To: Kris Kennaway In-Reply-To: <477700E8.3050100@FreeBSD.org> Message-ID: <20071230132455.T2071@socata.scoastnet.com.au> References: <20071230122718.D2071@socata.scoastnet.com.au> <477700E8.3050100@FreeBSD.org> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed Cc: freebsd-questions@freebsd.org Subject: Re: kernel panic 6.2-RELEASE SMP dual quad core X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 30 Dec 2007 05:42:01 -0000 >> uname -a >> FreeBSD HOSTNAME 6.2-RELEASE FreeBSD 6.2-RELEASE #0: Fri Jan 12 11:05:30 >> UTC 2007 root@dessler.cse.buffalo.edu:/usr/obj/usr/src/sys/SMP i386 >> >> running on dual quad core intel xeons with 4gb ram. >> >> my server has been rebooting quite a bit and stopped responding today. i >> found this on the console: >> >> Fatal trap: 12 page fault while in kernel mode >> cpuid = 5; apic id = 05 >> fault virtual address = 0x0 >> fault code = supervisor write, page not present >> instruction pointer = 0x20:0xc0880472 >> stack pointer = 0x28:0xe6ea9c8c >> code segment = base 0x0, limit 0xfffff, type 0x1b >> = DPL 0, pres 1, def32 1, gran 1 >> processor eflags = resume, IOPL = 0 >> current process = 12 (idle: cpu5) >> trap number = 12 >> panic: page fault >> cpuid = 5 >> uptime 24m41s >> cannot dump. no dump device specified >> >> i've configured the dump device and will follow the kernel debugging >> details in the handbook if it happens again but i thought i'd write in now >> in case the cause of the problem jumped out at anyone. >> >> i've run mprime for 24 hours, and memtest for 3 passes, and a script i >> wrote which just exhausts ram and CPU, then backs off and does it again >> which have been running for 24 hours. >> >> i've also just started running this: >> >> http://www.holm.cc/stress/ >> >> i noticed that the same "fatal trap 12" appeared during stress tests as >> listed here: >> >> http://people.freebsd.org/~pho/stress/log/cons224.html >> >> any help, guidance or information would be much appreciated. > > What you have so far is close to meaningless. The "fault virtual address = > 0x0" means little more than "somewhere in the kernel there was a null pointer > dereference". The fact that it was the idle process is suspicious though, it > suggests that hardware failure is a high probability. Please follow up with > the backtrace if you want to pursue this further. I thought that dodgy ram was the culprit. I've run memtest86 on three passes with no errrors reported although I've also read numerous reports on the net of memtest86 not being very effective. can you suggest a good method of stress testing ram? this is a new machine and still under warranty so if it's a ram issue I can easily just get it replaced. Cheers, Iain