From owner-freebsd-current@FreeBSD.ORG Tue Jul 28 15:33:13 2009 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 4302F1065670; Tue, 28 Jul 2009 15:33:13 +0000 (UTC) (envelope-from xcllnt@mac.com) Received: from asmtpout029.mac.com (asmtpout029.mac.com [17.148.16.104]) by mx1.freebsd.org (Postfix) with ESMTP id 2B7B78FC1A; Tue, 28 Jul 2009 15:33:13 +0000 (UTC) (envelope-from xcllnt@mac.com) MIME-version: 1.0 Content-transfer-encoding: 7BIT Content-type: text/plain; charset=us-ascii; format=flowed; delsp=yes Received: from [172.24.241.141] (natint3.juniper.net [66.129.224.36]) by asmtp029.mac.com (Sun Java(tm) System Messaging Server 6.3-8.01 (built Dec 16 2008; 32bit)) with ESMTPSA id <0KNI00K3H0HITE10@asmtp029.mac.com>; Tue, 28 Jul 2009 08:33:01 -0700 (PDT) From: Marcel Moolenaar In-reply-to: <20090728103545.GA22380@mech-cluster241.men.bris.ac.uk> Date: Tue, 28 Jul 2009 08:32:06 -0700 Message-id: References: <4A6DB30B.20705@zedat.fu-berlin.de> <4A6DB9F1.7050404@haruhiism.net> <4A6E0620.6070200@mail.zedat.fu-berlin.de> <20090727210428.GA30253@mech-cluster241.men.bris.ac.uk> <20090728103545.GA22380@mech-cluster241.men.bris.ac.uk> To: Anton Shterenlikht X-Mailer: Apple Mail (2.1074) Cc: "O. Hartmann" , freebsd-current@freebsd.org, freebsd-ia64@freebsd.org Subject: Re: FreeBSD 8.0-BETA2/amd64 crashes on SMP under load X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 28 Jul 2009 15:33:13 -0000 On Jul 28, 2009, at 3:35 AM, Anton Shterenlikht wrote: > On Mon, Jul 27, 2009 at 10:04:28PM +0100, Anton Shterenlikht wrote: >> On Mon, Jul 27, 2009 at 09:55:12PM +0200, O. Hartmann wrote: >>> Kamigishi Rei wrote: >>>> O. Hartmann wrote: >>>>> I have the problem of crashing FreeBSD 8.0-BETA2/amd64 under >>>>> load on >>>>> all of our SMP boxes. Is there an issue known at the moment? If >>>>> not, I >>>>> will prepare the kernel for whitnessing and provide more >>>>> informations, >>>>> if you wish. >>>> A quick question: what is in the crash message, i.e. the backtrace? >>>> And what kind of crash is it - a panic() or a fatal trap? >>> >>> On the 8-core server box, I sometimes see : >>> >>> Fatal trap 12: page fault while in kernel mode >>> fault code = supervisor read, page not present >> >> Not sure if it's related, but on ia64 SMP (2 cpus) with 8.0-current >> and >> later with 8.0-beta1 (I havent' built beta2 yet) I'm getting crashes >> under load every so often. E.g buildworld -j8 is likely to crash the >> box. No messages, just a sudden freeze, no backtrace or panic, and >> then reboot. >> >> If load is less heavy, e.g. fewer processes and some idle time, the >> problem doesn't seem to appear. >> >> I'm happy to do any further testing, if suggested. > > my ia64 8.0-beta1 SMP box died again on > make -j8 buildworld > with no panic or log entries. Do you have MCA records? > > Is it possible that some kernel variable needs to > be increased? E.g. kern.maxproc, kern.maxfiles, etc. No need. -- Marcel Moolenaar xcllnt@mac.com