Date:      Tue, 28 Jul 2009 08:32:06 -0700
From:      Marcel Moolenaar <xcllnt@mac.com>
To:        Anton Shterenlikht <mexas@bristol.ac.uk>
Cc:        "O. Hartmann" <ohartman@mail.zedat.fu-berlin.de>, freebsd-current@freebsd.org, freebsd-ia64@freebsd.org
Subject:   Re: FreeBSD 8.0-BETA2/amd64 crashes on SMP under load
Message-ID:  <F8C8F9A7-2A9D-4F19-902C-48D85131AE8B@mac.com>
In-Reply-To: <20090728103545.GA22380@mech-cluster241.men.bris.ac.uk>
References:  <4A6DB30B.20705@zedat.fu-berlin.de> <4A6DB9F1.7050404@haruhiism.net> <4A6E0620.6070200@mail.zedat.fu-berlin.de> <20090727210428.GA30253@mech-cluster241.men.bris.ac.uk> <20090728103545.GA22380@mech-cluster241.men.bris.ac.uk>

On Jul 28, 2009, at 3:35 AM, Anton Shterenlikht wrote:

> On Mon, Jul 27, 2009 at 10:04:28PM +0100, Anton Shterenlikht wrote:
>> On Mon, Jul 27, 2009 at 09:55:12PM +0200, O. Hartmann wrote:
>>> Kamigishi Rei wrote:
>>>> O. Hartmann wrote:
>>>>> I have the problem of crashing FreeBSD 8.0-BETA2/amd64 under load on
>>>>> all of our SMP boxes. Is there an issue known at the moment? If not, I
>>>>> will prepare the kernel for witnessing and provide more information,
>>>>> if you wish.
>>>> A quick question: what is in the crash message, i.e. the backtrace?
>>>> And what kind of crash is it - a panic() or a fatal trap?
>>>
>>> On the 8-core server box, I sometimes see:
>>>
>>> Fatal trap 12: page fault while in kernel mode
>>> fault code              = supervisor read, page not present
>>
>> Not sure if it's related, but on ia64 SMP (2 cpus) with 8.0-current and
>> later with 8.0-beta1 (I haven't built beta2 yet) I'm getting crashes
>> under load every so often. E.g. buildworld -j8 is likely to crash the
>> box. No messages, just a sudden freeze, no backtrace or panic, and then
>> reboot.
>>
>> If load is less heavy, e.g. fewer processes and some idle time, the
>> problem doesn't seem to appear.
>>
>> I'm happy to do any further testing, if suggested.
>
> my ia64 8.0-beta1 SMP box died again on
> make -j8 buildworld
> with no panic or log entries.

Do you have MCA records?

>
> Is it possible that some kernel variable needs to
> be increased? E.g. kern.maxproc, kern.maxfiles, etc.

No need.

-- 
Marcel Moolenaar
xcllnt@mac.com