From owner-freebsd-ia64@FreeBSD.ORG Wed Jul 29 22:58:10 2009 Return-Path: Delivered-To: freebsd-ia64@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 31A3C106566C for ; Wed, 29 Jul 2009 22:58:10 +0000 (UTC) (envelope-from gaijin.k@gmail.com) Received: from qw-out-2122.google.com (qw-out-2122.google.com [74.125.92.27]) by mx1.freebsd.org (Postfix) with ESMTP id D1D2A8FC0C for ; Wed, 29 Jul 2009 22:58:09 +0000 (UTC) (envelope-from gaijin.k@gmail.com) Received: by qw-out-2122.google.com with SMTP id 3so580593qwe.7 for ; Wed, 29 Jul 2009 15:58:09 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:subject:from:to:cc :in-reply-to:references:content-type:date:message-id:mime-version :x-mailer:content-transfer-encoding; bh=fpLCfkyva86RdzJUS4UFcU5luym9MJBeDqmg3LKoJkM=; b=K8eyj8C8wkVu0rSF+yoDEQeSFnWru+ZzMQBZRNZa6Y9V9LguZ/iGJmR2UX/0uOhWNi eavLGUJLyfOLnAFF6EtK6PWjR0I4s9C2NhaOfgyp+ufOJPFHsWLY4T88bJqBc0hDSlcC Y3W+/KlMap5N/BXKTpDR/McAuK831yOl1WN+g= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=subject:from:to:cc:in-reply-to:references:content-type:date :message-id:mime-version:x-mailer:content-transfer-encoding; b=B8beufepV8kZQipMyvTkrRZkyOUOoriCp5q+oJKRVQ2g867FMa9Uoey2s1vOH1G7/4 BuBm0eqHDatU689L6DzW1TCAdg/wr9oHHRVvSAZWk9fOOkjW5fW0K/EL8S2AExHRPx7z c/Cj/4+NQWWD3/R28h/6HVQQCI2697Q0nZ9tY= Received: by 10.224.37.69 with SMTP id w5mr311544qad.250.1248906924511; Wed, 29 Jul 2009 15:35:24 -0700 (PDT) Received: from ?10.0.3.231? (pool-71-250-33-226.nwrknj.east.verizon.net [71.250.33.226]) by mx.google.com with ESMTPS id 2sm1724312qwi.3.2009.07.29.15.35.22 (version=SSLv3 cipher=RC4-MD5); Wed, 29 Jul 2009 15:35:23 -0700 (PDT) From: "Alexandre \"Sunny\" Kovalenko" To: Anton Shterenlikht In-Reply-To: <20090728144555.GD75439@mech-cluster241.men.bris.ac.uk> References: <4A6DB30B.20705@zedat.fu-berlin.de> <4A6DB9F1.7050404@haruhiism.net> <4A6E0620.6070200@mail.zedat.fu-berlin.de> <20090727210428.GA30253@mech-cluster241.men.bris.ac.uk> <20090728103545.GA22380@mech-cluster241.men.bris.ac.uk> <4A6F09BA.2020703@zedat.fu-berlin.de> <20090728144555.GD75439@mech-cluster241.men.bris.ac.uk> Content-Type: text/plain; charset="UTF-8" Date: Wed, 29 Jul 2009 18:34:15 -0400 Message-Id: <1248906855.1459.8.camel@RabbitsDen> Mime-Version: 1.0 X-Mailer: Evolution 2.26.3 FreeBSD GNOME Team Port Content-Transfer-Encoding: 8bit Cc: "O. Hartmann" , freebsd-current@freebsd.org, "O. Hartmann" , freebsd-ia64@freebsd.org Subject: Re: FreeBSD 8.0-BETA2/amd64 crashes on SMP under load X-BeenThere: freebsd-ia64@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Porting FreeBSD to the IA-64 List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 29 Jul 2009 22:58:10 -0000 On Tue, 2009-07-28 at 15:45 +0100, Anton Shterenlikht wrote: > On Tue, Jul 28, 2009 at 02:22:50PM +0000, O. Hartmann wrote: > > Anton Shterenlikht wrote: > > > On Mon, Jul 27, 2009 at 10:04:28PM +0100, Anton Shterenlikht wrote: > > >> On Mon, Jul 27, 2009 at 09:55:12PM +0200, O. Hartmann wrote: > > >>> Kamigishi Rei wrote: > > >>>> O. Hartmann wrote: > > >>>>> I have the problem of crashing FreeBSD 8.0-BETA2/amd64 under load on > > >>>>> all of our SMP boxes. Is there an issue known at the moment? If not, I > > >>>>> will prepare the kernel for whitnessing and provide more informations, > > >>>>> if you wish. > > >>>> A quick question: what is in the crash message, i.e. the backtrace? > > >>>> And what kind of crash is it - a panic() or a fatal trap? > > >>> On the 8-core server box, I sometimes see : > > >>> > > >>> Fatal trap 12: page fault while in kernel mode > > >>> fault code = supervisor read, page not present > > >> Not sure if it's related, but on ia64 SMP (2 cpus) with 8.0-current and > > >> later with 8.0-beta1 (I havent' built beta2 yet) I'm getting crashes > > >> under load every so often. E.g buildworld -j8 is likely to crash the > > >> box. No messages, just a sudden freeze, no backtrace or panic, and then reboot. > > >> > > >> If load is less heavy, e.g. fewer processes and some idle time, the > > >> problem doesn't seem to appear. > > >> > > >> I'm happy to do any further testing, if suggested. > > > > > > my ia64 8.0-beta1 SMP box died again on > > > make -j8 buildworld > > > with no panic or log entries. > > > > > > Is it possible that some kernel variable needs to > > > be increased? E.g. kern.maxproc, kern.maxfiles, etc. > > > Or perhaps I'm talking complete rubbish.. > > > > > > > I suggest you try again with a UP kernel - a suggestion from a > > kernel-nnob, sorry. My SMP boxes work now with UP-kernel, but they are > > really slowish although they have modern Intel C2D/Penryn cores. > > I need SMP for OpenMP codes. It's a shame if SMP is buggy, but > I guess all is down to small user base.. > Before you go down that path, which, IMHO, is as counterproductive as it is incorrect, could you, please, show the output of sysctl debug | grep panic and check whether output of savecore -vC makes sense to you. -- Alexandre Kovalenko (Олександр Коваленко)