From owner-freebsd-hackers@FreeBSD.ORG Tue Sep 25 17:51:37 2007 Return-Path: Delivered-To: freebsd-hackers@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 9047216A420 for ; Tue, 25 Sep 2007 17:51:37 +0000 (UTC) (envelope-from kris@FreeBSD.org) Received: from weak.local (hub.freebsd.org [IPv6:2001:4f8:fff6::36]) by mx1.freebsd.org (Postfix) with ESMTP id C894113C469; Tue, 25 Sep 2007 17:51:36 +0000 (UTC) (envelope-from kris@FreeBSD.org) Message-ID: <46F94AAA.5030504@FreeBSD.org> Date: Tue, 25 Sep 2007 19:51:38 +0200 From: Kris Kennaway User-Agent: Thunderbird 2.0.0.6 (Macintosh/20070728) MIME-Version: 1.0 To: Benjie Chen References: <46F8D12E.7060202@FreeBSD.org> In-Reply-To: Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Cc: freebsd-hackers@freebsd.org Subject: Re: Kernel panic on PowerEdge 1950 under certain stress load X-BeenThere: freebsd-hackers@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Technical Discussions relating to FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 25 Sep 2007 17:51:37 -0000 Benjie Chen wrote: > You are right, they may not be the same. From first look it seems like > they are similar based on the description of the problems -- system > stable, then under load related to network, get panic after different > time intervals. I just assumed that kernel is typically stable enough > that this kind of panic are rare (been using FBSD for 7 or 8 years now > and in heavy loads as well, never had kernel panics to deal with). OK, that means it is likely it has absolutely nothing in common. > I did reboot the system and set mpsafenet to 0 and I have not had a > crash since then (almost a day) running the same load, so that's > positive: at least it may be that that's the workaround, and I don't > need Dell to send me new memory modules to try... That should only be considered a temporary workaround while you continue to obtain debugging information to solve the problem for good. e.g. support for mpsafenet=0 has already been removed from 7.0. > Kris or Ivan: I was wondering if you could briefly explain what your > guess the problem might be. I am curious what the cause of the problem > is. E.g. it seems like a race condition, but I am curious to know more > of the details... It is impossible to say until you obtain some actual details about the panic :) Kris