From owner-freebsd-stable@FreeBSD.ORG Tue Jun 28 14:49:57 2005 Return-Path: X-Original-To: freebsd-stable@FreeBSD.org Delivered-To: freebsd-stable@FreeBSD.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id C26B116A41C; Tue, 28 Jun 2005 14:49:57 +0000 (GMT) (envelope-from matt@atopia.net) Received: from neptune.atopia.net (neptune.atopia.net [209.128.231.90]) by mx1.FreeBSD.org (Postfix) with ESMTP id 995A143D49; Tue, 28 Jun 2005 14:49:57 +0000 (GMT) (envelope-from matt@atopia.net) Received: from [192.168.0.102] (pcp173257pcs.plsntv01.nj.comcast.net [68.46.70.16]) by neptune.atopia.net (Postfix) with ESMTP id BC86E4120; Tue, 28 Jun 2005 10:49:56 -0400 (EDT) Message-ID: <42C16394.4040904@atopia.net> Date: Tue, 28 Jun 2005 10:49:56 -0400 From: Matt Juszczak User-Agent: Mozilla Thunderbird 0.9 (X11/20041129) X-Accept-Language: en-us, en MIME-Version: 1.0 To: Gleb Smirnoff References: <42BF8815.6090909@atopia.net> <20050627081933.GA97832@cell.sick.ru> In-Reply-To: <20050627081933.GA97832@cell.sick.ru> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Cc: freebsd-stable@FreeBSD.org Subject: Re: FreeBSD -STABLE servers repeatedly crashing. X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 28 Jun 2005 14:49:57 -0000 Gleb Smirnoff wrote: >On Mon, Jun 27, 2005 at 01:01:09AM -0400, Matt Juszczak wrote: >M> About three weeks ago, I upgraded my 5.3-RELEASE boxes to 5.4-RELEASE. >M> I also turned on procmail globally on our mail server. Here is our >M> current FreeBSD server setup: >M> >M> URANUS - primary ldap >M> CALIBAN - secondary ldap >M> ORION - primary mail >M> >M> Orion was the first one to crash, about three weeks ago. Orion is >M> constantly talking to uranus, because uranus is our primary ldap server >M> (we have a planet scheme), and caliban is our secondary ldap server. I >M> ran an email flood test on orion to see if I could crash it again. This >M> time, the high requests on Uranus caused Uranus to crash. With two >M> different servers on two different hardware setups crashing, I had to >M> start thinking of what could be causing the problem. >M> >M> Memory tests on both servers came back OK. Orion had some ECC errors >M> which it was able to fix. I wasn't able to catch orion's first crash, >M> but I was able to catch uranus's first crash: >M> >M> http://paste.atopia.net/126 > >Can you please build kernel with debugging and obtain a crashdump? > > > > Ever since I setup the debug kernel the machine is now crashing every 12 hours. I think I have to switch to OpenBSD or 4.11 FreeBSD because this box can't keep crashing. It refuses to do a crash dump. -Matt