From owner-freebsd-stable@FreeBSD.ORG Mon Jun 27 08:19:40 2005 Return-Path: X-Original-To: freebsd-stable@FreeBSD.org Delivered-To: freebsd-stable@FreeBSD.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 5386C16A41C for ; Mon, 27 Jun 2005 08:19:40 +0000 (GMT) (envelope-from glebius@FreeBSD.org) Received: from relay.bestcom.ru (relay.bestcom.ru [217.72.144.5]) by mx1.FreeBSD.org (Postfix) with ESMTP id B689A43D1F for ; Mon, 27 Jun 2005 08:19:39 +0000 (GMT) (envelope-from glebius@FreeBSD.org) Received: from cell.sick.ru (root@cell.sick.ru [217.72.144.68]) by relay.bestcom.ru (8.13.1/8.12.9) with ESMTP id j5R8JZ9K077808 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=FAIL); Mon, 27 Jun 2005 12:19:36 +0400 (MSD) (envelope-from glebius@FreeBSD.org) Received: from cell.sick.ru (glebius@localhost [127.0.0.1]) by cell.sick.ru (8.13.1/8.12.8) with ESMTP id j5R8JYwG097899 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Mon, 27 Jun 2005 12:19:35 +0400 (MSD) (envelope-from glebius@FreeBSD.org) Received: (from glebius@localhost) by cell.sick.ru (8.13.1/8.13.1/Submit) id j5R8JX8e097898; Mon, 27 Jun 2005 12:19:33 +0400 (MSD) (envelope-from glebius@FreeBSD.org) X-Authentication-Warning: cell.sick.ru: glebius set sender to glebius@FreeBSD.org using -f Date: Mon, 27 Jun 2005 12:19:33 +0400 From: Gleb Smirnoff To: Matt Juszczak Message-ID: <20050627081933.GA97832@cell.sick.ru> References: <42BF8815.6090909@atopia.net> Mime-Version: 1.0 Content-Type: text/plain; charset=koi8-r Content-Disposition: inline In-Reply-To: <42BF8815.6090909@atopia.net> User-Agent: Mutt/1.5.6i X-Virus-Scanned: ClamAV version devel-20050125, clamav-milter version 0.80ff on relay.bestcom.ru X-Virus-Status: Clean Cc: freebsd-stable@FreeBSD.org Subject: Re: FreeBSD -STABLE servers repeatedly crashing. X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 27 Jun 2005 08:19:40 -0000 On Mon, Jun 27, 2005 at 01:01:09AM -0400, Matt Juszczak wrote: M> About three weeks ago, I upgraded my 5.3-RELEASE boxes to 5.4-RELEASE. M> I also turned on procmail globally on our mail server. Here is our M> current FreeBSD server setup: M> M> URANUS - primary ldap M> CALIBAN - secondary ldap M> ORION - primary mail M> M> Orion was the first one to crash, about three weeks ago. Orion is M> constantly talking to uranus, because uranus is our primary ldap server M> (we have a planet scheme), and caliban is our secondary ldap server. I M> ran an email flood test on orion to see if I could crash it again. This M> time, the high requests on Uranus caused Uranus to crash. With two M> different servers on two different hardware setups crashing, I had to M> start thinking of what could be causing the problem. M> M> Memory tests on both servers came back OK. Orion had some ECC errors M> which it was able to fix. I wasn't able to catch orion's first crash, M> but I was able to catch uranus's first crash: M> M> http://paste.atopia.net/126 Can you please build kernel with debugging and obtain a crashdump? -- Totus tuus, Glebius. GLEBIUS-RIPN GLEB-RIPE