From owner-freebsd-questions@FreeBSD.ORG Mon Jun 20 08:56:03 2005 Return-Path: X-Original-To: freebsd-questions@freebsd.org Delivered-To: freebsd-questions@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id E6BF616A41C for ; Mon, 20 Jun 2005 08:56:03 +0000 (GMT) (envelope-from matt@atopia.net) Received: from neptune.atopia.net (neptune.atopia.net [209.128.231.90]) by mx1.FreeBSD.org (Postfix) with ESMTP id BF2CB43D49 for ; Mon, 20 Jun 2005 08:56:03 +0000 (GMT) (envelope-from matt@atopia.net) Received: from [192.168.0.102] (pcp173257pcs.plsntv01.nj.comcast.net [68.46.70.16]) by neptune.atopia.net (Postfix) with ESMTP id E884F40EF for ; Mon, 20 Jun 2005 04:56:02 -0400 (EDT) Message-ID: <42B684A2.6020305@atopia.net> Date: Mon, 20 Jun 2005 04:56:02 -0400 From: Matt Juszczak User-Agent: Mozilla Thunderbird 0.9 (X11/20041129) X-Accept-Language: en-us, en MIME-Version: 1.0 To: freebsd-questions@freebsd.org Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Subject: FreeBSD Machines dieing, we've tried so much.... X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 20 Jun 2005 08:56:04 -0000 Hi all, OK, we're still having the FreeBSD machines die on us. Its two specific machines we've noticed, both FreeBSD 5.4, different hardware, different purposes. Originally, orion, our mail server, started getting kernel traps and dieing. Then, our primary ldap server, a week later, started doing it. Now they both are dieing atleast once every couple days, at random times. Orion has been up solid for five days, and Caliban (our primary ldap server) has been up for about seven, before this evening at 2:00 am when it died again. Here is the output from Caliban: http://paste.atopia.net/126. Orion has a similar message on the console when it hard locks, but the process usually says "procmail". I've never had instability problems with FreeBSD. These machines are both in the same location, but on different power supplies. They are controlled with high-level Air Conditioning. We've got three other FreeBSD 5.4 machines which haven't shown any sign of instability, but they dont receive anywhere near as much traffic as Caliban and Orion ... those servers get hammered constantly. The ONLY similarity between Orion and Caliban software-wise is that they both are involved in LDAP. Caliban acts as a primary LDAP server and Orion has LDAP configured via pam and nss. Please let me know any suggestions you can think of. The hardware is fairly new in both machines, but they are completely different kinds of boxes. Both machines are multiprocessor. Thanks in advance, Matt