From owner-freebsd-questions@FreeBSD.ORG Thu Apr 19 15:45:28 2007
Message-Id: <6.0.0.22.2.20070419103724.025a8fd8@mail.computinginnovations.com>
Date: Thu, 19 Apr 2007 10:44:00 -0500
To: Dimitris Zilaskos, freebsd-questions@freebsd.org
From: Derek Ragona
Subject: Re: random hangs/reboots with Dell servers

At 05:54 AM 4/19/2007, Dimitris Zilaskos wrote:
>  Dear all,
>
>I am trying to understand some long-standing issues we have with FreeBSD
>and Dell servers.
>
>Over the last 3 years we have installed FreeBSD 5.x and 6.x, with the currently
>deployed version being 6.1, on a variety of Dell rack-mounted systems.
>
>The Dell systems used so far are PowerEdge 1750, 2950 (both SCSI), and
>SC1425 (SATA). All of them are dual-CPU Xeon systems.
>
>All these systems serve as mail/web servers, with 2 to 15 jails.
>
>Installation has always proceeded normally without problems. However,
>after a few months of operation, all of these systems, purchased at
>different times during the last 3 years, begin rebooting randomly
>or freezing completely.
>
>These reboots/freezes at first occur once every 6 months, then
>gradually move to once per month, and normally stabilize at around
>once per week, although on the 1750 system it once happened
>twice in a day.
>
>Load does not seem to matter: even after shutting down all services
>on the servers, random reboots still occurred.
>
>So far we have tried various tricks dug out of the archives, like disabling
>ACPI and HT, but nothing changed.
>
>We have migrated some systems that had these issues to a RHEL-compatible OS,
>and they run rock solid under heavy load.
>
>Right now I have enabled kernel crash dumps and I am waiting for the next
>crash. But I understand a lot of people use FreeBSD with Dell servers, and
>I would like to hear how to tackle the situation we are facing.

First, make sure you are up to date on the FreeBSD version you are running,
and that it is still a supported release. If not, update your src and
rebuild everything.

For the hardware, I'd run Dell's complete diagnostics on one of these
servers, along with any stress tests available. If the hardware all checks
out OK, I would look for an environmental cause such as heat. Heat can
cause hardware problems that wouldn't show up otherwise.

If neither of these looks like the cause, then you may need to swap out a
system board or RAM, as it must be a hardware issue.

         -Derek
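
For the "update your src and rebuild everything" step above, here is a
minimal sketch of the usual order of operations, assuming the standard
source-upgrade procedure described in /usr/src/UPDATING and the FreeBSD
Handbook. The function name rebuild_plan, the GENERIC kernel config, and
the /usr/src path are illustrative assumptions, not details from this
thread. Syncing the source tree itself (for example with cvsup) is left
out. The script only prints a checklist; it does not run anything.

```python
def rebuild_plan(kernconf="GENERIC", src_dir="/usr/src"):
    """Return the rebuild steps as (description, command) pairs.

    kernconf and src_dir are illustrative defaults (assumptions).
    A command of None marks a step that has to be done by hand.
    """
    make = f"cd {src_dir} && make"
    return [
        ("build the new userland", f"{make} buildworld"),
        ("build the new kernel", f"{make} buildkernel KERNCONF={kernconf}"),
        ("install the new kernel", f"{make} installkernel KERNCONF={kernconf}"),
        ("reboot into single-user mode", None),
        ("merge config changes needed before install", "mergemaster -p"),
        ("install the new userland", f"{make} installworld"),
        ("merge the remaining config files", "mergemaster"),
        ("reboot into the updated system", None),
    ]


if __name__ == "__main__":
    # Print a numbered checklist; nothing is executed.
    for number, (description, command) in enumerate(rebuild_plan(), start=1):
        shown = command if command is not None else "(manual step)"
        print(f"{number}. {description}: {shown}")
```

Keeping the steps as data rather than running them directly makes it
easy to review the plan on one server before repeating it on the others.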