From owner-freebsd-questions@FreeBSD.ORG Sat Apr 21 11:54:49 2007 Return-Path: X-Original-To: freebsd-questions@freebsd.org Delivered-To: freebsd-questions@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id D96F416A400 for ; Sat, 21 Apr 2007 11:54:49 +0000 (UTC) (envelope-from dzila@tassadar.physics.auth.gr) Received: from tassadar.physics.auth.gr (tassadar.physics.auth.gr [155.207.123.25]) by mx1.freebsd.org (Postfix) with ESMTP id 5B6CF13C45A for ; Sat, 21 Apr 2007 11:54:49 +0000 (UTC) (envelope-from dzila@tassadar.physics.auth.gr) Received: from tassadar.physics.auth.gr (IDENT:1000@localhost [127.0.0.1]) by tassadar.physics.auth.gr (8.13.7/8.13.6) with ESMTP id l3LBslPc012110 for ; Sat, 21 Apr 2007 14:54:47 +0300 Received: from localhost (dzila@localhost) by tassadar.physics.auth.gr (8.13.7/8.13.3/Submit) with ESMTP id l3LBsk6D012107 for ; Sat, 21 Apr 2007 14:54:47 +0300 Date: Sat, 21 Apr 2007 14:54:46 +0300 (EEST) From: Dimitris Zilaskos To: freebsd-questions@freebsd.org In-Reply-To: Message-ID: References: MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed X-Virus-Scanned: ClamAV 0.88/3141/Fri Apr 20 23:23:13 2007 on tassadar.physics.auth.gr X-Virus-Status: Clean Subject: Re: random hangs/reboots with Dell servers X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 21 Apr 2007 11:54:49 -0000 Thnx to everyone for your replies, A colleague has provided me with his hand notes of an older crash screen, it has the following(however i cant guarantee it is accurate, it is handnotes). Fatal trap 12: page fault while in kernel mode cpuid=0; apicid=00 fault virtual address=0xac fault code=supervisor write,page not present instruction pointer=0x20:0x current process 79962 trap numbers : 12 panic: pagefault cpuid=1 uptime=6d7423m55 I do not believe the problems are related to envriroment or electricity, since during the period the problems occured we have switched data center, and in addition to dell systems there are 150 more nodes from various vendors (HP mostly, but also IBM, supermicro, SUN, and various assembled towers), and none has shown similar behaviour. We dont run FreeBSD on them though. We have a Dell 2850 with Windows 2003 that has been running rock solid for at least 1 year. And the 1750 that under FreeBSD 5 would sometimes crash even under no load, with RHEL 4 pushes 60 Mbps of ftp data 24/7 with ease for the last year without any problems. Disabling everything from BIOS was one of our first moves, though we havent disabled usb since sometimes we need to connect a keyboard. And no IPMI is running on a public interface:) Apart from all the nodes being SMP and Dell, I cannot think of anything else in common. Some are SCSI, some are SATA. All have a number of jails. Memory size is 2 GB (the 1750), the others have 4 GB. I have also asked Dell for some help, though they told me freebsd is not certified by Dell, they will try to look into it. -- ============================================================================ Dimitris Zilaskos Department of Physics @ Aristotle University of Thessaloniki , Greece PGP key : http://tassadar.physics.auth.gr/~dzila/pgp_public_key.asc http://egnatia.ee.auth.gr/~dzila/pgp_public_key.asc MD5sum : de2bd8f73d545f0e4caf3096894ad83f pgp_public_key.asc ============================================================================