From owner-freebsd-amd64@FreeBSD.ORG Fri Jul 31 16:50:06 2009 Return-Path: Delivered-To: freebsd-amd64@hub.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 51D50106564A for ; Fri, 31 Jul 2009 16:50:06 +0000 (UTC) (envelope-from gnats@FreeBSD.org) Received: from freefall.freebsd.org (freefall.freebsd.org [IPv6:2001:4f8:fff6::28]) by mx1.freebsd.org (Postfix) with ESMTP id 3EDDD8FC0A for ; Fri, 31 Jul 2009 16:50:06 +0000 (UTC) (envelope-from gnats@FreeBSD.org) Received: from freefall.freebsd.org (gnats@localhost [127.0.0.1]) by freefall.freebsd.org (8.14.3/8.14.3) with ESMTP id n6VGo56j068624 for ; Fri, 31 Jul 2009 16:50:05 GMT (envelope-from gnats@freefall.freebsd.org) Received: (from gnats@localhost) by freefall.freebsd.org (8.14.3/8.14.3/Submit) id n6VGo5qx068623; Fri, 31 Jul 2009 16:50:05 GMT (envelope-from gnats) Date: Fri, 31 Jul 2009 16:50:05 GMT Message-Id: <200907311650.n6VGo5qx068623@freefall.freebsd.org> To: freebsd-amd64@FreeBSD.org From: Martin W X-Mailman-Approved-At: Sat, 01 Aug 2009 14:33:31 +0000 Cc: Subject: Re: amd64/128263: [panic] 2 amd64 dl380 g5 with dual quadcore xeons, 8 and 16gb ram, crash and dump mem X-BeenThere: freebsd-amd64@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: Martin W List-Id: Porting FreeBSD to the AMD64 platform List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 31 Jul 2009 16:50:06 -0000 The following reply was made to PR amd64/128263; it has been noted by GNATS. From: Martin W To: bug-followup@FreeBSD.org, martin.wikesjo@cypoint.se Cc: Subject: Re: amd64/128263: [panic] 2 amd64 dl380 g5 with dual quadcore xeons, 8 and 16gb ram, crash and dump mem Date: Fri, 31 Jul 2009 18:30:41 +0200 Sorry for the late follow up. Since this has been rated as "serious" and with a "high" priority, yet I have recieved no real feedback I haven't put much effort into reporting anymore. Anyhow, we have been having the same issues with a few more machines now. Random spontaneous crashes. I do suspect faulty hardware, more specifically RAM or CPU. But since the errors I see don't provide me any proof I am unable to convince our hardware vendor, HP, that they are broken. I have ran the HP diagnostics for 7 loops as HP recommends, and it reports no errors. I have also recompiled the kernel on one of these machines with "options PRINTF_BUFR_SIZE=128" to see if the output would be more than just garbage, but it did not help. We will attempt to upgrade one machine to 7.2 next week to see if it will produce better error logs if/when they crash again(or maybe we'll be incredibly lucky and its a software bug that is now fixed). FWIW, these machine are part of a large online gaming platform. It has well over 100 more of these machines with the same hardware and FreeBSD setup. If someone could look into this that would be much appreciated.