From: "Rutger Bevaart"
To: freebsd-smp@freebsd.org
Date: Fri, 15 Jul 2005 15:15:24 +0200 (CEST)
Message-ID: <24434.193.172.18.3.1121433324.squirrel@193.172.18.3>
Subject: FreeBSD unstable on Dell 1750 using SMP?

Hello list,

For the past year we've been running several Dell PowerEdge 1750 servers on FreeBSD 4.10, 4.11 and 5.3. All of these machines have dual Xeons running with HT enabled. This setup has proven to be unstable: a machine will reboot without apparent reason after anywhere between 3 and 170 days. No log is written. Other machines we have with a single CPU (HT enabled) do not experience this problem.

As the problem is present in both 4.x and 5.x, and googling over the last year has not revealed similar experiences, I'm consulting this list.

All of these machines are production machines with a continuous load (not a heavy load, but a light average with some peaks), so it's not easy to experiment with HT settings etc. I dislike driving to the datacenter for locked systems with fubarred kernels ;-)

The only error I've ever seen just before a reboot is "bge0: discard frame w/o packet header" on the 5.3 machine.

Any clues or help greatly appreciated!

Regards,
Rutger Bevaart