Date: Fri, 15 Apr 2005 10:18:16 -0700 (PDT)
From: Doug White
To: Eirik Øverby
cc: amd64@freebsd.org
Subject: Re: Mysterious hangs
Message-ID: <20050415100931.O34838@carver.gumbysoft.com>

On Wed, 13 Apr 2005, Eirik Øverby wrote:

> Sorry, this was meant for the amd64 list.
> Got another report for this list. Confused the addresses.
>
> On 13-04-05 07:28, "Eirik Øverby" wrote:
>
> > Hi,
> >
> > For some time I've been seeing infrequent and seemingly random hangs on my
> > dual opteron system. Motherboard is Tyan K8S Pro. Until now I thought it was a
> > "hard" hang, but I've got serial console hooked up and BREAK_TO_DEBUGGER, DDB
> > and KDB in the kernel, and when sending a break through telnet I get:
> >
> > telnet> send brk
> > KDB: enter: Line break on console

I have a rev-D HDAMA that hangs like this.
If you set debug.kdb.stop_cpus=0 then ddb will enter, which leads me to
believe one of the CPUs is going out to lunch.

I had a rev-G HDAMA on which upgrading the BIOS fixed this, so you might
try that. I can only assume the BIOS difference includes a tweak to a
timing somewhere; I've been wanting to mess with the HyperTransport clocks
but haven't had time, but HT problems would likely cause weird CPU
communication issues.

I have an S2881 that does not exhibit the problem.

> > And nothing more. Which means the box is sort of alive, but fails to present
> > even the kernel debugger. The machine is currently on 5.4 as of yesterday.
> > It's been hanging before (with anything from days to weeks between), under
> > various variations of 5.x, but I haven't had the serial console before now.
> >
> > Has any one seen this before? Any ideas what I could look at? I suspected
> > hardware, but it seems that isn't the case as most HW errors I've seen would
> > cause a total freeze.
> >
> > Here's to hoping.
> >
> > Thanks,
> > /Eirik

-- 
Doug White                    |  FreeBSD: The Power to Serve
dwhite@gumbysoft.com          |  www.FreeBSD.org