From owner-freebsd-ppc@FreeBSD.ORG Tue Apr 6 00:57:23 2010 Return-Path: Delivered-To: freebsd-ppc@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 55D0B1065672; Tue, 6 Apr 2010 00:57:23 +0000 (UTC) (envelope-from toasty@dragondata.com) Received: from mail-yw0-f171.google.com (mail-yw0-f171.google.com [209.85.211.171]) by mx1.freebsd.org (Postfix) with ESMTP id E9B668FC0C; Tue, 6 Apr 2010 00:57:21 +0000 (UTC) Received: by ywh1 with SMTP id 1so343703ywh.3 for ; Mon, 05 Apr 2010 17:57:21 -0700 (PDT) Received: by 10.100.54.14 with SMTP id c14mr4044714ana.204.1270515440660; Mon, 05 Apr 2010 17:57:20 -0700 (PDT) Received: from vpn177.ord02.your.org (vpn177.ord02.your.org [204.9.55.177]) by mx.google.com with ESMTPS id 22sm1947201iwn.12.2010.04.05.17.57.19 (version=TLSv1/SSLv3 cipher=RC4-MD5); Mon, 05 Apr 2010 17:57:19 -0700 (PDT) Mime-Version: 1.0 (Apple Message framework v1077) Content-Type: text/plain; charset=us-ascii From: Kevin Day In-Reply-To: <4BBA2BD8.9050003@freebsd.org> Date: Mon, 5 Apr 2010 19:57:18 -0500 Content-Transfer-Encoding: quoted-printable Message-Id: <2FD96EE6-1761-4040-9E5A-58A33DE1D030@dragondata.com> References: <40B1BEB2-6620-4188-BB71-F8B5ED4AA234@dragondata.com> <4BB5EE68.2040504@freebsd.org> <7F22E2B9-34FB-4E3B-981E-8D2EF73A4F64@dragondata.com> <4BB7A9B2.3080901@freebsd.org> <4BBA2BD8.9050003@freebsd.org> To: Nathan Whitehorn X-Mailer: Apple Mail (2.1077) Cc: freebsd-ppc@freebsd.org Subject: Re: Xserve G4 stability (random processes crashing) X-BeenThere: freebsd-ppc@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Porting FreeBSD to the PowerPC List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 06 Apr 2010 00:57:23 -0000 On Apr 5, 2010, at 1:28 PM, Nathan Whitehorn wrote: > Kevin Day wrote: >> On Apr 3, 2010, at 3:48 PM, Nathan Whitehorn wrote: >> =20 >>> Since you say UP kernels have the same problems, other G4 machines = seem not to have issues, and SMP G5 Xserves are completely stable, that = points at some G4 Xserve-specific piece of hardware. I'd guess the ATA = controller. Could you try chroot to an NFS volume mounted from a = known-stable machine, or a USB or Firewire disk, and trying the same = things? >>> -Nathan >>> =20 >>=20 >> Okay, i've done some more playing... The problem still happens even = if TMPDIR, /usr/src and /usr/obj are NFS mounted to another system. >>=20 >> I'm fiddling more, but I think that rules out ATA then.=20 >> The problem seems to take a long while to first appear, but once it = does appear it happens pretty fast repeatedly after that. Is it possible = the fan controls aren't working right? >> =20 > That's possible. The fan control settings are done completely by = hardware, though. Can you try with the whole system on NFS (i.e. a = chroot or netbooting)? > -Nathan Even pure NFS (running inside a jail with all of the jail chroot over = NFS) was still crashing. But, I think I may have figured out the issue... This box only has 1GB of DIMMs installed, but FreeBSD is somehow seeing = 1.25GB of RAM and is apparently trying to use it. If I put 2GB of RAM in = there, it correctly detects 2GB and (so far) buildworld is running fine = after three reboots. Mac OS X is only seeing 1GB, and seems to reliably detect that. I'm = going to do some more digging to figure out where the wrong memory size = is coming from.