Date: Wed, 21 Jan 2004 13:39:59 -0500
From: Ken Smith
To: Kris Kennaway
Cc: "Robin P. Blanchard", current@freebsd.org
Subject: Re: Strange behaviour
Message-ID: <20040121183959.GA25589@electra.cse.Buffalo.EDU>
In-Reply-To: <20040121182730.GB40652@xor.obsecurity.org>
References: <20040121182730.GB40652@xor.obsecurity.org>
List-Id: Discussions about the use of FreeBSD-current

On Wed, Jan 21, 2004 at 10:27:30AM -0800, Kris Kennaway wrote:
> On Wed, Jan 21, 2004 at 12:28:27PM -0500, Robin P.
Blanchard wrote:
> > I have one -CURRENT client:
> > CPU: Intel(R) Xeon(TM) CPU 2.40GHz (2392.25-MHz 686-class CPU)
> >   Origin = "GenuineIntel"  Id = 0xf27  Stepping = 7
> >
> > Features=0xbfebfbff
> > CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,
> > SSE,SSE2,SS,HTT,TM,PBE>
> >   Hyperthreading: 2 logical CPUs
> > real memory  = 1073610752 (1023 MB)
> > avail memory = 1045266432 (996 MB)
> > ACPI APIC Table:
> > FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
> >
> > which, when installing a new world (via nfs), consistently hangs at the end
> > with:
> >
> > --------------------------------------------------------------
> > >>> Rebuilding man page indices
> > --------------------------------------------------------------
> > cd /usr/src/share/man; make makedb
> > makewhatis /usr/share/man
> >
> >
> > The box is useable at this point, however. I have been simply rebooting the
> > machine, and then running the above commands by hand after the reboot. While
> > 'installworld' is hung (at the end, as above), this is in a 'top':
> >
> > 19107 root -4 0 992K 896K getblk 1 0:01 0.00% 0.00%
> > makewhatis
>
> How long has it been "hung" for?  If you have a slow network you might
> be killing it while it is doing work.
>
> Do you have rpc.lockd and statd running on both client and server?

I have the same machine (Dell 2650) and it's getting locked up in a
very similar way. You don't need to get NFS involved to have processes
get locked up in getblk. I'm slowly trying to remove variables, but so
far it seems like network activity of some sort helps cause the lockup.
The easiest way to make it lock up was doing backups through the
network. But find runs started by the nightly cron jobs can get locked
in getblk as well (while there are no NFS partitions mounted, but things
like cvsup updates of a local repo are happening).

Once things start to get locked up like this, the system slowly degrades.
I can usually ssh in and reboot it if I catch it soon enough. If I leave
it for a couple of days it will seem like it's up (rwhod is running) but
ssh-ing in won't work.

sledge (amd64 machine in the cluster) was showing similar symptoms this
morning. It had failed doing its nightly rebuild/reboot, and things like
mtree commands had been wedged for a day or two.

The Dell I have here is not really in production at all, so if there is
anything I can do here that will help, I'm game...

-- 
                                                Ken Smith
- From there to here, from here to      |       kensmith@cse.buffalo.edu
  there, funny things are everywhere.   |
                      - Theodore Geisel |
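
For anyone wanting to check a box for the same symptom before it degrades too far, here is a rough sketch (not something from the thread itself) that lists processes sleeping on the getblk wait channel. It assumes `ps -axo pid,wchan,command` is available with those keywords, as on FreeBSD; the function names `stuck_in_wchan` and `list_stuck` are made up for illustration.

```python
import subprocess


def stuck_in_wchan(ps_output, wchan="getblk"):
    """Return (pid, command) pairs whose wait channel equals `wchan`.

    `ps_output` is expected to have three whitespace-separated columns:
    pid, wait channel, command (the command may contain spaces).
    """
    stuck = []
    for line in ps_output.splitlines():
        fields = line.split(None, 2)
        # Skip blank lines, short lines, and the header row.
        if len(fields) < 3 or not fields[0].isdigit():
            continue
        if fields[1] == wchan:
            stuck.append((int(fields[0]), fields[2].strip()))
    return stuck


def list_stuck(wchan="getblk"):
    """Run ps and report processes sleeping on `wchan` (assumed ps keywords)."""
    out = subprocess.run(
        ["ps", "-axo", "pid,wchan,command"],
        capture_output=True, text=True, check=True,
    ).stdout
    return stuck_in_wchan(out, wchan)


if __name__ == "__main__":
    for pid, command in list_stuck():
        print(pid, command)
```

Run from cron every few minutes, something like this could at least show when the first process wedges, which may help narrow down what triggers it. A process that appears briefly in getblk is normal; it is only a process that stays there across several runs that matches the hang described above.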