From owner-freebsd-current@FreeBSD.ORG Mon Nov 10 08:45:04 2003 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 605C016A4CE; Mon, 10 Nov 2003 08:45:04 -0800 (PST) Received: from womble.xtaz.co.uk (82-32-25-111.cable.ubr04.azte.blueyonder.co.uk [82.32.25.111]) by mx1.FreeBSD.org (Postfix) with ESMTP id 780EC43FAF; Mon, 10 Nov 2003 08:45:03 -0800 (PST) (envelope-from matt@xtaz.co.uk) Received: from xtaz.co.uk (localhost [127.0.0.1]) by womble.xtaz.co.uk (Postfix) with ESMTP id 6406E902D7; Mon, 10 Nov 2003 16:45:02 +0000 (GMT) Message-ID: <3FAFC08D.30301@xtaz.co.uk> Date: Mon, 10 Nov 2003 16:45:01 +0000 From: Matt Smith User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.0; en-US; rv:1.5) Gecko/20031007 X-Accept-Language: en-us, en MIME-Version: 1.0 To: Robert Watson References: In-Reply-To: Content-Type: text/plain; charset=us-ascii; format=flowed Content-Transfer-Encoding: 7bit cc: Soren Schmidt cc: current@freebsd.org Subject: Re: Still getting NFS client locking up X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 10 Nov 2003 16:45:04 -0000 X-List-Received-Date: Mon, 10 Nov 2003 16:45:04 -0000 X-List-Received-Date: Mon, 10 Nov 2003 16:45:04 -0000 Robert Watson wrote: > I'm fairly baffled. I tried for many hours to reproduce the problem in > two seperate sets of systems here, and completely failed. I left > buildworlds, cvs updates, blah blah blah, running for 96 hours across > pools of clients and servers and no hint of the problem. I also use NFS > daily on my primary workstation at work, as well as in my normal > development setup with diskless crashboxes. So indeed, there must be some > very specific piece of the picture that I'm not reproducing, such as a > specific network card, or there's a race condition that requires very > specific timing, etc. > > How fast are your systems, speaking of which? I live in the world of > 300-500 mhz machines at work, and 300-800 mhz boxes at home. If you're > using multi-ghz boxes, that could well be the distinguishing factor > between our configurations... > client is an intel pentium II 300mhz with 256meg ram and 1gig of swap. server is an athlon XP 2200 with 512meg ram and 1gig of swap. I can certainly spend some time trying to get some proper debug based on what you have said in your email. I shall look into setting up a serial console etc. In the meantime another piece of information which might be helpful is this. Looking at the wtmp to see when I rebuilt my world/kernel I can see this: reboot ~ Tue Oct 21 20:44 reboot ~ Wed Oct 15 19:36 (These times are in BST which is +5 hours from east coast US). On the Oct 15th kernel NFS was working perfectly (and before that). From the Oct 21st kernel it has always locked up in this way. So something between those two dates was commited which broke this for us. Another way of me debugging this I guess is to backtrack my world to each date in between systematically and find the exact date it breaks and look at the commits. Matt.