From: Robert Watson
Date: Wed, 5 Jul 2006 16:18:08 +0100 (BST)
To: Michael Collette
Cc: freebsd-stable@freebsd.org, User Freebsd
Subject: Re: NFS Locking Issue
Message-ID: <20060705161559.W18236@fledge.watson.org>
In-Reply-To: <44A99CC1.7070501@TestEquity.com>

On Mon, 3 Jul 2006, Michael Collette wrote:

> ---------------------------------------------------------------------
> Let's start with the simplest. The scenario here involves 2 machines, mach01
> and mach02. Both are running 6-STABLE, and both are running rpcbind,
> rpc.statd, and rpc.lockd. mach01 has exported /documents and mach02 is
> mounting that export under /mnt. Simple enough?
>
> The /documents directory has multiple subdirectories and files of various
> sizes. The actual amount of data doesn't really matter to produce a failure.
> All you need to do at this point is to try to copy files from that mount
> point to somewhere else on the hard drive.
>
> cp -Rp /mnt/* /tmp/documents/
>
> You may, or not, see that a couple of subdirectories were created, but no
> files actually moved over. The cp command is now locked up, and no traffic
> moves. This usually takes a second or two to show up as a problem. I can
> repeat this with multiple 6-STABLE boxes.
>
> Turn off rpc.lockd on either the server or client before the cp command, and
> things work.

I've tried several times to reproduce this, and have not succeeded in doing
so. In principle, cp should not be using advisory locks. Could you try
running cp under ktrace, and saving the ktrace file somewhere outside of NFS?
Something like the following:

   ktrace -f /usr/tmp/localfile cp -Rp /mnt/* /tmp/documents/

If you are able to reproduce the problem with tracing turned on, a copy of
the tracefile would be very helpful. Also, when it locks up, are you able to
kill cp using Ctrl-C, and if you hit Ctrl-T while it appears locked, what
output do you get?

Thanks,

Robert N M Watson
Computer Laboratory
University of Cambridge