From: Rick Macklem
To: josh carroll
Cc: freebsd-stable@freebsd.org, kib, jhb@freebsd.org
Date: Fri, 5 Nov 2010 17:04:50 -0400 (EDT)
Subject: Re: NFS deadlock (unkillable nfsd and no mounts work)
Message-ID: <1563115557.181398.1288991090198.JavaMail.root@erie.cs.uoguelph.ca>

> >> I'm having a problem with nfsd hanging and not serving mount points,
> >> during which time it cannot be killed.
> >> This problem started happening sometime after November 2nd, since a
> >> kernel from 11/2 sources does not exhibit this problem.
> >
> > Please try the attached patch, rick
>
> Thanks! I had to manually patch for some reason, but I can confirm
> that nfsd is now well-behaved with your patch applied. I tested a
> couple of different mounts and played two separate files on the
> Popcorn Hour (one lower bitrate, the other higher bitrate) and both
> played without a hiccup. While those were playing I was also able to
> automount my home directory on the macbook and move around my home
> directory.
>
> So it looks like this patch did the trick. Thanks Rick, really
> appreciate the fast response. Is there a reason why this doesn't seem
> to be getting reported a lot? What is particular in my setup that
> broke it?
>
Well, the commit that broke things just hit stable/8 on Nov. 3. Also, I'm
not sure what scenarios would have caused the breakage. I think it would
be something like a file system where vget worked that dropped out of the
loop just after looking up "." or ".." at the root, so that the nvp
remained locked. But I'm not sure what the exact scenarios are. (Holding
the shared lock shouldn't have stopped further VFS_VGET()s from
succeeding, I think?)

> > ps: Starting about Monday I won't be able to do commits for about 3 weeks
> >    so, if this patch works, could someone else please commit it,
> >    thanks,
> >    rick
> >
>
> If someone can commit this, I'd really appreciate it. I will report
> back if I notice any problems, but I imagine this would probably get
> fixed in HEAD first, then MFC'd anyway, right? Unless this is already
> fixed in HEAD.
>
The patch isn't in head, but hopefully someone like kib@ or jhb@ can do
it, since I won't be able to MFC it before code freeze. They might have
a better patch?

Anyhow, good to hear it fixes the problem. Thanks for reporting the
problem and testing the patch, rick