From owner-freebsd-current@FreeBSD.ORG Tue Oct 25 07:33:09 2005 Return-Path: X-Original-To: current@freebsd.org Delivered-To: freebsd-current@FreeBSD.ORG Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id A8B9716A41F for ; Tue, 25 Oct 2005 07:33:09 +0000 (GMT) (envelope-from danny@cs.huji.ac.il) Received: from cs1.cs.huji.ac.il (cs1.cs.huji.ac.il [132.65.16.10]) by mx1.FreeBSD.org (Postfix) with ESMTP id 3BB5D43D45 for ; Tue, 25 Oct 2005 07:33:09 +0000 (GMT) (envelope-from danny@cs.huji.ac.il) Received: from pampa.cs.huji.ac.il ([132.65.80.32]) by cs1.cs.huji.ac.il with esmtp id 1EUJIw-000KM1-UR; Tue, 25 Oct 2005 09:33:06 +0200 X-Mailer: exmh version 2.7.0 06/18/2004 with nmh-1.0.4 To: frank@exit.com In-Reply-To: Message from Frank Mayhar of "Mon, 24 Oct 2005 23:34:33 MST." <1130222074.1792.13.camel@realtime.exit.com> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Date: Tue, 25 Oct 2005 09:33:06 +0200 From: Danny Braniss Message-ID: Cc: alsbergt@cs.huji.ac.il, FreeBSD-Current Subject: Re: Race in NFS in 6.0-RC1? X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 25 Oct 2005 07:33:09 -0000 > I've started using NFS in 6.0 a little more heavily lately, as since the > em(4) wedge has been fixed I can actually use it reliably. > Unfortunately there appears to be a problem. Twice, now, in less than > 24 hours the client has paniced under load. Both times it was building > OpenOffice in an NFS-mounted /usr/ports. In case it matters, it's a > soft mount from another 6.0 box over an em(4) interface with an MTU of > 9000. > > Both times it was a panic from vnlru while trying to flush a vnode and > both times it was a null-pointer dereference in nfs_putpages() at > nfs_bio.c:301. In both cases vp->v_data was null. The vnode itself > looks fine to my eyes, although there may well be FreeBSD-specific > subtleties that I'm missing. I've just entered a PR for this problem, > kern/87967. I'll keep the cores around; if anyone wants more > information from them, let me know. As may be apparent, I can reproduce > this fairly easily, although it takes a few minutes for it to trigger. > > The worrying thing about this is, in fact, its reproducibility. This looks very similar to a problem we have with a 5.4 box running samba, it has an em(4), no jumbo packets, but is heavely doing nfs - the files are on a netapp filer. the problem is not easely reprodusable, but it happens. danny