From owner-freebsd-current@FreeBSD.ORG Tue Oct 25 06:34:35 2005 Return-Path: X-Original-To: current@freebsd.org Delivered-To: freebsd-current@FreeBSD.ORG Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id DBC5116A41F for ; Tue, 25 Oct 2005 06:34:35 +0000 (GMT) (envelope-from frank@exit.com) Received: from tinker.exit.com (tinker.exit.com [206.223.0.1]) by mx1.FreeBSD.org (Postfix) with ESMTP id 632BB43D46 for ; Tue, 25 Oct 2005 06:34:35 +0000 (GMT) (envelope-from frank@exit.com) Received: from realtime.exit.com (realtime [206.223.0.5]) by tinker.exit.com (8.13.4/8.13.4) with ESMTP id j9P6YZQd056448 for ; Mon, 24 Oct 2005 23:34:35 -0700 (PDT) (envelope-from frank@exit.com) Received: from realtime.exit.com (localhost [127.0.0.1]) by realtime.exit.com (8.13.4/8.13.4) with ESMTP id j9P6YYUO004499 for ; Mon, 24 Oct 2005 23:34:34 -0700 (PDT) (envelope-from frank@exit.com) Received: (from frank@localhost) by realtime.exit.com (8.13.4/8.13.4/Submit) id j9P6YY0V004496 for current@freebsd.org; Mon, 24 Oct 2005 23:34:34 -0700 (PDT) (envelope-from frank@exit.com) X-Authentication-Warning: realtime.exit.com: frank set sender to frank@exit.com using -f From: Frank Mayhar To: FreeBSD-Current Content-Type: text/plain Content-Transfer-Encoding: 7bit Organization: Exit Consulting Date: Mon, 24 Oct 2005 23:34:33 -0700 Message-Id: <1130222074.1792.13.camel@realtime.exit.com> Mime-Version: 1.0 X-Mailer: Evolution 2.2.3 FreeBSD GNOME Team Port X-Virus-Scanned: ClamAV 0.86.2/1146/Thu Oct 20 21:36:48 2005 on tinker.exit.com X-Virus-Status: Clean Cc: Subject: Race in NFS in 6.0-RC1? X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: frank@exit.com List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 25 Oct 2005 06:34:36 -0000 I've started using NFS in 6.0 a little more heavily lately, as since the em(4) wedge has been fixed I can actually use it reliably. Unfortunately there appears to be a problem. Twice, now, in less than 24 hours the client has paniced under load. Both times it was building OpenOffice in an NFS-mounted /usr/ports. In case it matters, it's a soft mount from another 6.0 box over an em(4) interface with an MTU of 9000. Both times it was a panic from vnlru while trying to flush a vnode and both times it was a null-pointer dereference in nfs_putpages() at nfs_bio.c:301. In both cases vp->v_data was null. The vnode itself looks fine to my eyes, although there may well be FreeBSD-specific subtleties that I'm missing. I've just entered a PR for this problem, kern/87967. I'll keep the cores around; if anyone wants more information from them, let me know. As may be apparent, I can reproduce this fairly easily, although it takes a few minutes for it to trigger. The worrying thing about this is, in fact, its reproducibility. -- Frank Mayhar frank@exit.com http://www.exit.com/ Exit Consulting http://www.gpsclock.com/ http://www.exit.com/blog/frank/