From owner-freebsd-current@FreeBSD.ORG Sat Nov 1 18:08:06 2003 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id A759616A4CE for ; Sat, 1 Nov 2003 18:08:06 -0800 (PST) Received: from sweeper.openet-telecom.com (mail.openet-telecom.com [62.17.151.60]) by mx1.FreeBSD.org (Postfix) with ESMTP id DA7D043F75 for ; Sat, 1 Nov 2003 18:08:04 -0800 (PST) (envelope-from peter.edwards@openet-telecom.com) Received: from mail.openet-telecom.com (unverified) by sweeper.openet-telecom.com ; Sun, 2 Nov 2003 02:08:57 +0000 Received: from [194.125.183.90] by mail.openet-telecom.com with HTTP; Sun, 2 Nov 2003 02:03:34 +0000 Date: Sun, 2 Nov 2003 03:03:34 +0100 Message-ID: <3F6FE91100001039@mail.openet-telecom.com> In-Reply-To: <20031101172243.Y70057@carver.gumbysoft.com> From: peter.edwards@openet-telecom.com To: "Doug White" , current@freebsd.org MIME-Version: 1.0 Content-Type: text/plain; charset="ISO-8859-1" Content-Transfer-Encoding: quoted-printable Subject: RE: lockmgr panic on shutdown X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 02 Nov 2003 02:08:06 -0000 >I can confirm the lockmgr panic on shutdown reported by someone else >earlier (whose message I mistakenly deleted). > >It looks like swapper is trying to undo a lock from pagedaemon and runs >into trouble. This is probably related to the Giant pushdown of >vm_pageout() that alc did last week. > >I'm building with INVARIANTS to see if that will catch more info. Will >report back soon. Just happened me too. I think I see the problem: When boot() calls sync(), it passes &thread0 as the thread argument. This gets propgated up to ffs_sync, which: calls vget(), which takes a thread argument. does some stuff calls vput(), which does _not_ take a thread argument The vget() is passed thread0, as passed from boot. The vput() gets the current thread, which is the process calling boot. The unlocking in vput is asserting that the same thread that aquired the lock is releasing it, which seems reasonable. The obvious solution might be to change line 1161 of ffs_vfsops to pass vget() "curthread" rather than td. I assume there's a good reason why "thread0" is passed from boot(), but I can't see why that's of any use to the vnode locking. i.e.: Index: ffs_vfsops.c =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D RCS file: /usr/cvs/FreeBSD-CVS/src/sys/ufs/ffs/ffs_vfsops.c,v retrieving revision 1.221 diff -u -r1.221 ffs_vfsops.c --- ffs_vfsops.c 1 Nov 2003 05:51:54 -0000 1.221 +++ ffs_vfsops.c 2 Nov 2003 03:06:42 -0000 @@ -1158,7 +1158,7 @@ continue; } mtx_unlock(&mntvnode_mtx); - if ((error =3D vget(vp, lockreq, td)) !=3D 0) { + if ((error =3D vget(vp, lockreq, curthread)) !=3D 0) { mtx_lock(&mntvnode_mtx); if (error =3D=3D ENOENT) goto loop; How come tha parameters to vget and vput are lopsided like this? This might have something to do with the commit of revision 1.218 of ffs_vfsops.c, but I'm not sure.