Date: Sun, 14 Dec 2003 09:06:17 -0500 (EST) From: Jeff Roberson <jroberson@chesapeake.net> To: Don Lewis <truckman@FreeBSD.org> Cc: freebsd-current@FreeBSD.org Subject: Re: HAVE TRACE & DDB Re: FreeBSD 5.2-RC1 released Message-ID: <20031214090113.L4201-100000@mail.chesapeake.net> In-Reply-To: <200312140716.hBE7G8eF064492@gw.catspoiler.org>
next in thread | previous in thread | raw e-mail | index | archive | help
On Sat, 13 Dec 2003, Don Lewis wrote: > On 13 Dec, Don Lewis wrote: > > On 12 Dec, Jeff Roberson wrote: > > > > > >> fsync: giving up on dirty: 0xc4e18000: tag devfs, type VCHR, usecount 44, > >> writecount 0, refcount 14, flags (VI_XLOCK|VV_OBJBUF), lock type devfs: EXCL > >> (count 1) by thread 0xc20ff500 > > > > Why are we trying to reuse a vnode with a usecount of 44 and a refcount > > of 14? What is thread 0xc20ff500 doing? > > Following up to myself ... > > It looks like we're trying to recycle this vnode because of the > following sysinstall code, in distExtractTarball(): > > if (is_base && RunningAsInit && !Fake) { > unmounted_dev = 1; > unmount("/dev", MNT_FORCE); > } else > unmounted_dev = 0; > > I'm guessing that the purpose of this code is to unmount devfs from /dev > so that when the base distribution is unpacked it can populate /dev from > the tarball. This seems wrong, because it looks like the root file > system is mounted on /mnt, and devfs is also mounted on /mnt/dev ... > > What happens if we forceably umount /dev while /dev/whatever holds a > mounted file system? It looks like this is handled by vgonechrl(). It > looks to me like vclean() is going to do some scary stuff to this vnode. > > BTW, I think the root vnode is the root of the md file system, not the > root of the file system being populated by sysinstall. I don't know why > there would be anything to sync at this point, though. > > I suspect that removing the above sysinstall code will fix the immediate > problem, but there is still much I don't understand. > Excellent work! I think I may know what's wrong. If you look at rev 1.461 of vfs_subr.c I changed the semantics of cleaning a VCHR that was being unmounted. I now acquire the xlock around the operation. This may be the culprit. I'm too tired to debug this right now, but I can look at it in the am. Thanks, Jeff
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20031214090113.L4201-100000>