From owner-freebsd-fs  Sat Mar  4 15:40:36 2000
Delivered-To: freebsd-fs@freebsd.org
Received: from knecht.Sendmail.ORG (knecht.sendmail.org [209.31.233.176])
	by hub.freebsd.org (Postfix) with ESMTP
	id AB09437B8D6; Sat,  4 Mar 2000 15:40:32 -0800 (PST)
	(envelope-from mckusick@flamingo.McKusick.COM)
Received: from flamingo.McKusick.COM (root@flamingo.mckusick.com [209.31.233.178])
	by knecht.Sendmail.ORG (8.10.0.Beta10/8.10.0.Beta10) with ESMTP id e24NeLJ33231;
	Sat, 4 Mar 2000 15:40:21 -0800 (PST)
Received: from flamingo.McKusick.COM (mckusick@localhost [127.0.0.1])
	by flamingo.McKusick.COM (8.9.3/8.9.0) with ESMTP id LAA16343;
	Sat, 4 Mar 2000 11:34:00 -0800 (PST)
Message-Id: <200003041934.LAA16343@flamingo.McKusick.COM>
To: Alfred Perlstein <bright@wintelcom.net>
Subject: Re: changing mount options still can cause damage? 
Cc: Terry Lambert <tlambert@primenet.com>, fs@freebsd.org,
	jkh@freebsd.org, Bruce Evans <bde@zeta.org.au>
In-reply-to: Your message of "Tue, 22 Feb 2000 20:46:11 PST."
             <20000222204610.I21720@fw.wintelcom.net> 
Date: Sat, 04 Mar 2000 11:34:00 -0800
From: Kirk McKusick <mckusick@flamingo.McKusick.COM>
Sender: owner-freebsd-fs@FreeBSD.ORG
Precedence: bulk
X-Loop: FreeBSD.org

OK, I have finally had a chance to investigate these issues at
greater length and have come to the following conclusions:

1) As suggested below, it would be useful to ffs_flushfiles
(or softdep_flushfiles as appropriate) when changing from
async -> noasync/sync so that upon return from the mount
command the user knows that the filesystem is safe.

2) In reviewing my bug logs for FFS I have found the `corruption'
case to which I believe the bug entry in the manual page was
alluding. It is possible to get lost inodes in a filesystem
that has been downgraded to read-only even if it never ran
in async mode. The senario causing trouble goes as follows:

	A) a process opens a file for reading.
	B) the file is unlinked
	C) the filesystem is downgraded to read-only
	D) the process referencing the now unlinked
	   file exits or closes the file.

In this case, the the inode cannot be freed as the filesystem
is now in read-only mode. Corruption of this sort is not
particularly threatening as the lost inodes will be cleaned
up the next time `fsck -p' is run, but the resulting loss
of space may be annoying if the filesystem is nearly full.
The alternative is to vgone files with link counts of zero
when doing a (forcible) downgrade just as we do with files
that are open for writing. This would result in the inode
being released and the process seeing a dead file (again
just as it would for a file open for writing). This seems
a slightly odd semantic for a file open for reading, so I
have not done it. Does anyone have any views on whether the
filesystem should be changed in this way on forcible
downgrades?

	Kirk

=-=-=-=-=-=-=-=

Date: Tue, 22 Feb 2000 20:46:11 -0800
From: Alfred Perlstein <bright@wintelcom.net>
To: Kirk McKusick <mckusick@flamingo.McKusick.COM>
Cc: Terry Lambert <tlambert@primenet.com>, fs@freebsd.org, jkh@freebsd.org
Subject: Re: changing mount options still can cause damage?

* Kirk McKusick <mckusick@flamingo.McKusick.COM> [000222 19:14] wrote:
> A downgrade from read-write to read-only does a complete flush
> of the filesystem before setting the clean bit in the superblock.
> So, even if you have been running async before or even during
> the period that you do the downgrade to read-only, you will
> not trash the filesystem. I do not believe that you are lying
> to anybody by deleting the commentary about cycling between
> read-only and read-write. Appropriate warnings about async are
> called for, however the only warning necessary about cycling
> between sync and async is that the danger of async does not
> go away for several minutes after you have cycled to sync.
> 
> 	~Kirk

You're saying the exact opposite of what Bruce and Luoqi said,
they both say that updating the mount from async -> noasync/sync
is safe because of the flush_files call.

Looking at the code you seem right...

("ffs/ffs_vfsops.c" line 186 of 1283)

	if (mp->mnt_flag & MNT_UPDATE) {
		ump = VFSTOUFS(mp);
		fs = ump->um_fs;
		devvp = ump->um_devvp;
		err = 0;
		ronly = fs->fs_ronly;	/* MNT_RELOAD might change this */
		if (ronly == 0 && (mp->mnt_flag & MNT_RDONLY)) {
			flags = WRITECLOSE;
			if (mp->mnt_flag & MNT_FORCE)
				flags |= FORCECLOSE;
			if (mp->mnt_flag & MNT_SOFTDEP) {
				err = softdep_flushfiles(mp, flags, p);
			} else {
				err = ffs_flushfiles(mp, flags, p);
			}
			ronly = 1;
		}

It sure looks like it forces the structures to disk because
ffs_flushfiles calls VOP_FSYNC on the filesystem dev vp.  Which I
would assume fixes up link counts for inodes possibly opened, but
deleted before the the filesystem is set to read-only.

ufs_close calls ufs_itimes which avoids attempting to update the
inode access time if the fs is readonly, so does ufs_inactive.

However the async -> noasync/sync doesn't do the same (fsync the
device vp), shouldn't it, and if it did, wouldn't that fix the
problem?  It's not like doing a fsync on the whole filesystem
at that point would be a common occurance.

I think you're correct however I'm assuming you've seen the case
brought up by Bruce and Luoqi, specifically the unlink and downgrade
to read-only.

thanks,
-- 
-Alfred Perlstein - [bright@wintelcom.net|alfred@freebsd.org]

=-=-=-=-=-=-=-=-=

Date: Fri, 25 Feb 2000 00:57:42 +1100 (EST)
From: Bruce Evans <bde@zeta.org.au>
To: Alfred Perlstein <bright@wintelcom.net>
cc: Kirk McKusick <mckusick@flamingo.McKusick.COM>,
        Terry Lambert <tlambert@primenet.com>, fs@FreeBSD.ORG, jkh@FreeBSD.ORG
Subject: Re: changing mount options still can cause damage?

On Tue, 22 Feb 2000, Alfred Perlstein wrote:

> * Kirk McKusick <mckusick@flamingo.McKusick.COM> [000222 19:14] wrote:
> > ... Appropriate warnings about async are
> > called for, however the only warning necessary about cycling
> > between sync and async is that the danger of async does not
> > go away for several minutes after you have cycled to sync.
> > 
> > 	~Kirk
> 
> You're saying the exact opposite of what Bruce and Luoqi said,
> they both say that updating the mount from async -> noasync/sync
> is safe because of the flush_files call.
> 
> Looking at the code you seem right...

> However the async -> noasync/sync doesn't do the same (fsync the
> device vp), shouldn't it, and if it did, wouldn't that fix the
> problem?  It's not like doing a fsync on the whole filesystem
> at that point would be a common occurance.

Copying the code in sync() and changing MNT_NOWAIT to MNT_WAIT in it
should work, modulo locking problems.  This applies to both async ->
noasync and nosync -> sync.

Bruce


To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-fs" in the body of the message