Date: Wed, 20 Feb 2013 07:53:38 -0700 From: Jamie Gritton <jamie@FreeBSD.org> To: Konstantin Belousov <kostikbel@gmail.com> Cc: fs@FreeBSD.org Subject: Re: mount/kldload race Message-ID: <5124E372.1000009@FreeBSD.org> In-Reply-To: <20130220054309.GD2598@kib.kiev.ua> References: <51244A13.8030907@FreeBSD.org> <20130220054309.GD2598@kib.kiev.ua>
next in thread | previous in thread | raw e-mail | index | archive | help
On 02/19/13 22:43, Konstantin Belousov wrote:
> On Tue, Feb 19, 2013 at 08:59:15PM -0700, Jamie Gritton wrote:
>> Perhaps most people don't try to mount a bunch of filesystems at the
>> same time, at least not those that depend on kernel modules. But it
>> turns out that's going to be a pretty common situation with jails and
>> nullfs. And I found that when attempting such a feat will cause most of
>> these simultaneous mounts to fail with ENODEV.
>>
>> It turns out that the problem is a race in vfs_byname_kld(). First it'll
>> see if the fstype is loaded, and if it isn't then it will load the
>> module. But if the module is loaded by a different process between those
>> two points, the resulting EEXIST from kern_kldload() will make
>> vfs_byname_kld() error out.
>>
>> The fix is pretty simple: don't treat EEXIST as an error. By going on,
>> and rechecking for the fstype, the filesystem can be mounted while still
>> allowing any "real" error to be caught. I'm including a small patch that
>> will accomplish this, and I'd appreciate a quick look by anyone who's
>> familiar with this part of things before I commit it.
>>
>> - Jamie
>
>> Index: sys/kern/vfs_init.c
>> ===================================================================
>> --- sys/kern/vfs_init.c (revision 247000)
>> +++ sys/kern/vfs_init.c (working copy)
>> @@ -130,13 +130,18 @@
>>
>> /* Try to load the respective module. */
>> *error = kern_kldload(td, fstype,&fileid);
>> + if (*error == EEXIST) {
>> + *error = 0;
>> + fileid = 0;
> Why do you clear fileid ? Is this to prevent an attempt to kldunload()
> the module which was not loaded by the current thread ?
>
> If yes, I would suggest to use the separate flag to track this,
> which is cleared on EEXIST error. IMHO it is cleaner and less puzzling.
Yes, that's why. As a side note, I clear *error ostensibly for the sake
of the callers, but it turns out none of the callers actually look at
the returned error.
Here's a new patch with an added flag:
Index: sys/kern/vfs_init.c
===================================================================
--- sys/kern/vfs_init.c (revision 247000)
+++ sys/kern/vfs_init.c (working copy)
@@ -122,7 +122,7 @@
vfs_byname_kld(const char *fstype, struct thread *td, int *error)
{
struct vfsconf *vfsp;
- int fileid;
+ int fileid, loaded;
vfsp = vfs_byname(fstype);
if (vfsp != NULL)
@@ -130,13 +130,17 @@
/* Try to load the respective module. */
*error = kern_kldload(td, fstype, &fileid);
+ loaded = (*error == 0);
+ if (*error == EEXIST)
+ *error = 0;
if (*error)
return (NULL);
/* Look up again to see if the VFS was loaded. */
vfsp = vfs_byname(fstype);
if (vfsp == NULL) {
- (void)kern_kldunload(td, fileid, LINKER_UNLOAD_FORCE);
+ if (loaded)
+ (void)kern_kldunload(td, fileid, LINKER_UNLOAD_FORCE);
*error = ENODEV;
return (NULL);
}
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?5124E372.1000009>
