Date:      Wed, 10 Dec 2008 00:56:04 +0100
From:      "Paul B. Mahol" <onemda@gmail.com>
To:        "John Baldwin" <jhb@freebsd.org>
Cc:        freebsd-current@freebsd.org
Subject:   Re: [PATCH] MPSAFE/LOOKUP_SHARED cd9660
Message-ID:  <3a142e750812091556x682008dagb8e1e867d0bdca79@mail.gmail.com>
In-Reply-To: <3a142e750812091516n6bb0880ewb298314e9ba296c@mail.gmail.com>
References:  <200811191510.53793.jhb@FreeBSD.org> <200812051608.50120.jhb@freebsd.org> <3a142e750812051354n747bcbcayb31d8d5f4cc15098@mail.gmail.com> <200812091602.10850.jhb@freebsd.org> <3a142e750812091515u3f2e5807j3652f2cc3551a3b2@mail.gmail.com> <3a142e750812091516n6bb0880ewb298314e9ba296c@mail.gmail.com>

On 12/10/08, Paul B. Mahol <onemda@gmail.com> wrote:
> On 12/10/08, Paul B. Mahol <onemda@gmail.com> wrote:
>> On 12/9/08, John Baldwin <jhb@freebsd.org> wrote:
>>> On Friday 05 December 2008 04:54:23 pm Paul B. Mahol wrote:
>>>> On 12/5/08, John Baldwin <jhb@freebsd.org> wrote:
>>>> > On Friday 05 December 2008 03:56:31 pm Paul B. Mahol wrote:
>>>> >> On 12/5/08, John Baldwin <jhb@freebsd.org> wrote:
>>>> >> > On Thursday 20 November 2008 05:47:28 pm John Baldwin wrote:
>>>> >> >> On Thursday 20 November 2008 04:30:57 pm Paul B. Mahol wrote:
>>>> >> >> > On 11/19/08, John Baldwin <jhb@freebsd.org> wrote:
>>>> >> >> > > This is a relatively simple patch to mark cd9660 MPSAFE and
>>>> >> >> > > enable shared lookups.  The changes to cd9660_lookup() mirror
>>>> >> >> > > similar changes to ufs_lookup() to use static variables for
>>>> >> >> > > local data rather than abusing i-node members of the parent
>>>> >> >> > > directory.  I've done some light testing of this, but not
>>>> >> >> > > super-strenuous.  This patch also includes simple locking for
>>>> >> >> > > the iconv support in the kernel.  That locking uses an sx lock
>>>> >> >> > > to serialize open and close of translator tables and the
>>>> >> >> > > associated refcount.  Actual conversions do not need any
>>>> >> >> > > locks, however, as the mount holds a reference on the table.
>>>> >> >> > >
>>>> >> >> > > http://www.FreeBSD.org/~jhb/patches/cd9660_mpsafe.patch
>>>> >> >> > >
>>>> >> >> >
>>>> >> >> > With this patch I'm unable to kldunload libiconv.ko once it is
>>>> >> >> > loaded.  And trying to kldunload libiconv.ko will make any
>>>> >> >> > subsequent kldload/kldstat/kldunload hang, waiting forever
>>>> >> >> > (livelock).
>>>> >> >> >
>>>> >> >> > No regressions were encountered while only cd9660.ko was
>>>> >> >> > kldloaded.
>>>> >> >>
>>>> >> >> So this is actually due to a bug in the module code.  If you have
>>>> >> >> two modules like this:
>>>> >> >>
>>>> >> >> DECLARE_MODULE(foo, SI_SUB_DRIVERS, SI_ORDER_FIRST);
>>>> >> >> DECLARE_MODULE(bar, SI_SUB_DRIVERS, SI_ORDER_SECOND);
>>>> >> >>
>>>> >> >> The SI_* constants ensure that foo's module handler is called
>>>> >> >> before bar's module handler for MOD_LOAD.  However, we don't
>>>> >> >> enforce a reverse order (bar then foo) for MOD_UNLOAD.  In fact,
>>>> >> >> the order of MOD_UNLOAD events is random and has no relation to
>>>> >> >> the SI_* constants. :(
>>>> >> >>
>>>> >> >> What is happening here is that one of the 'bar' modules in
>>>> >> >> libiconv.ko is getting unloaded after 'foo' gets unloaded and
>>>> >> >> using a destroyed lock (you get a panic if you run with
>>>> >> >> INVARIANTS).
>>>> >> >
>>>> >> > So this should now be fixed with this commit.  If you could verify
>>>> >> > that iconv works ok with the latest kern_module.c I would
>>>> >> > appreciate it.
>>>> >> >
>>>> >> > Author: jhb
>>>> >> > Date: Fri Dec  5 16:47:30 2008
>>>> >> > New Revision: 185642
>>>> >> > URL: http://svn.freebsd.org/changeset/base/185642
>>>> >> >
>>>> >> > Log:
>>>> >> >   When the SYSINIT() to load a module invokes the MOD_LOAD event
>>>> >> >   successfully, move that module to the head of the associated
>>>> >> >   linker file's list of modules.  The end result is that once all
>>>> >> >   the modules are loaded, they are sorted in the reverse of their
>>>> >> >   load order.  This causes the kernel linker to invoke the
>>>> >> >   MOD_QUIESCE and MOD_UNLOAD events in the reverse of the order
>>>> >> >   that MOD_LOAD was invoked.  This means that the ordering of
>>>> >> >   MOD_LOAD events that is set by the SI_* parameters to
>>>> >> >   DECLARE_MODULE() are now honored in the same order they would be
>>>> >> >   for SYSUNINIT() for the MOD_QUIESCE and MOD_UNLOAD events.
>>>> >> >
>>>> >> >   MFC after:    1 month
>>>> >> >
>>>> >> > --
>>>> >> > John Baldwin
>>>> >> >
>>>> >>
>>>> >> Yes it works, I tried hard multiple times kldload/kldunload
>>>> >> {libiconv,cd9660,cd9660_iconv in various order} to livelock/panic it,
>>>> >> but without success.
>>>> >>
>>>> >> FYI following LORs happened:
>>>> >>
>>>> >> lock order reversal:
>>>> >>  1st 0xc4322ce8 isofs (isofs) @ /usr/src/sys/kern/vfs_lookup.c:442
>>>> >>  2nd 0xd7d8d740 bufwait (bufwait) @ /usr/src/sys/kern/vfs_bio.c:2443
>>>> >>  3rd 0xc4322bdc isofs (isofs) @ /usr/src/sys/modules/cd9660/../../fs/cd9660/cd9660_vfsops.c:694
>>>> >
>>>> > This LOR should be addressed in the latest cd9660 locking patches.
>>>> >
>>>> > --
>>>> > John Baldwin
>>>> >
>>>>
>>>> Oh, why did I not check the new version?
>>>>
>>>> Yes, that LOR is gone, but when doing "ll -R" for the first time on /mnt
>>>> I got the following message from the kernel:
>>>>
>>>> RRIP without PX field?             x ~ 50 times.
>>>>
>>>> I see you changed LK_EXCLUSIVE to flags, and with MPSAFE ....
>>>
>>> The RRIP stuff is all done in cd9660_vget_internal() under an exclusive
>>> lock.  It could be a property of the ISO image.  "PX" holds permissions
>>> (owner, etc.).  Do you get the same messages w/o the patch with the same
>>> ISO image / CD?
>>>
>>> --
>>> John Baldwin
>>>
>>
>> No I do not, but I may try other CDs I have many of them, including
>> FreeBSD

The FreeBSD snapshot CD has the same "issue".
It doesn't always appear (not counting cases served from the vfs cache);
maybe disabling SMP and PREEMPTION will silence it.

> s/Yes I do. It's too late here ...
>
>> one.
>> If it is irrelevant then it should not be displayed.

-- 
Paul
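
For reference, the reverse-order unload described in the r185642 log quoted
above can be sketched in plain userland C.  This is only an illustration
under assumptions, not the kern_module.c code: struct mod, struct modlist
and the modlist_*() functions are hypothetical names made up for this
sketch.  The idea shown is the one from the log: each time MOD_LOAD
succeeds, the module is moved to the head of its list, so a walk from the
head visits modules in the reverse of their load order.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-in for a kernel module record. */
struct mod {
	const char *name;
	struct mod *next;
};

/* Hypothetical stand-in for a linker file's list of modules. */
struct modlist {
	struct mod *head;
};

/* Register a module at the tail; registration order is arbitrary. */
static void
modlist_register(struct modlist *l, struct mod *m)
{
	struct mod **pp;

	assert(l != NULL && m != NULL);
	for (pp = &l->head; *pp != NULL; pp = &(*pp)->next)
		;
	m->next = NULL;
	*pp = m;
}

/* Called after a successful MOD_LOAD: move the module to the head. */
static void
modlist_loaded(struct modlist *l, struct mod *m)
{
	struct mod **pp;

	assert(l != NULL && m != NULL);
	/* Unlink m if it is already on the list. */
	for (pp = &l->head; *pp != NULL; pp = &(*pp)->next) {
		if (*pp == m) {
			*pp = m->next;
			break;
		}
	}
	/* Reinsert at the head, so the most recently loaded is first. */
	m->next = l->head;
	l->head = m;
}

/* Walk from the head; this is the order unload events would be sent. */
static size_t
modlist_unload_order(const struct modlist *l, const char **out, size_t max)
{
	const struct mod *m;
	size_t n = 0;

	assert(l != NULL && out != NULL);
	for (m = l->head; m != NULL && n < max; m = m->next)
		out[n++] = m->name;
	return (n);
}
```

With two modules where foo is loaded first and bar second (as with
SI_ORDER_FIRST and SI_ORDER_SECOND in the example above), calling
modlist_loaded() for foo and then for bar leaves bar at the head, so the
unload walk visits bar before foo regardless of how they were registered.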


