Skip site navigation (1)Skip section navigation (2)
Date:      Sun, 22 Sep 2013 13:55:13 -0700
From:      Matthew Fleming <mdf@FreeBSD.org>
To:        Attilio Rao <attilio@freebsd.org>
Cc:        Konstantin Belousov <kostikbel@gmail.com>, "svn-src-head@freebsd.org" <svn-src-head@freebsd.org>, "svn-src-all@freebsd.org" <svn-src-all@freebsd.org>, "src-committers@freebsd.org" <src-committers@freebsd.org>
Subject:   Re: svn commit: r255797 - head/sys/kern
Message-ID:  <CAMBSHm9O-5qdaW%2B2k9mmjn3gJw8kVgyobeUZZ=1xHzqrDWLRtg@mail.gmail.com>
In-Reply-To: <CAJ-FndDHC03f==%2B_N_m2gQ==LKF9LNs=Jyf-nYjFyikuoum7Sg@mail.gmail.com>
References:  <201309221923.r8MJNm3u021657@svn.freebsd.org> <CAMBSHm_RYzVVm7cEEqntfChgC%2B2sC6CEonZgLob-nRKCUoLmQg@mail.gmail.com> <20130922201916.GL41229@kib.kiev.ua> <20130922203426.GM41229@kib.kiev.ua> <CAJ-FndDHC03f==%2B_N_m2gQ==LKF9LNs=Jyf-nYjFyikuoum7Sg@mail.gmail.com>

next in thread | previous in thread | raw e-mail | index | archive | help
On Sun, Sep 22, 2013 at 1:49 PM, Attilio Rao <attilio@freebsd.org> wrote:

> On Sun, Sep 22, 2013 at 9:34 PM, Konstantin Belousov
> <kostikbel@gmail.com> wrote:
> > On Sun, Sep 22, 2013 at 11:19:16PM +0300, Konstantin Belousov wrote:
> >> On Sun, Sep 22, 2013 at 01:14:21PM -0700, Matthew Fleming wrote:
> >> > On Sun, Sep 22, 2013 at 12:23 PM, Konstantin Belousov <
> kib@freebsd.org>wrote:
> >> >> > > Author: kib
> >> > > Date: Sun Sep 22 19:23:48 2013
> >> > > New Revision: 255797
> >> > > URL: http://svnweb.freebsd.org/changeset/base/255797
> >> > >
> >> > > Log:
> >> > >   Increase the chance of the buffer write from the bufdaemon helper
> >> > >   context to succeed.  If the locked vnode which owns the buffer to
> be
> >> > >   written is shared locked, try the non-blocking upgrade of the
> lock to
> >> > >   exclusive.
> >> > >
> >> > >   PR:   kern/178997
> >> > >   Reported and tested by:       Klaus Weber <
> >> > > fbsd-bugs-2013-1@unix-admin.de>
> >> > >   Sponsored by: The FreeBSD Foundation
> >> > >   MFC after:    1 week
> >> > >   Approved by:  re (marius)
> >> > >
> >> > > Modified:
> >> > >   head/sys/kern/vfs_bio.c
> >> > >
> >> > > Modified: head/sys/kern/vfs_bio.c
> >> > >
> >> > >
> ==============================================================================
> >> > > --- head/sys/kern/vfs_bio.c     Sun Sep 22 19:15:24 2013
>  (r255796)
> >> > > +++ head/sys/kern/vfs_bio.c     Sun Sep 22 19:23:48 2013
>  (r255797)
> >> > > @@ -2624,6 +2624,8 @@ flushbufqueues(struct vnode *lvp, int ta
> >> > >         int hasdeps;
> >> > >         int flushed;
> >> > >         int queue;
> >> > > +       int error;
> >> > > +       bool unlock;
> >> > >
> >> > >         flushed = 0;
> >> > >         queue = QUEUE_DIRTY;
> >> > > @@ -2699,7 +2701,16 @@ flushbufqueues(struct vnode *lvp, int ta
> >> > >                         BUF_UNLOCK(bp);
> >> > >                         continue;
> >> > >                 }
> >> > > -               if (vn_lock(vp, LK_EXCLUSIVE | LK_NOWAIT |
> LK_CANRECURSE)
> >> > > == 0) {
> >> > > +               if (lvp == NULL) {
> >> > > +                       unlock = true;
> >> > > +                       error = vn_lock(vp, LK_EXCLUSIVE |
> LK_NOWAIT);
> >> > > +               } else {
> >> > > +                       ASSERT_VOP_LOCKED(vp, "getbuf");
> >> > > +                       unlock = false;
> >> > > +                       error = VOP_ISLOCKED(vp) == LK_EXCLUSIVE ?
> 0 :
> >> > > +                           vn_lock(vp, LK_UPGRADE | LK_NOWAIT);
> >> > >
> >> >
> >> > I don't think this is quite right.
> >> >
> >> > When the lock is held shared, and VOP_LOCK is implemented by
> lockmgr(9),
> >> > (i.e. all in-tree filesystems?), LK_UPGRADE may drop the lock, and not
> >> > reacquire it.  This would happen when the vnode is locked shared, the
> >> > upgrade fails (2 shared owners), then lockmgr(9) will try to lock EX,
> which
> >> > will also fail (still one shared owner).  The caller's lock is no
> longer
> >> > held.
> >> >
> >> > Doesn't that scenario (LK_UPGRADE failing) cause problems both for the
> >> > caller (unexpected unlock) and for flushbufqueues(), which expects the
> >> > vnode lock to be held since lvp is non-NULL?
> >>
> >> Does it ? If the lock is dropped, the code is indeed in trouble.
> >> Please note that LK_NOWAIT is specified for upgrade, and I believe
> >> that this causes lockmgr to return with EBUSY without dropping
> >> the lock.
> >
> > Yes, you are right, I reverted the patch.  Thank you for noting this.
> >
> > I am bitten by unreasonable behaviour of non-blocking upgrade once more.
> > It has a history.
> >
> > Some time ago I proposed the following patch, which was turned down.
> > That time, I was able to work-around the case. For the bufdaemon helper,
> > I do not see any way to avoid this, except of sometimes locking the
> > reader vnode exclusive in anticipation of the too high dirty buffer
> > mark.
>
> If you are speaking about me, you are mistaken, I never turned out this
> patch.
> What I said is completely different: I said that LK_UPGRADE is a
> completely wrong semantic because it can hide wrong things like the
> one you hit today.
> I wanted to see it removed and replaced by explicit LK_RELEASE +
> LK_EXCLUSIVE operations.
> Note that this would have avoided this patch.
>
> I'm completely in favour of LK_TRYUPGRADE.


I agree, here a LK_TRYUPGRADE semantic looks like what is needed.

Cheers,
matthew



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?CAMBSHm9O-5qdaW%2B2k9mmjn3gJw8kVgyobeUZZ=1xHzqrDWLRtg>