From: Matthew Fleming
To: Attilio Rao
Cc: Konstantin Belousov, "svn-src-head@freebsd.org", "svn-src-all@freebsd.org", "src-committers@freebsd.org"
Date: Sun, 22 Sep 2013 13:55:13 -0700
Subject: Re: svn commit: r255797 - head/sys/kern
References: <201309221923.r8MJNm3u021657@svn.freebsd.org> <20130922201916.GL41229@kib.kiev.ua> <20130922203426.GM41229@kib.kiev.ua>
List-Id: SVN commit messages for the src tree for head/-current

On Sun, Sep 22, 2013 at 1:49 PM, Attilio Rao wrote:
> On Sun, Sep 22, 2013 at 9:34 PM, Konstantin Belousov wrote:
> > On Sun, Sep 22, 2013 at 11:19:16PM +0300, Konstantin Belousov wrote:
> >> On Sun, Sep 22, 2013 at 01:14:21PM -0700, Matthew Fleming wrote:
> >> > On Sun, Sep 22, 2013 at 12:23 PM, Konstantin Belousov <kib@freebsd.org> wrote:
> >> > >
> >> > > Author: kib
> >> > > Date: Sun Sep 22 19:23:48 2013
> >> > > New Revision: 255797
> >> > > URL: http://svnweb.freebsd.org/changeset/base/255797
> >> > >
> >> > > Log:
> >> > >   Increase the chance of the buffer write from the bufdaemon helper
> >> > >   context to succeed.  If the locked vnode which owns the buffer to be
> >> > >   written is shared locked, try the non-blocking upgrade of the lock to
> >> > >   exclusive.
> >> > >
> >> > > PR: kern/178997
> >> > > Reported and tested by: Klaus Weber <fbsd-bugs-2013-1@unix-admin.de>
> >> > > Sponsored by: The FreeBSD Foundation
> >> > > MFC after: 1 week
> >> > > Approved by: re (marius)
> >> > >
> >> > > Modified:
> >> > >   head/sys/kern/vfs_bio.c
> >> > >
> >> > > Modified: head/sys/kern/vfs_bio.c
> >> > > ==============================================================================
> >> > > --- head/sys/kern/vfs_bio.c  Sun Sep 22 19:15:24 2013  (r255796)
> >> > > +++ head/sys/kern/vfs_bio.c  Sun Sep 22 19:23:48 2013  (r255797)
> >> > > @@ -2624,6 +2624,8 @@ flushbufqueues(struct vnode *lvp, int ta
> >> > >          int hasdeps;
> >> > >          int flushed;
> >> > >          int queue;
> >> > > +        int error;
> >> > > +        bool unlock;
> >> > >
> >> > >          flushed = 0;
> >> > >          queue = QUEUE_DIRTY;
> >> > > @@ -2699,7 +2701,16 @@ flushbufqueues(struct vnode *lvp, int ta
> >> > >                          BUF_UNLOCK(bp);
> >> > >                          continue;
> >> > >                  }
> >> > > -                if (vn_lock(vp, LK_EXCLUSIVE | LK_NOWAIT | LK_CANRECURSE) == 0) {
> >> > > +                if (lvp == NULL) {
> >> > > +                        unlock = true;
> >> > > +                        error = vn_lock(vp, LK_EXCLUSIVE | LK_NOWAIT);
> >> > > +                } else {
> >> > > +                        ASSERT_VOP_LOCKED(vp, "getbuf");
> >> > > +                        unlock = false;
> >> > > +                        error = VOP_ISLOCKED(vp) == LK_EXCLUSIVE ? 0 :
> >> > > +                            vn_lock(vp, LK_UPGRADE | LK_NOWAIT);
> >> > >
> >> >
> >> > I don't think this is quite right.
> >> >
> >> > When the lock is held shared, and VOP_LOCK is implemented by lockmgr(9),
> >> > (i.e. all in-tree filesystems?), LK_UPGRADE may drop the lock, and not
> >> > reacquire it.  This would happen when the vnode is locked shared, the
> >> > upgrade fails (2 shared owners), then lockmgr(9) will try to lock EX, which
> >> > will also fail (still one shared owner).  The caller's lock is no longer
> >> > held.
> >> >
> >> > Doesn't that scenario (LK_UPGRADE failing) cause problems both for the
> >> > caller (unexpected unlock) and for flushbufqueues(), which expects the
> >> > vnode lock to be held since lvp is non-NULL?
> >>
> >> Does it ?  If the lock is dropped, the code is indeed in trouble.
> >> Please note that LK_NOWAIT is specified for upgrade, and I believe
> >> that this causes lockmgr to return with EBUSY without dropping
> >> the lock.
> >
> > Yes, you are right, I reverted the patch.  Thank you for noting this.
> >
> > I am bitten by unreasonable behaviour of non-blocking upgrade once more.
> > It has a history.
> >
> > Some time ago I proposed the following patch, which was turned down.
> > That time, I was able to work-around the case.  For the bufdaemon helper,
> > I do not see any way to avoid this, except of sometimes locking the
> > reader vnode exclusive in anticipation of the too high dirty buffer
> > mark.
>
> If you are speaking about me, you are mistaken, I never turned out this patch.
> What I said is completely different: I said that LK_UPGRADE is a
> completely wrong semantic because it can hide wrong things like the
> one you hit today.
> I wanted to see it removed and replaced by explicit LK_RELEASE +
> LK_EXCLUSIVE operations.
> Note that this would have avoided this patch.
>
> I'm completely in favour of LK_TRYUPGRADE.

I agree, here a LK_TRYUPGRADE semantic looks like what is needed.

Cheers,
matthew
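
The difference between the two upgrade behaviours discussed in this thread can be sketched with a toy, single-threaded model. This is not lockmgr(9) code. struct toy_lock, toy_upgrade_nowait() and toy_tryupgrade() are hypothetical names invented for illustration, and the drop-on-failure behaviour is modelled only as it is described above. The "try" variant stands in for the proposed LK_TRYUPGRADE semantic.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>

/* Toy model of a shared/exclusive lock.  Hypothetical; NOT lockmgr(9). */
struct toy_lock {
	int	sharers;	/* number of shared holders */
	bool	exclusive;	/* true when held exclusively */
};

/*
 * Models a non-blocking upgrade as described in the thread: if the
 * caller is not the only sharer, its shared hold is released and the
 * call fails with EBUSY.  The caller must hold the lock shared.
 */
static int
toy_upgrade_nowait(struct toy_lock *lk)
{
	assert(lk->sharers >= 1 && !lk->exclusive);
	if (lk->sharers == 1) {
		/* Sole sharer: convert the hold in place. */
		lk->sharers = 0;
		lk->exclusive = true;
		return (0);
	}
	/* Upgrade failed: the caller's shared hold is dropped. */
	lk->sharers--;
	/* Other sharers remain, so the exclusive attempt fails too. */
	return (EBUSY);
}

/*
 * Models the "try upgrade" semantic favoured in the thread: on failure
 * the lock state is left untouched, so the caller still holds it shared.
 */
static int
toy_tryupgrade(struct toy_lock *lk)
{
	assert(lk->sharers >= 1 && !lk->exclusive);
	if (lk->sharers == 1) {
		lk->sharers = 0;
		lk->exclusive = true;
		return (0);
	}
	return (EBUSY);
}
```

In this model, with two sharers, toy_upgrade_nowait() returns EBUSY and leaves one sharer, so the caller has lost its hold. toy_tryupgrade() returns EBUSY with both sharers still present, which is the property flushbufqueues() would need when lvp is non-NULL.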