From owner-svn-src-all@FreeBSD.ORG Sun Nov 2 16:37:36 2014 Return-Path: Delivered-To: svn-src-all@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [8.8.178.115]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by hub.freebsd.org (Postfix) with ESMTPS id 8E38BEF6; Sun, 2 Nov 2014 16:37:36 +0000 (UTC) Received: from kib.kiev.ua (kib.kiev.ua [IPv6:2001:470:d5e7:1::1]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id 2F5A0883; Sun, 2 Nov 2014 16:37:36 +0000 (UTC) Received: from tom.home (kostik@localhost [127.0.0.1]) by kib.kiev.ua (8.14.9/8.14.9) with ESMTP id sA2GbTJV078317 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Sun, 2 Nov 2014 18:37:29 +0200 (EET) (envelope-from kostikbel@gmail.com) DKIM-Filter: OpenDKIM Filter v2.9.2 kib.kiev.ua sA2GbTJV078317 Received: (from kostik@localhost) by tom.home (8.14.9/8.14.9/Submit) id sA2GbSNo078316; Sun, 2 Nov 2014 18:37:28 +0200 (EET) (envelope-from kostikbel@gmail.com) X-Authentication-Warning: tom.home: kostik set sender to kostikbel@gmail.com using -f Date: Sun, 2 Nov 2014 18:37:28 +0200 From: Konstantin Belousov To: Attilio Rao Subject: Re: svn commit: r273966 - in head: share/man/man9 sys/kern sys/sys Message-ID: <20141102163728.GX53947@kib.kiev.ua> References: <201411021310.sA2DAWmD003298@svn.freebsd.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.5.23 (2014-03-12) X-Spam-Status: No, score=-2.0 required=5.0 tests=ALL_TRUSTED,BAYES_00, DKIM_ADSP_CUSTOM_MED,FREEMAIL_FROM,NML_ADSP_CUSTOM_MED autolearn=no autolearn_force=no version=3.4.0 X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on tom.home Cc: "svn-src-head@freebsd.org" , "svn-src-all@freebsd.org" , "src-committers@freebsd.org" X-BeenThere: svn-src-all@freebsd.org X-Mailman-Version: 2.1.18-1 Precedence: list List-Id: "SVN commit messages for the entire src tree \(except for " user" and " projects" \)" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 02 Nov 2014 16:37:36 -0000 On Sun, Nov 02, 2014 at 04:54:46PM +0100, Attilio Rao wrote: > On Sun, Nov 2, 2014 at 2:10 PM, Konstantin Belousov wrote: > > Author: kib > > Date: Sun Nov 2 13:10:31 2014 > > New Revision: 273966 > > URL: https://svnweb.freebsd.org/changeset/base/273966 > > > > Log: > > Fix two issues with lockmgr(9) LK_CAN_SHARE() test, which determines > > whether the shared request for already shared-locked lock could be > > granted. Both problems result in the exclusive locker starvation. > > > > The concurrent exclusive request is indicated by either > > LK_EXCLUSIVE_WAITERS or LK_EXCLUSIVE_SPINNERS flags. The reverse > > condition, i.e. no exclusive waiters, must check that both flags are > > cleared. > > > > Add a flag LK_NODDLKTREAT for shared lock request to indicate that > > current thread guarantees that it does not own the lock in shared > > mode. This turns back the exclusive lock starvation avoidance code; > > see man page update for detailed description. > > > > Use LK_NODDLKTREAT when doing lookup(9). > > The right way to implement this (selectively disabling writer > starvation avoidance) must be on a lock-basis. > So you need a new flag to pass to lockinit(). This is to support it "globaly". Any vnode (except some very special) does participate in lookup. So the proposed new flag has to be passed to every lockmgr init call. Disabling exclusive starvation support for all vnode lock calls is also wrong. > Then, you can pass it on a lockmgr() instance basis (like FreeBSD also > passes, for example, LK_NOWITNESS). > You can use it during lookup() calls. Maybe you will need to propagate > it via the vn_lock() interface. Well, if you indeed looked at the patch, you would note that this is exactly what is done. The flag is passed only to vn_lock() calls which are coming from lookup (VOP_LOOKUP as well). The flag use is safe since deadlock prevention by td_lk_slocks only matters when the same lock was previously taken by the same thread which does recursive shared request [*]. td_lk_slocks is the poor and very indiscriminative way to express this, but might be unavoidable since shared owners are not recorded. * I believe there are also situation (like vnode locking in page fault) where it matters in way similar to TDP_DEADLKTREAT without flag being set, but lookup() cannot expose them. Now, thread calling namei() cannot own shared lock on the vnode it tries to lock, since such code will cause deadlock for many reason. As such, td_lk_slocks counter is not relevant for the vn_lock() calls from lookup. More, when vn_lock() call is performed, the counter is always >= 1, due to the lookup already owning the lock for the different (parent) vnode. The last fact means that the exclusive starvation happen. > > The patch will be bigger but much cleaner and more correct than what > is in head now. I do not understand what would change except adding useless flag to init.