From nobody Tue May 18 16:48:15 2021
X-Original-To: arch@mlmmj.nyi.freebsd.org
Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1])
	by mlmmj.nyi.freebsd.org (Postfix) with ESMTP id 1089A550AA0
	for <arch@mlmmj.nyi.freebsd.org>; Tue, 18 May 2021 16:48:24 +0000 (UTC)
	(envelope-from kostikbel@gmail.com)
Received: from kib.kiev.ua (kib.kiev.ua [IPv6:2001:470:d5e7:1::1])
	(using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)
	 key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256)
	(Client did not present a certificate)
	by mx1.freebsd.org (Postfix) with ESMTPS id 4Fl24l2VVcz3JKC;
	Tue, 18 May 2021 16:48:23 +0000 (UTC)
	(envelope-from kostikbel@gmail.com)
Received: from tom.home (kib@localhost [127.0.0.1])
	by kib.kiev.ua (8.16.1/8.16.1) with ESMTPS id 14IGmFcr028419
	(version=TLSv1.3 cipher=TLS_AES_256_GCM_SHA384 bits=256 verify=NO);
	Tue, 18 May 2021 19:48:18 +0300 (EEST)
	(envelope-from kostikbel@gmail.com)
DKIM-Filter: OpenDKIM Filter v2.10.3 kib.kiev.ua 14IGmFcr028419
Received: (from kostik@localhost)
	by tom.home (8.16.1/8.16.1/Submit) id 14IGmFKk028418;
	Tue, 18 May 2021 19:48:15 +0300 (EEST)
	(envelope-from kostikbel@gmail.com)
X-Authentication-Warning: tom.home: kostik set sender to kostikbel@gmail.com using -f
Date: Tue, 18 May 2021 19:48:15 +0300
From: Konstantin Belousov <kostikbel@gmail.com>
To: Andriy Gapon <avg@freebsd.org>
Cc: Mateusz Guzik <mjguzik@gmail.com>, arch@freebsd.org
Subject: Re: adaptive spinning: fall back to sleep / block?
Message-ID: <YKPvz0hld2AMeYh8@kib.kiev.ua>
References: <202102251856.11PIuxwF020948@gitrepo.freebsd.org>
 <19884f0f-115d-a60c-2ef2-72400f96f8a7@uabsd.com>
 <YGSIlMm14v2xAOoM@brick>
 <cc245a54-1cdd-370d-483e-c659da8c31e0@FreeBSD.org>
 <CAGudoHEDzkSOaG8D24ywoi_68Ka_zFj4SOz73GjuX02LsZ9H4g@mail.gmail.com>
 <a57f5d46-3a36-2322-680a-4106f6a92d4a@FreeBSD.org>
List-Id: Discussion related to FreeBSD architecture <freebsd-arch.freebsd.org>
List-Archive: http://lists.freebsd.org/arch
List-Help: <mailto:arch+help@freebsd.org>
List-Post: <mailto:arch@freebsd.org>
List-Subscribe: <mailto:arch+subscribe@freebsd.org>
List-Unsubscribe: <mailto:arch+unsubscribe@freebsd.org>
Sender: owner-freebsd-arch@freebsd.org
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
In-Reply-To: <a57f5d46-3a36-2322-680a-4106f6a92d4a@FreeBSD.org>
X-Spam-Status: No, score=-1.0 required=5.0 tests=ALL_TRUSTED,BAYES_00,
	DKIM_ADSP_CUSTOM_MED,FORGED_GMAIL_RCVD,FREEMAIL_FROM,
	NML_ADSP_CUSTOM_MED autolearn=no autolearn_force=no version=3.4.4
X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on tom.home
X-Rspamd-Queue-Id: 4Fl24l2VVcz3JKC
X-Spamd-Bar: +
Authentication-Results: mx1.freebsd.org;
	dkim=none;
	dmarc=fail reason="No valid SPF, No valid DKIM" header.from=gmail.com (policy=none);
	spf=softfail (mx1.freebsd.org: 2001:470:d5e7:1::1 is neither permitted nor denied by domain of kostikbel@gmail.com) smtp.mailfrom=kostikbel@gmail.com
X-Spamd-Result: default: False [2.00 / 15.00];
	 MIME_TRACE(0.00)[0:+];
	 ARC_NA(0.00)[];
	 RCVD_COUNT_TWO(0.00)[2];
	 RCVD_TLS_ALL(0.00)[];
	 FROM_HAS_DN(0.00)[];
	 RCPT_COUNT_THREE(0.00)[3];
	 FREEMAIL_FROM(0.00)[gmail.com];
	 MIME_GOOD(-0.10)[text/plain];
	 HAS_XAW(0.00)[];
	 RBL_DBL_DONT_QUERY_IPS(0.00)[2001:470:d5e7:1::1:from];
	 TO_DN_SOME(0.00)[];
	 R_SPF_SOFTFAIL(0.00)[~all];
	 SPAMHAUS_ZRD(0.00)[2001:470:d5e7:1::1:from:127.0.2.255];
	 DMARC_POLICY_SOFTFAIL(0.10)[gmail.com : No valid SPF, No valid DKIM,none];
	 TO_MATCH_ENVRCPT_SOME(0.00)[];
	 NEURAL_SPAM_MEDIUM(1.00)[1.000];
	 NEURAL_SPAM_LONG(1.00)[1.000];
	 NEURAL_HAM_SHORT(-1.00)[-1.000];
	 FROM_EQ_ENVFROM(0.00)[];
	 SUBJECT_ENDS_QUESTION(1.00)[];
	 FREEMAIL_ENVFROM(0.00)[gmail.com];
	 ASN(0.00)[asn:6939, ipnet:2001:470::/32, country:US];
	 R_DKIM_NA(0.00)[];
	 MAILMAN_DEST(0.00)[arch];
	 FREEMAIL_CC(0.00)[gmail.com,freebsd.org]
X-Spam: Yes

On Mon, May 17, 2021 at 02:44:34PM +0300, Andriy Gapon wrote:
> On 17/05/2021 13:23, Mateusz Guzik wrote:
> > On 4/9/21, Andriy Gapon <avg@freebsd.org> wrote:
> > > 
> > > 
> > > Until I recently looked at the actual code I was under an impression that
> > > the adaptive spinning is bounded and that after some time / number of spins
> > > a
> > > thread would go to a sleep queue or a turnstile.  But it looks that the
> > > spinning
> > > is actually unbounded as long as its conditions hold (some other thread owns
> > > the
> > > lock and that thread is running, the owner could be changing too).
> > > 
> > > In my opinion, it does not make sense to spin for "too long".
> > > If there was not an opportunity to take a lock quickly, then it's better to
> > > block waiting for it rather than keep occupying a processor.  For instance,
> > > the
> > > spinning can prevent another runnable thread from running.
> > > 
> > > I think that if a lock is heavily contended or its hold times are on the
> > > longer
> > > side (or both), then the adaptive spinning can make the system behavior
> > > (performance, responsiveness) worse.
> > > 
> > > Finally, I was under an impression that 'adaptive' meant some heuristic on
> > > whether and when to do the spinning.  _A lock owner is running_ seems to be
> > > too
> > > simple to qualify as 'adaptive'.
> > > 
> > > As an example, this looks like a relatively sophisticated implementation of
> > > the
> > > "adaptiveness":
> > > http://hg.openjdk.java.net/jdk8/jdk8/hotspot/file/87ee5ee27509/src/share/vm/runtime/objectMonitor.cpp#l1919
> > > But, JIMHO, simply having a hard limit on the spin count would be better
> > > than
> > > what we have now.
> > > 
> > 
> > There is no clear cut answer to this that I'm aware of. Ultimately all
> > behavior in face of contention is about damage control.
> > 
> > It's not hard to make a counter point to going off cpu after a timeout:
> > 1. going off cpu is also a serializing operation, that is threads can
> > content on doing so, albeit less than on the original lock
> > 2. existence of blocked waiters makes it more expensive to release the
> > lock, making contention worse and even then it is unclear what wake up
> > policy you would like to enact (e.g., do you hand off the lock to the
> > oldest sleeper? do you wake everyone and let them fight it out? all
> > choices suffer problems)
> > 3. consider a thread which holds foo_lock and now contends on bar_lock
> > and decides to go off cpu. if there is a thread which waits on
> > foo_lock, it will also go off cpu. This kind of behavior easily leads
> > to dramatic collapse in throughput, even on top of whatever problems
> > which are present due to contention to begin with.
> > 
> > Now, locking primitives in the FreeBSD kernel are problematic in at
> > least 3 ways:
> > 1. there are no fairness guarantees, for example a constant stream of
> > threads can end up excluding some threads for a long time
> > 2. there is no support for locking under kvm (as in the kernel does
> > not take advantage of it)
> > 3. rw locking is using cas loops instead of add
> > 
> > imo, if doing anything serious with locks, the 3 above problems needs
> > to be solved, in that order.
> > 
> > I can't stress enough the lack of fairness, which arbitrary going to
> > sleep will only exacerbate. As noted earlier, I don't know if timeout
> > sleep is an inherently good or bad idea, but I'm confident playing
> > with it in the situation is on the bad side.
> 
> I agree with you.  And it looks like we do not disagree in general
> .
> I just want to make a clarification, maybe redundant, that you seem to be
> mostly concerned about effects on threads that contend on same locks or
> related lock chains.  I am more concerned about effects on completely
> unrelated threads.
> 
> Yes, sleeping and waking introduces overhead (and not only) on threads doing
> them and threads depending on those threads.  But spinning, when it happens
> on a large share of CPUs (e.g., ~ 100% of them), introduces a penalty on the
> whole system.
> 
> At work, where a substantial part of our application lives in kernel, I
> regularly debug issues that seem to happen because of CPUs starvation.  And
> a common theme between them is that many CPUs are tied in the lock spinning.
> FreeBSD as a general purpose OS does not seem to suffer from the same issues
> (at least, that I know of), but we have quite a lot of additional kernel
> threads and kernel locks.
> 
> So, maybe instead of trying to experiment with FreeBSD I should try to
> experiment with our derivative product first.
> 
> Thank you for the feedback and additional information.

BTW, the unfairness and scheduling randomness are quite fundamental
for locking to work.  Minor scheduling inconsistencies and timing drifts
due to external events and internal machine operations allow to avoid
lock convoys, which would otherwise plague us.

The priorities mechanism is the partial compensation for lack of fairness.
Critical sections give you the highest priority execution, factually.
They will probably become more common as more lock-less algorithms start
appear in the kernel.

I saw some references that Windows tried to use the completely fair queued
spinlocks, and either limited their use to very specific places, or get
rid of them.  They were never advertised as generic facility.

Also, I know that Solaris experimented with stuff like direct transfer of
the lock from unlocking thread to (some) lock contender.  They were very
much dissatisfied with the results.