From owner-freebsd-arch@FreeBSD.ORG Thu Oct 30 18:10:55 2014 Return-Path: Delivered-To: freebsd-arch@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by hub.freebsd.org (Postfix) with ESMTPS id B67321ED; Thu, 30 Oct 2014 18:10:55 +0000 (UTC) Received: from nibbler.fubar.geek.nz (nibbler.fubar.geek.nz [199.48.134.198]) by mx1.freebsd.org (Postfix) with ESMTP id 9684BE14; Thu, 30 Oct 2014 18:10:55 +0000 (UTC) Received: from bender.lan (97e078e7.skybroadband.com [151.224.120.231]) by nibbler.fubar.geek.nz (Postfix) with ESMTPSA id 95F815CC08; Thu, 30 Oct 2014 18:10:53 +0000 (UTC) Date: Thu, 30 Oct 2014 18:10:48 +0000 From: Andrew Turner To: John Baldwin Subject: Re: atomic ops Message-ID: <20141030181048.4cbeeec6@bender.lan> In-Reply-To: <201410291335.57919.jhb@freebsd.org> References: <20141028025222.GA19223@dft-labs.eu> <201410291059.16829.jhb@freebsd.org> <1414601895.17308.89.camel@revolution.hippie.lan> <201410291335.57919.jhb@freebsd.org> MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Cc: Adrian Chadd , Mateusz Guzik , Ian Lepore , Alan Cox , attilio@freebsd.org, Konstantin Belousov , freebsd-arch@freebsd.org X-BeenThere: freebsd-arch@freebsd.org X-Mailman-Version: 2.1.18-1 Precedence: list List-Id: Discussion related to FreeBSD architecture List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 30 Oct 2014 18:10:55 -0000 On Wed, 29 Oct 2014 13:35:57 -0400 John Baldwin wrote: > On Wednesday, October 29, 2014 12:58:15 pm Ian Lepore wrote: > > Next, when we consider 'Access A' I'm not sure it's true that the > > access will replay if the store-exclusive fails and the operation > > loops. The access to A may have been a prefetch, even a prefetch > > for data on a predicted upcoming execution branch which may or may > > not end up being taken. > > > > I think the only think that makes an ldrex/strex sequence safe for > > use in implementing synchronization primitives is to insert a 'dmb' > > after the acquire loop (after the strex succeeds), and 'dsb' before > > the release loop (dsb is required for SMP, dmb might be good enough > > on UP). > > > > Looking into this has made me realize our current armv6/7 atomics > > are incorrect in this regard. Guess I'll see about fixing them up > > Real Soon Now. :) > > I'm not actually sure either, but it would be surprising to me > otherwise. Presumably there is nothing magic about a branch. Either > the load-acquire is an acquire barrier or it isn't. Namely, suppose > you had this sequence: > > load-acquire P > access A (prefetch) > load-acquire Q > load A > > Would you expect the prefetch to satisfy the load or should the > load-acquire on Q discard that? Having a branch after a failing > conditional store back to the load acquire should work similarly. It > has to discard anything that was prefetched or it isn't an actual > load-acquire. I have checked with someone in ARM. The prefetch should not be considered an access with regard to the barrier and it could be moved before it as it will only load data into the cache. The barrier only deals with loading data into the core, i.e. if it has was part of the prefetch it will be loaded from the cache no earlier than the load-acquire. The cache coherency protocol ensures the data will be up to date while the barrier will ensure the ordering of the load of A. In the above example the prefetch of A will not be thrown away but the data in the cache may change between the prefetch and load A if another core has written to A. If this is the case the load will be of the new data. Andrew