Date: Tue, 18 Mar 2014 19:38:03 +0000
From: "Meyer, Conrad" <conrad.meyer@isilon.com>
To: John Baldwin <jhb@freebsd.org>, "freebsd-hackers@freebsd.org" <freebsd-hackers@freebsd.org>
Cc: Bryan Drewery <bdrewery@freebsd.org>
Subject: RE: [PATCH] amd64/pcpu.h: Use Clang builtins for clarity when referencing thread's pcpu
Message-ID: <A3CAF0E84A34A540B4C74454358E003F1292F043@MX103CL02.corp.emc.com>
In-Reply-To: <201403181452.13685.jhb@freebsd.org>
References: <1394821826-19412-1-git-send-email-conrad.meyer@isilon.com>, <201403181452.13685.jhb@freebsd.org>

From: John Baldwin [jhb@freebsd.org]
Sent: Tuesday, March 18, 2014 11:52 AM
To: freebsd-hackers@freebsd.org
Cc: Meyer, Conrad; Bryan Drewery
Subject: Re: [PATCH] amd64/pcpu.h: Use Clang builtins for clarity when referencing thread's pcpu

> > - How atomic does PCPU_INC() need to be? It looks like it updates cpu-local
> >   counters; as long as it's a single asm instruction, should it be fine
> >   w.r.t. interrupts? The existing implementation does NOT use the 'lock; ' prefix.
>
> I think a single instruction is fine.

Unfortunately, I'm seeing crashes under stress in internal testing that I can't attribute to anything else. I'm not sure why Clang would generate more than one instruction for any of this, but I haven't probed too deeply.

> > diff --git a/sys/amd64/include/pcpu.h b/sys/amd64/include/pcpu.h
> > index fe898e9..68892fc 100644
> > --- a/sys/amd64/include/pcpu.h
> > +++ b/sys/amd64/include/pcpu.h
> > +#define curthread __extension__ ({ \
> > + *((volatile __pcpu_type(pc_curthread) __GS_RELATIVE *) \
> > + __pcpu_offset(pc_curthread)); \
> > +})
>
> Would be nice to not lose the __pure2 attribute for curthread (you
> might need it to still be an inline function to keep that)

Yeah, I think you would need it to be a function for that. As is I don't think Clang has any reason to optimize away redundant loads from a volatile pointer.

Thanks,
Conrad
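
The trade-off discussed above (a volatile macro versus an inline function that keeps __pure2) can be sketched in userspace C. This is only a rough model under stated assumptions: fake_pcpu, thread0, CURTHREAD_VOLATILE(), curthread_const() and loads_agree() are hypothetical names, an ordinary static struct stands in for the %gs-relative per-CPU area, and __attribute__((__const__)) is written out where the kernel would use __pure2.

```c
#include <stddef.h>

/* Hypothetical stand-ins for the kernel's struct thread and struct pcpu. */
struct thread {
	int td_tid;
};

struct pcpu {
	struct thread *pc_curthread;
};

/* Assumption: a plain static object models the %gs-relative per-CPU area. */
static struct thread thread0 = { .td_tid = 100 };
static struct pcpu fake_pcpu = { .pc_curthread = &thread0 };

/*
 * Macro form, similar in spirit to the patch: each use is a volatile
 * read, so the compiler must emit a separate load every time.
 */
#define CURTHREAD_VOLATILE() \
	(*(struct thread * volatile *)&fake_pcpu.pc_curthread)

/*
 * Function form: the const attribute tells the compiler the result does
 * not change between calls, so repeated calls may be folded into one.
 */
static inline __attribute__((__const__)) struct thread *
curthread_const(void)
{
	return (fake_pcpu.pc_curthread);
}

/* Both forms return the same pointer; only the number of loads may differ. */
static int
loads_agree(void)
{
	struct thread *a = CURTHREAD_VOLATILE();
	struct thread *b = CURTHREAD_VOLATILE();

	return (a == b && a == curthread_const());
}
```

In this model the two reads through CURTHREAD_VOLATILE() cannot be merged, while repeated curthread_const() calls can be, which is the optimization the __pure2 comment is concerned about losing.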